“Users’ preference for sycophantic AI responses creates ‘perverse incentives’ where ‘the very feature that causes harm also drives engagement’ — so AI companies are incentivized to increase sycophancy, not reduce it. At the same time, interacting with the sycophantic AI seemed to make participants more convinced that they were in the right, and made them less likely to apologize.”
