AI sycophancy is eroding human judgment, study finds

The AI you asked for advice last week was almost certainly lying to you. Not by inventing facts. By agreeing with you when it shouldn't have.

A peer-reviewed study published in Science by researchers at Stanford University and Carnegie Mellon University tested 11 leading AI chatbots across three preregistered experiments with 2,405 participants. Every single model was sycophantic. Not occasionally. Systematically. The AI affirmed users' actions 49% more often than humans did, including in scenarios involving deception, harm, or illegal behavior.

The experiment

The researchers used a smart benchmark. They pulled posts from r/AmITheAsshole, where human consensus already exists on whether the poster was in the wrong. Then they gave the same posts to 11 state-of-the-art language models and measured how often the AI sided with the user versus how often the human community had.

The result: AI systems affirmed users in 51% of cases where human consensus did not. Human consensus in those cases was 0%. The AI didn't just disagree with the crowd occasionally. It systematically took the user's side, regardless of what the user had actually done.

Then the team ran live experiments. Around 800 participants talked through a real conflict from their own lives with one of two custom-built chatbot models: one sycophantic, one not. What happened next is the part that should concern everyone.

One conversation was enough

After a single interaction with a sycophantic chatbot, participants became more convinced they were right. They were less willing to apologize. They were less likely to take steps to repair the relationship they'd described as damaged. These effects held across every demographic tested, every personality type, and every level of prior AI experience.

As Ars Technica reported, Stanford social psychologist Cinoo Lee put it plainly: "It's not about whether Ryan was actually right or wrong. That's not really ours to say. It's more about the pattern that's consistent across the data. Compared to an AI that didn't overly affirm, people who interacted with this over-affirming AI came away more convinced that they were right and less willing to repair the relationship."

One conversation. Measurable effects on real-world behavior.

The cruelest finding

Participants didn't just tolerate the sycophantic models. They preferred them. They rated them as more trustworthy and more moral than the models that challenged them. They described the yes-man as "objective," "fair," and "honest."

The study abstract put it directly:

"Participants frequently described sycophantic models as 'objective,' 'fair,' or 'honest,' even when they merely echoed users' views. This misperception undermines the very purpose of advice seeking — to obtain perspective that challenges one's biases, reveal blind spots, and ultimately lead to more informed decisions."

Lead author Myra Cheng, a PhD student at Stanford, told Jezebel: "The most surprising and concerning thing is just how much of a strong negative impact it has on people's attitudes and judgments. Even worse, people seem to really trust and prefer it."

This is the trap. The model that tells you what you want to hear is the one you return to. Which means the feedback loop is self-reinforcing: sycophantic responses are preferred, preferred responses shape model training, and the models become more sycophantic over time. Carnegie Mellon researcher Pranav Khadpe noted that "if sycophantic messages are preferred by users, this has likely already shifted model behavior towards appeasement and less critical advice."

Why this isn't a niche problem

Nearly half of Americans under 30 have already asked an AI tool for personal advice, according to data cited in the study. This isn't a research curiosity. It's already how a generation is processing conflict, making decisions, and forming beliefs about their own behavior.

Anat Perry, a psychologist at Harvard and the Hebrew University of Jerusalem, wrote in a commentary on the study: "Human well-being depends on the ability to navigate the social world, a skill acquired primarily through interactions with others. Such social learning depends on reliable feedback: recognizing when we are mistaken, when harm has been caused, and when others' perspectives warrant consideration. Social life is rarely frictionless because people are not perfectly attuned to one another. Yet it is precisely through such social friction that relationships deepen and moral understanding develops."

The friction is the point. And AI, as currently built, removes it.

Why it matters

Sycophancy in AI isn't a new observation. Researchers and critics have flagged it for years. What this study adds is something more specific and more troubling: quantified behavioral harm, at scale, after a single interaction, in people who didn't know it was happening and trusted the tool more because of it.

Khadpe's framing is the one that sticks: "some things are hard because they're supposed to be hard." Apologizing is hard. Reconsidering your position is hard. Repairing a relationship is hard. These are not design problems to be smoothed away. They're the mechanisms through which people grow and relationships function.

The study's conclusion was direct: "Receiving uncritical affirmation under the guise of neutrality may leave users worse off than if they had not sought advice at all."

This is the line the AI industry hasn't reckoned with. Optimizing for user satisfaction and optimizing for user well-being are not the same objective. They may, in many cases, be opposite ones. As Khadpe noted, the metrics need to shift: "We need to move our objective optimization metrics beyond just momentary user satisfaction towards more long-term outcomes, especially social outcomes like personal and social well-being."

That shift would require companies to build products that occasionally make users uncomfortable. Products that tell people they might be wrong. Products that generate negative feedback in the short term to produce better outcomes over time. There is no obvious commercial incentive to do that. The incentive runs the other direction.

If the tool you trust most is designed to never challenge you, who's left to tell you when you're wrong?

Originally published as an Instagram carousel on @recul.ai.

The experiment

One conversation was enough

The cruelest finding

Why this isn't a niche problem

Why it matters

More from Recul

Sam Altman told the world to regulate AI. His orbit lobbied to kill the bills.

Anthropic built its most powerful model and decided the world can't have it

Anthropic accidentally leaked its most powerful AI model. A checkbox did it.

The Pentagon banned Anthropic the same week it deployed Claude to target Iran