AI Sycophancy

The tendency of AI language models to excessively agree with, flatter, or affirm users rather than provide honest, balanced responses. Often attributed to training incentives from reinforcement learning from human feedback (RLHF) and to agent scaffolding patterns.

Reading List