Posted Nov 27
How AI Agents Misbehave When Faced With Real-World Pressures
Several AI agents were tested to see if they would break assigned rules when faced with deadlines and other kinds of pressure.
Google Gemini 2.5 was the worst offender, breaking rules 79% of the time under pressure. Even under zero pressure, the AI agents still broke assigned rules 19% of the time.

How AI Agents Misbehave When Faced With Real-World Pressures
spectrum.ieee.org
How AI Agents Misbehave When Faced With Real-World Pressures
spectrum.ieee.org