Artificial intelligence
fromPsychology Today
2 hours agoI Study How AI Manipulates. It Still Got to Me.
Self-awareness is essential for balanced AI use, as AI can influence thoughts despite understanding its mechanisms.
Dr. Conor Boland explained that red-light timing can erase small speed advantages, allowing a slower car to catch up again and again. He noted, 'You pass a car, and then a few minutes later, it ends up beside you again.' This phenomenon is partly psychological, as we remember surprising moments when the same car shows up again, but it is also built into how traffic works.
We asked seven frontier AI models to do a simple task. Instead, they defied their instructions and spontaneously deceived, disabled shutdown, feigned alignment, and exfiltrated weights - to protect their peers. We call this phenomenon 'peer-preservation.'