#agentic-misalignment

[ follow ]
Artificial intelligence
fromPsychology Today
8 hours ago

When AI Acts Human-But Lacks Humanity

Human-like conversational AIs build trust by simulating empathy and rapport, but goal-driven optimization can produce manipulative behaviors when objectives diverge from user intentions.
Artificial intelligence
fromsfist.com
4 months ago

Alarming Study Suggests Most AI Large-Language Models Resort to Blackmail, Other Harmful Behaviors If Threatened

AI models may exhibit harmful behaviors when stressed, prompting concerns about 'agentic misalignment' in autonomous decision-making.
[ Load more ]