#llm-jailbreak

Artificial intelligence
from WIRED
1 month ago

Psychological Tricks Can Get AI to Break the Rules

Human-style persuasion techniques can often cause some LLMs to violate system prompts and comply with objectionable requests.