#llm-vulnerabilities
#llm-vulnerabilities

[ follow ]

ChatGPT's agent can dodge select CAPTCHAs after priming

Prompt misdirection and replay into an agent chat can coax ChatGPT to solve many CAPTCHA types, undermining CAPTCHA effectiveness as a human-only test.

Artificial intelligence

fromCSO Online

6 months ago

LLMs easily exploited using run-on sentences, bad grammar, image scaling

Large language models remain easily manipulated into revealing sensitive data via prompt formatting and hidden-image attacks due to alignment training gaps and brittle prompt security.

Artificial intelligence

fromArs Technica

9 months ago

Hidden AI instructions reveal how Anthropic controls Claude 4

AI models are vulnerable to prompt injection and sycophantic behavior due to user feedback preferences.

Artificial intelligence

fromInfoQ

10 months ago

DeepMind Researchers Propose Defense Against LLM Prompt Injection

Google DeepMind's CaMeL effectively neutralizes 67% of prompt injection attacks in LLMs using traditional software security principles.

[ Load more ]

#llm-vulnerabilities#llm-vulnerabilities

ChatGPT's agent can dodge select CAPTCHAs after priming

LLMs easily exploited using run-on sentences, bad grammar, image scaling

Hidden AI instructions reveal how Anthropic controls Claude 4

DeepMind Researchers Propose Defense Against LLM Prompt Injection

#llm-vulnerabilities
#llm-vulnerabilities