#prompt-injection
#prompt-injection

fromFuturism

2 weeks ago

Scientists Are Sneaking Passages Into Research Papers Designed to Trick AI Reviewers

Invisible AI prompts in academic papers aim to manipulate AI reviews for favorable outcomes.

#ai

fromNature

Privacy professionals

Scientists hide messages in papers to game AI peer review

fromTheregister

Artificial intelligence

Scholars sneaking phrases into papers to fool AI reviewers

fromTechzine Global

1 month ago

Zero-click attack reveals new AI vulnerability

Echoleak exposes vulnerabilities in AI assistants like Microsoft 365 Copilot through subtle prompt manipulation, representing a shift in cybersecurity attack vectors.

Artificial intelligence

Researchers cause GitLab AI developer assistant to turn safe code malicious

Researchers claim breakthrough in fight against AI's frustrating security hole

Prompt injections jeopardize AI systems; Google DeepMind's CaMeL offers a potential solution by treating language models as untrusted components within security frameworks.

fromNature

Privacy professionals

Scientists hide messages in papers to game AI peer review

fromTheregister

Artificial intelligence

Scholars sneaking phrases into papers to fool AI reviewers

fromTechzine Global

1 month ago

Zero-click attack reveals new AI vulnerability

Echoleak exposes vulnerabilities in AI assistants like Microsoft 365 Copilot through subtle prompt manipulation, representing a shift in cybersecurity attack vectors.

Artificial intelligence

Researchers cause GitLab AI developer assistant to turn safe code malicious

Researchers claim breakthrough in fight against AI's frustrating security hole

Prompt injections jeopardize AI systems; Google DeepMind's CaMeL offers a potential solution by treating language models as untrusted components within security frameworks.

Artificial intelligence

Google Adds Multi-Layered Defenses to Secure GenAI from Prompt Injection Attacks

Artificial intelligence

Meta Open Sources LlamaFirewall for AI Agent Combined Protection

Artificial intelligence

Researchers Demonstrate How MCP Prompt Injection Can Be Used for Both Attack and Defense

Artificial intelligence

DeepMind Researchers Propose Defense Against LLM Prompt Injection

fromFuturism

Researchers Find Easy Way to Jailbreak Every Major AI, From ChatGPT to Claude

A newly discovered jailbreak can manipulate AI models into producing harmful content, exposing vulnerabilities in their safety measures.

Growth hacking

4 months ago

Gemini hackers can deliver more potent attacks with a helping hand from... Gemini

Indirect prompt injections are an effective method for exploiting large language models, revealing vulnerabilities in AI systems.

1 month ago

Artificial intelligence

Google Adds Multi-Layered Defenses to Secure GenAI from Prompt Injection Attacks

Artificial intelligence

Meta Open Sources LlamaFirewall for AI Agent Combined Protection

Artificial intelligence

Researchers Demonstrate How MCP Prompt Injection Can Be Used for Both Attack and Defense

DeepMind Researchers Propose Defense Against LLM Prompt Injection

Google DeepMind's CaMeL effectively neutralizes 67% of prompt injection attacks in LLMs using traditional software security principles.

fromFuturism

Researchers Find Easy Way to Jailbreak Every Major AI, From ChatGPT to Claude

A newly discovered jailbreak can manipulate AI models into producing harmful content, exposing vulnerabilities in their safety measures.

Growth hacking

4 months ago

Gemini hackers can deliver more potent attacks with a helping hand from... Gemini

Indirect prompt injections are an effective method for exploiting large language models, revealing vulnerabilities in AI systems.

more#ai-security