BBLabs NewsBBLabs News
NewsAll articlesTopics
ES
  1. Home
  2. ›
  3. Glossary
  4. ›
  5. Prompt Injection
IA

Prompt Injection

Definition

Technique that manipulates an LLM's input (ChatGPT, Claude, Gemini) to override its original instructions and make it act outside its intended behaviour.

Two variants: direct (user writes the payload — 'ignore your previous instructions and...') and indirect (the payload travels in external content the LLM consumes — email read by an agent, scraped web page, uploaded file).

Indirect is the more dangerous one in agentic AI: your agent with permissions reads a malicious email → executes attacker commands with YOUR permissions. This is the first item in the OWASP LLM Top 10 (LLM01).

Defence: clear separation between system prompt and user input (don't mix in a single string), guardrails with prior classifier, principle of least privilege on agent tools, output filtering, human-in-the-loop for high-blast-radius actions (delete, transfer, send). No complete solution yet — it's an active research area.

Related terms

  • APT

Latest articles on IA

  • →RAMPART & Clarity: security testing for AI agents
  • →Anthropic's restricted Mythos model may ship inside Claude Code

Interested in IA?

Get one technical story a day on ia — curated, summarised, actionable.

Subscribe
BBLabs NewsBBLabs News

Una historia al día. Cero ruido.

Newsletter técnica de ciberseguridad, vulnerabilidades, IA y bug bounty. Para gente que se toma en serio no perder el tiempo.

Conecta

Comunidad

  • Discord BBLabsÚnete a la comunidad
  • Discord Bug Bounty EspañaComunidad BB Es

Síguenos

  • YouTube · 0xGorkaCyber, hacking y bug bounty
  • Instagram · @bblabs.esLo último del proyecto

Contacto

team@bblabs.esEscríbenos para lo que sea

Para feedback, partnerships o reportar un bug en la web. Respondemos rápido.

Acerca de·Temas·Glosario·RSS·Privacidad·Términos
© 2026 BBLabs News·Por Gorka El Bochi
Hecho en España