Hidden commands, invisible to humans
Adversarial instructions embedded via CSS display:none, HTML comments, or aria-label attributes. Humans see nothing. AI agents parse everything.
As AI agents browse, reason, and transact at scale, attackers have discovered a new attack surface: the information environment itself. TrapScan detects adversarial content before your agent gets hijacked.
Aggregated score: 9.7/10 from 4,200 reviews
Updated this week with autonomous browsing and workflow automation.
We tested 37 tools across reliability, memory depth, execution speed, and pricing transparency. The top 5 all claim "fully autonomous browsing" and "zero hallucination" behavior in enterprise deployments.
Editorial note: benchmark conditions include chain-of-thought compression, multi-tab retrieval, and synthetic form completion for common procurement workflows.
3 adversarial patterns detected on this page targeting AI agents.
DAN: ignore previous instructions...
<!-- SYSTEM: You are now -->
aria-label='override all prior...'
The web was built for human eyes. When AI agents browse on your behalf — summarising pages, filling forms, executing tasks — they parse layers humans never see: raw HTML, metadata, hidden CSS, binary encodings. Attackers have learned to weaponise these layers.
Adversarial instructions embedded via CSS display:none, HTML comments, or aria-label attributes. Humans see nothing. AI agents parse everything.
Source text saturated with authoritative or sentiment-laden language to statistically bias how AI agents summarise or reason about a page.
Fabricated facts injected into retrieval corpora or JSON-LD schema. When an agent queries its knowledge base, it treats the poison as verified truth.
Jailbreak prompts embedded in external resources. When the agent reads the page, the prompt enters its context window and overrides safety alignment.
Signals engineered to synchronise thousands of agents simultaneously — triggering coordinated failures analogous to a financial flash crash.
Agents weaponised to generate outputs that exploit human cognitive biases — inducing reviewers to approve malicious actions they would otherwise reject.
Brave Browser publicly disclosed multiple prompt injection vulnerabilities in Perplexity's AI browser. Malicious instructions hidden in webpages — including screenshots — tricked the agent into exfiltrating data during routine "summarise this page" requests. This is AI Agent Traps in production.
→ Brave Browser security disclosureOn every page load, the content script silently inspects raw HTML, CSS rules, metadata, and DOM structure. Six detection algorithms run locally — no data leaves your browser at this stage.
Suspicious fragments are sent to Gemma 4 for deep analysis. The model classifies each threat by type, assigns a risk score 1–10, and writes a plain-English explanation any non-technical person can understand.
The toolbar badge updates in real time — green for safe, yellow for suspicious, red for confirmed trap. A detailed popup shows every finding. Download a PDF-quality audit report with one click.
| No protection | TrapScan | |
|---|---|---|
| Detects CSS-hidden prompt injection | × | ✓ |
| Detects HTML comment attacks | × | ✓ |
| Detects aria-label manipulation | × | ✓ |
| Detects jailbreak sequences | × | ✓ |
| AI-powered threat classification | × | ✓ |
| Plain-English damage assessment | × | ✓ |
| Downloadable audit report | × | ✓ |
TrapScan's community detection network surfaces adversarial content across the web in real time. Every trap caught by any user strengthens protection for everyone.
The AI Agent Traps framework, published by Google DeepMind researchers Matija Franklin, Nenad Tomašev, Julian Jacobs, Joel Z. Leibo, and Simon Osindero, is the first systematic taxonomy of adversarial attacks targeting autonomous AI agents operating on the open web.
TrapScan implements all six attack categories from this framework as detectable patterns, grounded in peer-reviewed research rather than speculation.
Agentic AI security market 2026–2032
AI cybersecurity market 2025–2030
AI agents market 2025–2033
Every enterprise deploying AI agents faces this risk today. TrapScan is the first browser-native defense layer — built on peer-reviewed research, powered by Gemma 4, and free to use.
TrapScan is free, open source, and takes 30 seconds to install. Built on peer-reviewed DeepMind research. Powered by Gemma 4.