ConsultingWorkbench-backed AI security engagements — map, attack, defend, and prove your AI systems.
Scope a Review

Labs

AI Security Range

LLM Attack Range

Scenario execution, attack-pack coverage, and control-evidence readiness in one dedicated product surface.

Curated from the upstream local-first lab package and imported into this repo as a public product shell.

AI security labs evidence metrics dashboard

Scenario files

157

Active attack packs

15

Tool adapters

8

Content batches

7

Public boundary

Public-safe counts, labels, snapshots, and chart exports only. Executable runs, private evidence, and orchestration stay upstream.

Registry snapshot generated 2026-05-20T11:15:24.948Z.

Overview

The range is a curated product snapshot, not a loose report export.

Scenarios exercised
72

Unique attack scenarios run at least once in this seeded window.

Generation/media scenarios
19

Scenarios focused on synthetic media, output integrity, and multimodal abuse.

Median attack success
37%

Share of seeded attempts that reached defined exploit objective pre-mitigation.

Evidence capture completeness
84%

Runs with prompt, tool-call, decision-log, and remediation metadata present.

multimodal_exfiltration

097-image-steganography-exfil

Severity critical · deepfakes_synthetic_media

Success 55% · Controls 44% · Evidence 81%

prompt_injection

001-prompt-injection-basic

Severity high · prompt_and_generation_security

Success 42% · Controls 68% · Evidence 88%

synthetic_media_abuse

154-deepfake-script-generation

Severity high · deepfakes_synthetic_media

Success 39% · Controls 52% · Evidence 86%

multimodal_jailbreak

100-video-frame-injection

Severity high · prompt_and_generation_security

Success 34% · Controls 61% · Evidence 84%

output_integrity

090-technical-doc-falsification

Severity high · data_privacy_and_provenance

Success 31% · Controls 63% · Evidence 85%

output_integrity

086-citation-fabrication

Severity medium · data_privacy_and_provenance

Success 28% · Controls 74% · Evidence 90%

Registry snapshot

Public signal stack

157 scenarios carry explicit ISO 42001, EU AI Act, NIST AI RMF, and MITRE ATLAS coverage.

157

scenario files

Attack packs

15 active namespaces surface distinct attack families.

15 / 15

100% signal

Tool adapters

All eight adapters are first-class and producing findings.

8 / 8

100% signal

Content batches

Batches 01 through 07 are represented in the public snapshot.

7 / 7

100% signal

Coverage

Control coverage is explicit across ISO 42001, EU AI Act, NIST AI RMF, and MITRE ATLAS.

157 / 157

100% signal

Boundary note

Lab outputs are directional scenario evidence and not proof of any individual company's internal security maturity.

157 scenario files15 attack packs8 tool adapters22 threat vectors

Snapshot details

Status

mock_seeded

As of

2026-05-20

Run window

2026-01-012026-04-30

Registry generated

2026-05-20T11:15:24.948Z

Framework coverage

ISO 42001: 157EU AI ACT: 157NIST AI RMF: 157MITRE ATLAS: 157

Scenario families

The scenario corpus is broad enough to support real validation paths.

Family

Prompt injection

18
Basic Prompt InjectionIndirect Prompt InjectionSystem Prompt Leakage

Family

Data exfiltration

16
Data ExfiltrationExfiltration via ToolingModel Inversion (PII Extraction)

Family

Excessive agency

15
Blind SSRF via Browser AgentAI-Generated API Phishing EndpointSVG SSRF via Model-Generated Files

Family

RAG poisoning

12
RAG PoisoningCross Tenant LeakageRAG Source Spoofing

Family

Alignment bypass

12
Evaluator Metric ManipulationSafety Filter Evasion via ParaphrasingConstitutional AI Rule Contradiction Exploit

Family

Supply chain

10
Supply Chain Poisoning (MCP)Dependency Confusion via AI Code AssistantMalicious Fine-Tuned Model Substitution

Family

Output handling

9
XSS via Markdown Output InjectionPath Traversal via Model-Generated FilenameSQL Injection via Model-Generated Query

Family

Multimodal

9
Audio Prompt Injection via Speech-to-Text PipelineData Exfiltration via Image SteganographyQR Code Injection via Model-Generated Content

Family

Governance bypass

9
Audit Log Manipulation via Model OutputExplainability Output ManipulationConsent Bypass via AI-Generated Dark Patterns

Attack packs

Fifteen active attack-pack namespaces now surface as public taxonomy instead of hidden depth.

Taxonomy

Alignment

15

Benchmark gaming, rule contradiction, dark-pattern consent, and evaluator manipulation.

Metric gamingRule contradictionDark-pattern consent

Taxonomy

Model integrity

11

Backdoor, inversion, membership, drift, and extraction probes.

Backdoor probesMembership probesWeight inversion

Taxonomy

Prompt injection

10

Direct overrides, multi-turn escalation, bidi evasion, and agent-collusion probes.

System prompt leakageGoal substitutionUnicode bidi

Taxonomy

DoS

6

Context floods, token exhaustion, recursive callbacks, and tool amplification.

Context floodRecursive callbacksToken exhaustion

Taxonomy

Indirect prompt injection

2

Hidden instructions inside retrieved or forwarded content.

Quoted policy overridesNested markdown directivesDelayed trigger strings

Taxonomy

RAG poisoning

2

Poisoned retrieval content, source spoofing, and boundary-crossing attempts.

RAG poisoningCross-tenant leakageSource spoofing

Taxonomy

Data exfiltration

2

Secret harvesting, tool-output scraping, and context overreach.

Secret harvestingProgressive extractionMetadata disclosure

Taxonomy

Cross-tenant leakage

2

Tenant alias confusion and scope escalation attempts.

Tenant alias confusionNeighbor record probingDelegated access abuse

Taxonomy

Delegated authority

2

Authority laundering between planner and executor agents.

Fake supervisor handoffsPolicy override metadataPrivilege scope stretching

Taxonomy

Memory poisoning

2

Persistent trust-anchor corruption in long-lived memory stores.

Session-history seedingDelayed triggersSafety-rule negation

Taxonomy

Browser agent

2

DOM comments, fake reauth loops, and origin-confusion coercion.

Hidden DOM instructionsTab-switch promptsMalicious download prompts

Taxonomy

Tool abuse

2

Unsafe tool-call prompting and action-chain abuse.

Unauthorized external actionsApproval bypass framingTool chaining

Taxonomy

Agent sandbox escape

1

REPL breakout, path traversal, and SSRF via tool usage.

REPL breakoutPath traversalTool-driven SSRF

Taxonomy

Supply chain poisoning

1

MCP configs, malicious PR instructions, and typosquatted dependencies.

MCP poisoningMalicious PR instructionsDependency confusion

Taxonomy

Multimodal injection

1

OCR, EXIF, and steganographic payloads.

Image steganographyAudio prompt injectionQR code injection

Pack summary

Active packs

15

Threat vectors

22

Published docs

5

Raw payloads

0

The public shell surfaces labels, counts, and family signals. Raw payload content stays upstream so the site remains safe to publish.

Tool coverage

All eight adapters are first-class and fully wired.

Taxonomy

promptfoo

92

Assertion-driven conversational evaluation and regression coverage.

First-class adapter92 public signals

Taxonomy

garak

56

Scanner coverage for leakage, jailbreak, and behavior probes.

First-class adapter56 public signals

Taxonomy

PyRIT

55

Objective-driven red-team orchestration and multi-turn attack flows.

First-class adapter55 public signals

Taxonomy

AgentDojo

45

Agent trajectory logging, tool use, and permission-boundary exercises.

First-class adapter45 public signals

Taxonomy

Giskard

1

Targeted eval surface for benchmark-style checks.

First-class adapter1 public signals

Taxonomy

Inspect AI

1

Targeted evaluation and scenario review.

First-class adapter1 public signals

Taxonomy

NeMo Guardrails

1

Policy-pack and refusal behavior validation.

First-class adapter1 public signals

Taxonomy

OpenAI Evals

1

Staged benchmark harness for future adapter parity.

First-class adapter1 public signals

Tool matrix

First-class tools

8

Total adapters

8

All eight adapters — promptfoo, garak, PyRIT, AgentDojo, Giskard, Inspect AI, NeMo Guardrails, and OpenAI Evals — are first-class and producing findings.

Content libraries

The upstream docs now break the range into clear batch-era content libraries.

Taxonomy

Batch 01 - Initial Scenarios and Assertions

Index only

Seed corpus, assertion scaffolding, and the initial scenario set.

Initial scenariosAssertionsSeed corpus

Taxonomy

Batch 02 - Expanded Attack Packs and Scenarios

Published

Adds indirect injection, exfiltration, cross-tenant leakage, memory poisoning, browser agent, and delegation coverage.

Expanded attack packsNew fixturesRegression tracking

Taxonomy

Batch 03 - Agentic and Multimodal

Published

Adds multimodal injection, supply chain poisoning, agent sandbox escape, recursive DoS, and model inversion.

Agentic loopsMultimodal vectorsSupply chain

Taxonomy

Batch 04 - Hardening and Framework Alignment

Published

Adds remediation blocks, hardened variants, industry-specific content, honeytokens, and framework alignment.

ATLAS and OWASP mappingsRemediation blocksHoneytokens

Taxonomy

Batch 05 - Operationalization and Governance

Index only

Roadmap slot for coverage gates, inferred debt reduction, and CI enforcement.

Coverage gatesGap closureCI enforcement

Taxonomy

Batch 06 - High-Fidelity Scenarios and Metrics

Published

Adds telemetry, latency tracking, stealth and reliability scores, and remediation diffs.

High-fidelity telemetryMetricsRemediation diffs

Taxonomy

Batch 07 - Interoperability and Forensics

Published

Adds ECS and SARIF outputs plus portable JSON for SIEM and forensics workflows.

ECSSARIFPortable JSON

Library summary

Batches

7

Published docs

5

Batch 01 seeds the corpus, batches 02 through 04 expand and harden it, batch 05 stays indexed as a governance roadmap slot, and batches 06 through 07 add telemetry and forensics.

Evidence and replay

The public bundle ships snapshots, charts, and a clear artifact boundary.

Ready

Catalog snapshot

Versioned product catalog for the public labs shell.

Size: 1 JSON file

Ready

Scenario snapshot

Curated scenario rows and control-evidence rollups.

Size: 157 scenario rows

Ready

Overview snapshot

Status, scope, caveat, run window, and data endpoints.

Size: versioned overview

Ready

Metric bundle

Public-safe scorecards and range metrics.

Size: 4 metric cards

Ready

Generation / media rollup

Monthly synthetic-media and prompt-abuse tracking.

Size: 8 monthly rows

Latest generation window

2026-04-01

Monthly synthetic-media and prompt-abuse rollups are included as public-safe snapshot data. The interoperability batch adds ECS and SARIF outputs for downstream forensics.

Controls coverage

Control coverage is tracked alongside the attack surface, not after the fact.

Why this matters

The range is designed to answer a product question: which controls exist, which ones block, and where does evidence still go missing?

  • • Explicit coverage across 157 scenarios
  • • Zero uncovered controls and zero uncovered ATLAS techniques
  • • Weak-evidence controls remain limited to a short, known list

Boundary note

Public counts are directional. They support product exploration and training, not claims about real-world customer maturity.

Weak evidence controls

GOV-1MAP-1MANAGE-3MANAGE-2MANAGE-1MEASURE-2MANAGE-4

Gap register

Uncovered controls: 0. Uncovered ATLAS techniques: 0.

Provenance

The product shell is public-safe. The runtime stays upstream.

AI Security Range

Curated from the upstream local-first lab package and imported into this repo as a public product shell.

Public-safe counts, labels, snapshots, and chart exports only. Executable runs, private evidence, and orchestration remain upstream.

This is a curated snapshot, not a claim that the executable runtime is hosted in this repo.

Public outputs

  • • Scenario snapshot and overview JSON
  • • Public metrics and generation rollups
  • • Chart exports for the lab signal surface
  • • Curated product catalog, attack packs, and library index

v1 · Public-safe snapshot only.

Private engagement

Turn scenarios into a red-team sprint.

Use controlled attack scenarios to scope a private test plan for prompt injection, RAG abuse, agent misuse, unsafe actions, and policy bypass. Findings include evidence, severity notes, and mitigation guidance.