InspectAgents - AI Agent Testing & Safety Platform

Name: AI Agent Failures Database
Creator: InspectAgents
License: https://inspectagents.com/terms/

InspectAgents

Playbook

AI Agent Safety Playbook

Actionable guides, operating policies, and step-by-step testing procedures — all informed by 500+ documented AI failures. Stop reading about what goes wrong. Start implementing what works.

Quick Start Path

New here? Follow these four steps to go from zero to a tested, safe AI agent deployment.

Assess Your Risk

Take the 3-minute quiz to discover your highest-risk areas.

Take the Quiz →

Run the Checklist

Walk through 63 testing checkpoints before you ship.

Get the Checklist →

Learn from Failures

Study 500+ real incidents so you don’t repeat them.

Browse Failures →

Follow the Guides

Implement prevention with step-by-step playbook guides below.

Jump to Guides →

Prevention Guides

In-depth guides from InspectAgents, built on lessons from real-world AI incidents.

TestingStart Here

How to Test AI Agents Before Deployment

End-to-end testing framework: hallucination detection, prompt injection, security validation, and production monitoring.

Case Study

Chevrolet $1 Car Fiasco — Full Breakdown

Deep technical analysis of the prompt injection attack, timeline, and exactly how to prevent it in your own chatbot.

Reference

Complete List of AI Chatbot Failures (2025–2026)

Continuously updated database of AI agent failures, hallucinations, and security breaches. Learn what went wrong — and why.

Operating Policies & Templates

Reusable operating contracts that define evidence boundaries, verification rules, and fail-closed behavior for your AI agents. Curated from the AI Agents Playbook by Tamar Peretz.

Evidence BoundaryExternal ↗

Facts-Only: Authoritative Sources Required

Evidence boundary policy: world-claims require authoritative sources. No speculation, no hedging, no hallucination.

Evidence BoundaryExternal ↗

Facts-Only: Artifacts-Only (No External Sources)

Restrict evidence to user-provided artifacts only. No browsing, no training data claims. Maximum control.

CitationsExternal ↗

Web Verification & Citations Policy

Operating contract for public facts via web sources with mandatory inline citations.

Quality GateExternal ↗

Engineering Quality Gate Policy

Architecture + best-practices + regression-minded review gate as a normative operating contract.

AccuracyExternal ↗

Semantic Accuracy Gate

Prevent overclaims and enforce terminology consistency. Catch subtle misrepresentations before they ship.

ReportingExternal ↗

Confidence Score (0–100) — Reporting Rules

Require a numeric confidence line on every response so users know how much to trust the output.

Testing & Verification Procedures

Step-by-step procedures for fact-checking, quality gates, and verification workflows. Copy-paste-ready prompt templates included.

VerificationExternal ↗

Fact-Checking Kit — Verification Procedure

A repeatable evidence-check loop to run before producing any non-trivial output. Claim-by-claim verification workflow.

PolicyExternal ↗

Choose an Evidence Boundary (Facts-Only Mode)

Define allowed sources and refusal conditions for your AI agent. Fail-closed by default — artifacts-only or authoritative sources.

Quality GateExternal ↗

Engineering Quality Gate — Procedure

Architecture, best-practices, and regression-minded review gate. Validate outputs before they reach users.

VerificationExternal ↗

Chain-of-Verification (CoVe) — Procedure

Structured self-check loop before final output. The model verifies its own claims step-by-step before responding.

PromptingExternal ↗

Prompt Engineering Guide for Daily Work

Reusable daily-driver prompt template with evidence boundaries, output contracts, and fallback rules baked in.

CitationsExternal ↗

Web Verification & Citations — Prompt Template

How to request web browsing and produce verifiable, citation-grade references in agent outputs.

Agent Security Deep Dives

Technical articles on trust boundaries, attack surfaces, and control-plane failures in tool-using AI agent systems.

SecurityExternal ↗

The Attack Surface Starts Before Agents — The LLM Boundary

Where the first LLM-to-system boundary appears, and why exposure starts before you even have "agents."

SecurityExternal ↗

The Attack Surface Isn’t the LLM — It’s the Controller Loop

Why multi-step orchestration increases risk, and what to audit in plan → tools → retry loops.

Threat ModelExternal ↗

Request Assembly Threat Model

Context hub, truncation/ordering, tool loops, observability, and R1–R8 audit checkpoints.

AuditExternal ↗

Agentic Systems: 8 Trust-Boundary Audit Checkpoints

Concrete checkpoints for auditing trust boundaries in tool-using, multi-step AI agent systems.

PatternsExternal ↗

Control-Plane Failure Patterns in Agentic Systems

Session binding, memory reuse, routing/selection, tool enforcement, and observability failure patterns.

How the Playbook Fits Together

📋

Policies define the rules

Operating contracts that set evidence boundaries, citation requirements, and fail-closed behavior. Your AI agent's constitution.

🔧

Procedures implement the rules

Step-by-step guides with copy-paste prompt templates. Follow the procedure, get a tested output.

✅

Checklists verify the results

The 63-point checklist and verification gates ensure nothing slips through before deployment.

🔍

Failures teach what to watch for

500+ real incidents show exactly what goes wrong when these practices are skipped. Learn from others' mistakes.

Community Resources

Open-source playbooks and references from the AI safety community.

AI Agents Playbook ↗

Public, versioned knowledge base for building reliable AI agents. Policies, prompt templates, how-to procedures, and security articles.

by Tamar Peretz • andyagentlab.com

ai-agents-playbook (GitHub) ↗

Source repo for the AI Agents Playbook. Browse raw policy files, prompt templates (.system.txt / .user.txt), and contribute.

github.com/Tamarper63

Ready to Ship a Safer AI Agent?

Start with the quiz to identify your biggest risks, then follow the playbook to fix them — all before your users find them first.

Take the AI Risk Quiz →Get the 63-Point Checklist