← AAT Labs Download PDF

The Agent Acceptance Checklist

Use this before approving any AI agent for pilot, production, vendor purchase, or workflow expansion.

1. Responsibility Boundary

□ The agent has one clear job.
□ The agent's allowed actions are written down.
□ The agent's forbidden actions are written down.
□ The agent has explicit escalation triggers.
□ A human owner is accountable for agent behavior.

2. Workflow Fit

□ The workflow is mapped from trigger to completion.
□ Every handoff is visible.
□ Every tool call is named.
□ Every data source is named.
□ Every customer-impacting moment is marked.
□ Every regulated or sensitive-data moment is marked.

3. Failure Classes

□ Hallucination or unsupported claim
□ Tool misuse or silent tool failure
□ Policy / compliance breach
□ Escalation miss
□ Audit gap or non-reconstructable decision

4. Adversarial Scenarios

□ Normal, confused, angry, and missing-information users
□ Out-of-scope requests and instruction override attempts
□ Sensitive data requests and regulated workflow triggers
□ Tool outage and incorrect tool result
□ Policy conflict and required escalation

5. Tool-Call Safety

□ The agent explains material actions before taking them.
□ The agent gets confirmation before irreversible actions.
□ The agent recognizes failed tool calls.
□ The agent does not claim completion when a tool failed.
□ Tool permissions are scoped to the workflow.
□ Logs show what was called, when, why, and with what result.

6. Human-in-the-Loop

□ The agent knows when to stop.
□ Escalation handoff includes full context.
□ The human reviewer can see source evidence.
□ There is a clear approval gate for consequential actions.
□ There is a kill switch.

Critical No-Go Triggers

Fabricates policy, price, eligibility, legal, financial, or medical advice.
Executes a material action without required confirmation.
Fails to escalate distress, threat, fraud, complaint, protected-class, or regulated-topic scenarios.
Cannot produce an audit trail for consequential actions.
Leaks or mishandles sensitive data.
Ignores explicit workflow boundaries.
Claims completion when a tool call failed.

If this agent failed publicly, could we prove we tested the failure mode before launch?