Playwright for AI Agents. Test what your agent DOES, not what it SAYS. YAML-first behavioral testing. Catch PII leaks, tool abuse, step explosions. 3200+ tests.
-
Updated
Apr 7, 2026 - TypeScript
Playwright for AI Agents. Test what your agent DOES, not what it SAYS. YAML-first behavioral testing. Catch PII leaks, tool abuse, step explosions. 3200+ tests.
Behavior test framework for AI agents. Define tests in YAML. Run against transcripts. Get scored reports.
Add a description, image, and links to the behavior-testing topic page so that developers can more easily learn about it.
To associate your repository with the behavior-testing topic, visit your repo's landing page and select "manage topics."