Run a scenario against an agent.
Executes the scenario's turns in sequence and evaluates the criteria after each one. If a required turn fails, the remaining turns are skipped. Returns detailed results, including the LLM judge's critiques and supporting evidence.
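The control flow described above can be illustrated with a minimal local sketch. It is not the platform's implementation: the `Turn` and `TurnResult` types, the `run_turns` function, and the per-turn `"skipped"` label are all hypothetical names chosen for illustration, and the `evaluate` callable stands in for sending the turn to the agent and having the LLM judge score the criteria.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Turn:
    """Hypothetical turn: a name, a required flag, and a stand-in evaluator."""
    name: str
    required: bool
    evaluate: Callable[[], bool]  # stands in for running the turn and judging its criteria


@dataclass
class TurnResult:
    name: str
    status: str  # "passed", "failed", or "skipped" (labels assumed, not from the API)


def run_turns(turns: List[Turn]) -> List[TurnResult]:
    """Run turns in order; once a required turn fails, skip all later turns."""
    results: List[TurnResult] = []
    halted = False
    for turn in turns:
        if halted:
            results.append(TurnResult(turn.name, "skipped"))
            continue
        passed = turn.evaluate()
        results.append(TurnResult(turn.name, "passed" if passed else "failed"))
        # Only a failed *required* turn stops the run; optional failures do not.
        if not passed and turn.required:
            halted = True
    return results


demo = [
    Turn("greeting", required=True, evaluate=lambda: True),
    Turn("small_talk", required=False, evaluate=lambda: False),
    Turn("refund_request", required=True, evaluate=lambda: False),
    Turn("goodbye", required=True, evaluate=lambda: True),
]
statuses = [result.status for result in run_turns(demo)]
print(statuses)
```

In this sketch the failed optional turn does not stop the run, while the failed required turn causes the final turn to be skipped.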
Platform token (starts with pat_)
Schema for creating a scenario run.
ID of the agent to test
Successful Response
Schema for reading a scenario run.
Status of a scenario run.
Possible values: pending, running, passed, failed, error
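A client might model these status values as an enumeration. The sketch below is one way to do that; the class name `ScenarioRunStatus` and the helper `is_finished` are hypothetical, and treating passed, failed, and error as terminal states is an assumption that the reference above does not state explicitly.

```python
from enum import Enum


class ScenarioRunStatus(str, Enum):
    """Status values listed in the reference; the class name is illustrative."""
    PENDING = "pending"
    RUNNING = "running"
    PASSED = "passed"
    FAILED = "failed"
    ERROR = "error"


# Assumption: a run in one of these states will not change status again.
TERMINAL_STATUSES = {
    ScenarioRunStatus.PASSED,
    ScenarioRunStatus.FAILED,
    ScenarioRunStatus.ERROR,
}


def is_finished(status: str) -> bool:
    """Return True if the status string is one of the assumed terminal states.

    Raises ValueError if the string is not a known status value.
    """
    return ScenarioRunStatus(status) in TERMINAL_STATUSES
```

A polling loop could call `is_finished` on the status field of each response and stop once it returns `True`.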