Use cases
Agent evaluation use cases
Pick the workload that matches your team — coding agents, research agents, or customer support agents — and evaluate them on real tasks with replay and scorecards.
Explore AgentClash use cases for coding, research, and customer support agent evaluation with replay evidence and CI gates.