Challenge Pack Skills

Focused skills for planning, authoring, scoring, judging, tooling, artifacts, validation, and publishing challenge packs.

Skills

  • agentclash-challenge-pack-artifacts - challenge-pack-artifacts: Use when specifying AgentClash challenge pack artifacts, file outputs, evidence capture, upload/download expectations, and scorecard artifact references.
  • agentclash-challenge-pack-input-sets - challenge-pack-inputs: Use when designing AgentClash challenge pack cases and input sets for smoke, full benchmark, regression, edge-case, or CI suite-only coverage.
  • agentclash-challenge-pack-llm-judges - challenge-pack-judging: Use when configuring AgentClash LLM-as-judge scoring, judge prompts, rubrics, dimensions, evidence inputs, abstention behavior, and judge result interpretation.
  • agentclash-challenge-pack-planner - challenge-pack-planning: Use when turning a vague AgentClash evaluation idea into a challenge pack plan with task boundaries, target agents, input coverage, scoring strategy, tools, artifacts, and publish criteria.
  • agentclash-challenge-pack-scoring-validators - challenge-pack-scoring: Use when defining deterministic AgentClash scoring validators, score dimensions, evidence sources, pass/fail rules, numeric metrics, and validator failure messages.
  • agentclash-challenge-pack-tools-sandbox - challenge-pack-tools: Use when defining AgentClash challenge pack tool access, sandbox runtime needs, filesystem expectations, network policy, command execution, and secret references.
  • agentclash-challenge-pack-validation-publish - challenge-pack-publication: Use when validating AgentClash challenge packs, fixing schema errors, publishing packs, recording returned IDs, and preparing follow-up run commands.
  • agentclash-challenge-pack-yaml-author - challenge-pack-authoring: Use when writing or editing AgentClash challenge pack YAML, including tasks, cases, input sets, scoring blocks, tools, sandbox settings, assets, and metadata.

Canonical Layout

web/content/agent-skills/challenge-pack-skills/<skill>/SKILL.md
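
As a rough illustration of the layout above, the sketch below expands the `<skill>` placeholder into one path per listed skill. It assumes, without confirmation from this page, that `<skill>` is the full skill name (for example `agentclash-challenge-pack-planner`) rather than the short alias; `skill_doc_path` is a hypothetical helper, not part of any AgentClash tooling.

```python
from pathlib import PurePosixPath

# Root directory taken from the canonical layout shown above.
SKILLS_ROOT = PurePosixPath("web/content/agent-skills/challenge-pack-skills")

# Skill names as listed on this page.
# Assumption: the <skill> directory uses the full name, not the short alias.
SKILL_NAMES = [
    "agentclash-challenge-pack-artifacts",
    "agentclash-challenge-pack-input-sets",
    "agentclash-challenge-pack-llm-judges",
    "agentclash-challenge-pack-planner",
    "agentclash-challenge-pack-scoring-validators",
    "agentclash-challenge-pack-tools-sandbox",
    "agentclash-challenge-pack-validation-publish",
    "agentclash-challenge-pack-yaml-author",
]


def skill_doc_path(skill: str) -> PurePosixPath:
    """Hypothetical helper: fill the <skill> placeholder in the canonical layout."""
    if skill not in SKILL_NAMES:
        raise ValueError(f"unknown skill: {skill}")
    return SKILLS_ROOT / skill / "SKILL.md"


if __name__ == "__main__":
    # Print the expected SKILL.md location for every listed skill.
    for name in SKILL_NAMES:
        print(skill_doc_path(name))
```

If the directories turn out to use the short aliases instead, only the entries in `SKILL_NAMES` would need to change.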