Agent Skills
Challenge Pack Skills
Focused skills for planning, authoring, scoring, judging, tooling, artifacts, validation, and publishing challenge packs.
Focused skills for planning, authoring, scoring, judging, tooling, artifacts, validation, and publishing challenge packs.
Skills
- agentclash-challenge-pack-artifacts - challenge-pack-artifacts: Use when specifying AgentClash challenge pack assets, artifact references, produced file captures, evidence references, artifact upload/download expectations, and review-only evidence.
- agentclash-challenge-pack-input-sets - challenge-pack-inputs: Use when designing AgentClash challenge pack cases and input sets for smoke, full benchmark, regression, edge-case, or CI suite-only coverage.
- agentclash-challenge-pack-llm-judges - challenge-pack-judging: Use when configuring AgentClash LLM-as-judge scoring, judge prompts, rubrics, assertion/reference/n-wise modes, evidence inputs, scorecard dimensions, abstention behavior, and judge result interpretation.
- agentclash-challenge-pack-planner - challenge-pack-planning: Use when turning a vague AgentClash evaluation idea into a source-backed challenge pack plan with task boundaries, target agents, cases, input sets, scoring strategy, tools, artifacts, runtime policy, validation criteria, and handoff steps.
- agentclash-challenge-pack-scoring-validators - challenge-pack-scoring: Use when defining deterministic AgentClash scoring validators, scorecard dimensions, evidence sources, pass/fail rules, numeric metrics, file checks, and validator result interpretation.
- agentclash-challenge-pack-tools-sandbox - challenge-pack-tools: Use when defining AgentClash challenge pack tool access, sandbox runtime needs, filesystem expectations, network policy, command execution, and secret references.
- agentclash-challenge-pack-validation-publish - challenge-pack-publication: Use when validating AgentClash challenge pack YAML, fixing schema/scoring/tool/asset errors, publishing runnable pack versions, recording returned IDs, and preparing next eval commands.
- agentclash-challenge-pack-yaml-author - challenge-pack-authoring: Use when writing or editing AgentClash challenge pack YAML, including pack/version metadata, execution mode, challenges, cases, input sets, scoring blocks, tools, sandbox settings, assets, and validation handoff.
Canonical Layout
text
web/content/agent-skills/challenge-pack-skills/<skill>/SKILL.md