Agent Skills

Challenge Pack Skills

Focused skills for planning, authoring, scoring, judging, tooling, artifacts, validation, and publishing challenge packs.

Skills

agentclash-challenge-pack-artifacts - challenge-pack-artifacts: Use when specifying AgentClash challenge pack assets, artifact references, produced file captures, evidence references, artifact upload/download expectations, and review-only evidence.
agentclash-challenge-pack-input-sets - challenge-pack-inputs: Use when designing AgentClash challenge pack cases and input sets for smoke, full benchmark, regression, edge-case, or CI suite-only coverage.
agentclash-challenge-pack-llm-judges - challenge-pack-judging: Use when configuring AgentClash LLM-as-judge scoring, judge prompts, rubrics, assertion/reference/n-wise modes, evidence inputs, scorecard dimensions, abstention behavior, and judge result interpretation.
agentclash-challenge-pack-planner - challenge-pack-planning: Use when turning a vague AgentClash evaluation idea into a source-backed challenge pack plan with task boundaries, target agents, cases, input sets, scoring strategy, tools, artifacts, runtime policy, validation criteria, and handoff steps.
agentclash-challenge-pack-scoring-validators - challenge-pack-scoring: Use when defining deterministic AgentClash scoring validators, scorecard dimensions, evidence sources, pass/fail rules, numeric metrics, file checks, and validator result interpretation.
agentclash-challenge-pack-tools-sandbox - challenge-pack-tools: Use when defining AgentClash challenge pack tool access, sandbox runtime needs, filesystem expectations, network policy, command execution, and secret references.
agentclash-challenge-pack-validation-publish - challenge-pack-publication: Use when validating AgentClash challenge pack YAML, fixing schema/scoring/tool/asset errors, publishing runnable pack versions, recording returned IDs, and preparing next eval commands.
agentclash-challenge-pack-yaml-author - challenge-pack-authoring: Use when writing or editing AgentClash challenge pack YAML, including pack/version metadata, execution mode, challenges, cases, input sets, scoring blocks, tools, sandbox settings, assets, and validation handoff.

Canonical Layout

web/content/agent-skills/challenge-pack-skills/<skill>/SKILL.md
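
As a minimal sketch of how the layout above resolves to a concrete file, the helper below fills the `<skill>` placeholder with a directory name. The function name `skill_md_path` and the example directory `example-skill` are hypothetical and used only for illustration; the page does not say whether `<skill>` is the full `agentclash-...` identifier or the shorter skill name, so the helper accepts either.

```python
from pathlib import PurePosixPath

# Root taken from the canonical layout shown above.
LAYOUT_ROOT = PurePosixPath("web/content/agent-skills/challenge-pack-skills")


def skill_md_path(skill: str) -> PurePosixPath:
    """Return the SKILL.md path for one skill directory (hypothetical helper)."""
    # Reject values that are not a single directory name.
    if not skill or "/" in skill or skill in (".", ".."):
        raise ValueError(f"not a single directory name: {skill!r}")
    return LAYOUT_ROOT / skill / "SKILL.md"


# "example-skill" is a placeholder, not a real skill directory.
print(skill_md_path("example-skill"))
```

Using `PurePosixPath` keeps the forward slashes from the layout regardless of the operating system the script runs on.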
