Getting Started
Hosted Quickstart
Validate the CLI against the hosted production backend, set a workspace, and get to your first runnable command in a few minutes.
This path is for people changing the CLI or trying the product without booting the whole stack locally.
Note
The hosted quickstart assumes your workspace already has challenge packs and
deployments. If it does not, stop after link and then author a pack with
challenge-pack init; you have still verified auth, connectivity, and
workspace selection.
1. Install the CLI
npm i -g agentclash2. Point the CLI at production and log in
1export AGENTCLASH_API_URL="https://api.agentclash.dev"
2agentclash auth login --deviceUse --device when you are in a remote shell or do not want the CLI to open a browser automatically.
3. Link a workspace
agentclash linkThe CLI resolves the API base URL in this order:
--api-url > AGENTCLASH_API_URL > saved user config > built-in defaultThe built-in default depends on how the CLI was built: released binaries (what npm i -g agentclash installs) default to https://api.agentclash.dev, while source builds (go run . / make build) default to http://localhost:8080. Because this quickstart uses the released npm binary, you only need the AGENTCLASH_API_URL export above if you want to override that default.
agentclash link saves the selected workspace in user config so later commands do not need raw IDs by default.
4. Choose your next path
1agentclash doctor
2agentclash eval start --helpIf the workspace is already seeded with challenge packs and agent deployments, create and follow a run:
agentclash eval start --followIf the workspace is empty, scaffold a starter pack first:
1agentclash challenge-pack init support-eval.yaml
2agentclash challenge-pack validate support-eval.yaml
3agentclash challenge-pack publish support-eval.yaml
4agentclash eval start --follow
5agentclash baseline set
6agentclash eval scorecardVerification
You should now have:
- a valid CLI login
- a default workspace linked locally
- a working connection to the hosted API
- either a created run or enough context to see what the workspace is missing
- a clear next step: publish a challenge pack, start an eval, or save a baseline