Agent Eval Design is a community-contributed Claude Code skill in the testing sub-category. It ships as a SKILL.md file that Claude Code auto-discovers under ~/.claude/skills/agent-eval-design/ and loads when your prompt matches the skill's trigger.
When to invoke it: Use when designing evaluations for AI agents, skills, routers, prompts, tool-use policies, or multi-step workflows: task sets, rubrics, graders, hard negatives, regression cases, traces, and acceptance thresholds. Do NOT use for application test planning (use `testing-strategy`), skill-library health tooling (use `skill-infrastructure`), or live debugging of a failed run (use `debugging`).
The Agent Eval Design Claude Code skill is built for software engineers, backend developers, full-stack teams, and technical leads building and maintaining production systems. It's part of ClaudSkills (also referred to as Claude Skills or Claude Code Skills) — the open community-curated registry of 69,000+ SKILL.md files for Anthropic's Claude Code agent and the wider Claude ecosystem (Claude API, Claude Agent SDK).
mkdir -p ~/.claude/skills/agent-eval-design curl -L https://claudskills.com/skills/agent-eval-design/SKILL.md \ -o ~/.claude/skills/agent-eval-design/SKILL.md
Or just download SKILL.md directly and drop it into ~/.claude/skills/agent-eval-design/. Claude Code auto-discovers it on next session.
Skills live at ~/.claude/skills/agent-eval-design/SKILL.md on macOS/Linux, or %USERPROFILE%\.claude\skills\agent-eval-design\SKILL.md on Windows. See the full install guide for step-by-step instructions.
The ClaudSkills desktop app installs any skill directly into ~/.claude/skills/ with one click — no terminal required. Pro starts at $9/mo or $149 lifetime.
For the full experience including quality scoring and one-click install features for each skill — upgrade to Pro.
Browse all Engineering skills in the ClaudSkills registry, or explore these other picks from the same category:
Part of Acreator Store — Adam Lankamer's AI tools: PerfectStudio · Ucaption · UTagger · AutoXPoster · TestYourSkills · AutomationFlows · Au Naturel
SKILL.md files, not affiliated with, endorsed by, or sponsored by Anthropic.