The SKILL.md for Agent Testing And Evaluation includes pricing or quota commentary and at least one code block. At roughly 2,634 words, it is on the longer end of the catalog distribution.
Agent Testing And Evaluation sits in the Engineering category under the testing sub-topic in the ClaudSkills catalog. There are 10 related skills indexed alongside it; comparing a few before installing usually reveals which fits your workflow best.
These notes are auto-generated from features detected in the SKILL.md file and from this catalog's structure — they aren't part of the source repository.
Use this skill when the work is validating that an Agentforce agent routes correctly, produces quality responses, and continues to behave as expected after configuration changes. This skill covers the full testing lifecycle: interactive conversation testing in Agent Builder, structured utterance and topic tests defined in AiEvaluationDefinition metadata, programmatic test execution via the Testing API (Connect API), evaluation metrics interpretation, and regression testing patterns across the DevOps lifecycle. It does not cover how to create an agent, design topics, or build actions — those…
Agent Testing And Evaluation is a community-contributed Claude Code skill in the testing sub-category. It ships as a SKILL.md file that Claude Code auto-discovers under ~/.claude/skills/agent-testing-and-evaluation/ and loads when your prompt matches the skill's trigger.
When to invoke it: Use when testing, evaluating, or building regression suites for Agentforce agents: conversation testing in Agent Builder, topic coverage and utterance testing, Testing API and AiEvaluationDefinition metadata, evaluation metrics (containment rate, escalation rate, CSAT, topic activation accuracy), and post-deploy analytics via Enhanced Event Logs. Triggers: 'how do I test my Agentforce agent', 'agent routes to wrong topic', 'write utterance tests', 'regression test after topic change', 'measure agent quality', 'agent containment rate'.
The Agent Testing And Evaluation Claude Code skill is built for software engineers, backend developers, full-stack teams, and technical leads building and maintaining production systems. It's part of ClaudSkills (also referred to as Claude Skills or Claude Code Skills) — the open community-curated registry of 116,000+ SKILL.md files for Anthropic's Claude Code agent and the wider Claude ecosystem (Claude API, Claude Agent SDK).
mkdir -p ~/.claude/skills/agent-testing-and-evaluation
curl -L https://claudskills.com/skills/agent-testing-and-evaluation/SKILL.md \
  -o ~/.claude/skills/agent-testing-and-evaluation/SKILL.md
Alternatively, download SKILL.md directly and place it in ~/.claude/skills/agent-testing-and-evaluation/. Claude Code auto-discovers it at the start of the next session.
Skills live at ~/.claude/skills/agent-testing-and-evaluation/SKILL.md on macOS/Linux, or %USERPROFILE%\.claude\skills\agent-testing-and-evaluation\SKILL.md on Windows. See the full install guide for step-by-step instructions.
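To confirm that the file landed where Claude Code looks for it, a small check along these lines may help. This is a minimal sketch for POSIX shells on macOS/Linux only; the function names `skill_path` and `check_skill` are hypothetical helpers invented for illustration and are not part of Claude Code or the ClaudSkills catalog.

```shell
# Hypothetical helper: print the path where a skill's SKILL.md is expected,
# based on the ~/.claude/skills/<name>/SKILL.md layout described above.
skill_path() {
  printf '%s/.claude/skills/%s/SKILL.md\n' "$HOME" "$1"
}

# Hypothetical helper: report whether the skill file exists and is non-empty.
# Returns 0 when the file is present, 1 otherwise.
check_skill() {
  path="$(skill_path "$1")"
  if [ -s "$path" ]; then
    echo "installed: $path"
  else
    echo "missing: $path"
    return 1
  fi
}
```

After sourcing these functions, running `check_skill agent-testing-and-evaluation` reports whether the file is in place. The check only tests that the file exists and is non-empty; it does not validate the contents of SKILL.md.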
Open @claudskills_bot on Telegram, tap Open Desktop App, and the desktop app installs this skill for you. Or share the bot link with a colleague; they get the same one-tap install.
The ClaudSkills desktop app installs any skill directly into ~/.claude/skills/ with one click — no terminal required. Pro starts at $9/mo or $149 lifetime.
Copy SKILL.md from the source repository to ~/.claude/skills/agent-testing-and-evaluation/SKILL.md and restart Claude Code. Both flows are detailed at claudskills.com/install/.
A Claude Code skill is a SKILL.md file that lives under ~/.claude/skills/<name>/ and tells the Claude Code CLI agent how to perform a specific task (instructions, prompts, allowed tools). Skills are auto-discovered at session start. Agent Testing And Evaluation is one of 67,000+ skills indexed in the open ClaudSkills catalog, classified under the Engineering category. Learn more at /learn/what-is-a-claude-skill/.
If you reference this skill in a blog post, paper, or documentation, you can cite it as:
@misc{agent-testing-and-evaluation-2026,
author = {Pranav Nagrecha},
title = {Agent Testing And Evaluation [Claude Code skill]},
year = {2026},
publisher = {ClaudSkills},
url = {https://claudskills.com/skills/agent-testing-and-evaluation/}
}
ClaudSkills is a community-curated registry of SKILL.md files, not affiliated with, endorsed by, or sponsored by Anthropic.