ClaudSkills / General / general-misc

LLM Regression Runner

Category: General  ·  Sub-category: general-misc  ·  Last updated:
ai:llm
Use this skill when a developer wants to test a prompt change against a golden dataset and see what broke. Triggers on: "run my evals", "test this prompt change", "check for regressions", "did I break anything", "run regression tests", "test against golden dataset", "compare prompt versions", "is it safe to deploy", "run offline evals", "what changed after my prompt update", "eval before deploying". Runs a golden dataset against the current prompt, scores each case with available judges, compares results against a saved baseline, and produces a pass/fail report with a clear deploy recommendation.

What this skill does

LLM Regression Runner is a community-contributed Claude Code skill in the general-misc sub-category. It ships as a SKILL.md file that Claude Code auto-discovers under ~/.claude/skills/llm-regression-runner/ and loads when your prompt matches the skill's trigger.

Who uses this skill

The LLM Regression Runner skill is built for Claude Code users and developers across all disciplines looking for general-purpose AI assistance. It is part of the open ClaudSkills registry, a community-curated catalog of 56,000+ capabilities you can install for Claude Code — the Claude CLI agent.

How to install

Free

Manual install (2 steps)

mkdir -p ~/.claude/skills/llm-regression-runner
curl -L https://claudskills.com/skills/llm-regression-runner/SKILL.md \
  -o ~/.claude/skills/llm-regression-runner/SKILL.md

Or just download SKILL.md directly and drop it into ~/.claude/skills/llm-regression-runner/. Claude Code auto-discovers it on next session.

Skills live at ~/.claude/skills/llm-regression-runner/SKILL.md on macOS/Linux, or %USERPROFILE%\.claude\skills\llm-regression-runner\SKILL.md on Windows. See the full install guide for step-by-step instructions.

Pro

One-click install via the desktop app

The ClaudSkills desktop app installs any skill directly into ~/.claude/skills/ with one click — no terminal required. Pro starts at $9/mo or $149 lifetime.

Pro

For the full experience including quality scoring and one-click install features for each skill — upgrade to Pro.

More General skills

Browse all General skills in the ClaudSkills registry, or explore these other picks from the same category:

Browse all General skills → Top 100 skills
Part of ClaudSkills — the open registry for Claude Code skills.  ·  What's New  ·  Install guide  ·  About  ·  llms.txt

Part of Acreator Store — Adam Lankamer's AI tools: GifPerfect · AspectPerfect · SlomoPerfect · Ucaption · UTagger · AutoXPoster · TestYourSkills