Claude Code Skills·Claude Skills·The open SKILL.md registry for Claude

Os Eval Runner

Category: Science & Research  ·  Sub-category: science-misc  ·  Last updated:
lang:python
Stateless evaluation engine that scores and gates skill improvement iterations using headless Python evaluation scripts. Use when the user says "evaluate this skill", "run autoresearch loop on", "optimize this skill", "run the eval loop", or when another agent proposes a change to an existing skill and needs empirical validation before applying it. Supports autonomous loop mode for iterative improvement and single-shot QA mode for validating one specific proposed change. Requires Python 3.8+ and a git repository.
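The two stated requirements (Python 3.8+ and a git repository) can be checked before invoking the skill. The sketch below is illustrative only: the helper names python_ok and inside_git_repo are hypothetical and are not part of the skill itself. It relies on the standard git rev-parse --is-inside-work-tree command.

```python
import subprocess
import sys


def python_ok(version_info=sys.version_info, minimum=(3, 8)):
    """Return True when the interpreter meets the stated 3.8+ requirement."""
    return tuple(version_info[:2]) >= minimum


def inside_git_repo(path="."):
    """Return True when `path` lies inside a git working tree.

    Hypothetical helper: the skill's own checks may differ.
    """
    try:
        result = subprocess.run(
            ["git", "rev-parse", "--is-inside-work-tree"],
            cwd=str(path),
            capture_output=True,
            text=True,
            check=False,
        )
    except OSError:
        # git is not installed, or the directory does not exist
        return False
    return result.returncode == 0 and result.stdout.strip() == "true"


if __name__ == "__main__":
    print("Python 3.8+:", python_ok())
    print("Inside a git repository:", inside_git_repo("."))
```

Running the file from the directory that holds the skill under test prints both results, which makes it easy to spot a missing prerequisite before starting a long loop.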

From the source SKILL.md

<example>
<commentary>Start autonomous improvement loop on a skill.</commentary>
user: "Run the autoresearch loop on plugins/link-checker/skills/link-checker-agent for 20 iterations"
assistant: [triggers os-eval-runner, runs Mode 1 intake, establishes baseline, begins iteration loop]
</example>

What this skill does

Os Eval Runner is a community-contributed Claude Code skill in the science-misc sub-category. It ships as a SKILL.md file that Claude Code auto-discovers under ~/.claude/skills/os-eval-runner/ and loads when your prompt matches the skill's trigger.

When to invoke it: when the user says "evaluate this skill", "run autoresearch loop on", "optimize this skill", or "run the eval loop", or when another agent proposes a change to an existing skill and needs empirical validation before applying it. The skill runs in one of two modes: autonomous loop mode for iterative improvement, or single-shot QA mode for validating one specific proposed change.
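The idea of scoring a change and gating it on the result can be sketched as a simple keep-if-better loop. This is a minimal illustration under assumptions, not the skill's actual implementation: run_gated_loop, propose, and evaluate are hypothetical names, and the toy scorer (count of distinct words) stands in for whatever headless evaluation script the skill really runs. With iterations=1 the same function behaves like a single-shot check of one proposed change.

```python
from typing import Callable, List, Tuple


def run_gated_loop(
    baseline: str,
    propose: Callable[[str, int], str],
    evaluate: Callable[[str], float],
    iterations: int,
) -> Tuple[str, float, List[bool]]:
    """Keep a candidate only when it scores strictly above the current best."""
    best = baseline
    best_score = evaluate(baseline)  # establish the baseline score first
    decisions: List[bool] = []
    for i in range(iterations):
        candidate = propose(best, i)
        score = evaluate(candidate)
        accepted = score > best_score  # the gate: no improvement, no change
        if accepted:
            best, best_score = candidate, score
        decisions.append(accepted)
    return best, best_score, decisions


# Toy stand-ins (assumptions for illustration only).
WORDS = ["trigger", "trigger", "baseline"]


def toy_propose(text: str, i: int) -> str:
    return text + " " + WORDS[i]


def toy_evaluate(text: str) -> float:
    return float(len(set(text.split())))


if __name__ == "__main__":
    best, score, decisions = run_gated_loop(
        "evaluate skill", toy_propose, toy_evaluate, iterations=3
    )
    print(best, score, decisions)
```

In the toy run, the second proposal repeats a word already present, so its score does not improve and the gate rejects it, while the first and third proposals are kept.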

Who uses this skill

The Os Eval Runner Claude Code skill is built for researchers, data scientists, academics, and analysts working with complex data and scientific literature. It's part of ClaudSkills (also referred to as Claude Skills or Claude Code Skills) — the open community-curated registry of 69,000+ SKILL.md files for Anthropic's Claude Code agent and the wider Claude ecosystem (Claude API, Claude Agent SDK).

How to install

Free

Manual install (2 steps)

mkdir -p ~/.claude/skills/os-eval-runner
curl -L https://claudskills.com/skills/os-eval-runner/SKILL.md \
  -o ~/.claude/skills/os-eval-runner/SKILL.md

Alternatively, download SKILL.md directly and place it in ~/.claude/skills/os-eval-runner/. Claude Code auto-discovers it in the next session.

Skills live at ~/.claude/skills/os-eval-runner/SKILL.md on macOS/Linux, or %USERPROFILE%\.claude\skills\os-eval-runner\SKILL.md on Windows. See the full install guide for step-by-step instructions.
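To confirm the file landed in the location described above, a short cross-platform check can help. This is a sketch under assumptions: skill_path and is_installed are hypothetical helper names, and the check only tests that the file exists, not that Claude Code has loaded it. Path.home() resolves to the user's home directory on macOS/Linux and to %USERPROFILE% on Windows.

```python
from pathlib import Path


def skill_path(skill_name="os-eval-runner", home=None):
    """Build the expected SKILL.md location under the user's home directory."""
    base = Path(home) if home is not None else Path.home()
    return base / ".claude" / "skills" / skill_name / "SKILL.md"


def is_installed(skill_name="os-eval-runner", home=None):
    """Return True when SKILL.md exists at the expected location."""
    return skill_path(skill_name, home).is_file()


if __name__ == "__main__":
    print(skill_path())
    print("installed:", is_installed())
```

The optional home argument exists only so the check can be pointed at a scratch directory when testing.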

Pro

One-click install via the desktop app

The ClaudSkills desktop app installs any skill directly into ~/.claude/skills/ with one click — no terminal required. Pro starts at $9/mo or $149 lifetime.


Attribution & license
