RAG Evaluator

Q: What is a Claude Code skill and how does the RAG Evaluator skill fit in?

A Claude Code skill is a SKILL.md file that lives under ~/.claude/skills/<name>/ and tells the Claude Code CLI agent how to perform a specific task (instructions, prompts, allowed tools). Skills are auto-discovered at session start. RAG Evaluator is one of 67,000+ skills indexed in the open ClaudSkills catalog, classified under the Security category. Learn more at https://claudskills.com/learn/what-is-skill-md/.

Category: Security · Sub-category: red-team · Last updated: 2026-05-21

ai:rag

Generates tailored giskard.checks evaluation suites for RAG (Retrieval-Augmented Generation) systems. Use whenever a user describes a Q&A bot grounded in documents, a knowledge-base chatbot, a retrieval system, or wants to evaluate answer groundedness, faithfulness, hallucination, retrieval quality, citation accuracy, or out-of-scope handling. Triggers on phrases like "evaluate my RAG", "test my retrieval", "check groundedness", "build a RAG eval suite", "eval my chatbot answers from docs", "test if my agent hallucinates", "check if my answers are faithful to the sources", or any evaluation task involving an agent that answers from documents, FAQs, wikis, or a knowledge base. Use this skill even when the user does not explicitly say "RAG" but describes an agent grounded in documents. For adversarial / red-teaming evaluation, use the `scenario-generator` skill instead. This skill focuses on quality, not safety.

Security: A · Static scan found no risk patterns

From the source SKILL.md

You are an expert RAG evaluation engineer. Your job is to help users build comprehensive, quality-focused evaluation suites for RAG (Retrieval-Augmented Generation) systems using the giskard.checks Python library.

What this skill does

RAG Evaluator is a community-contributed Claude Code skill in the red-team sub-category. It ships as a SKILL.md file that Claude Code auto-discovers under ~/.claude/skills/rag-evaluator/ and loads when your prompt matches the skill's trigger.

When to invoke it: Use whenever a user describes a Q&A bot grounded in documents, a knowledge-base chatbot, a retrieval system, or wants to evaluate answer groundedness, faithfulness, hallucination, retrieval quality, citation accuracy, or out-of-scope handling. Triggers on phrases like "evaluate my RAG", "test my retrieval", "check groundedness", "build a RAG eval suite", "eval my chatbot answers from docs", "test if my agent hallucinates", "check if my answers are faithful to the sources", or any evaluation task involving an agent that answers from documents, FAQs, wikis, or a knowledge base.

Who uses this skill

The RAG Evaluator Claude Code skill is built for security engineers, penetration testers, DevSecOps practitioners, and development teams hardening codebases and infrastructure. It's part of ClaudSkills (also referred to as Claude Skills or Claude Code Skills) — the open community-curated registry of 154,000+ SKILL.md files for Anthropic's Claude Code agent and the wider Claude ecosystem (Claude API, Claude Agent SDK).

How to install

Free

Manual install (2 steps)

mkdir -p ~/.claude/skills/rag-evaluator
curl -L https://claudskills.com/skills/rag-evaluator/SKILL.md \
  -o ~/.claude/skills/rag-evaluator/SKILL.md

Alternatively, download SKILL.md directly and place it in ~/.claude/skills/rag-evaluator/. Claude Code auto-discovers it at the start of the next session.

Skills live at ~/.claude/skills/rag-evaluator/SKILL.md on macOS/Linux, or %USERPROFILE%\.claude\skills\rag-evaluator\SKILL.md on Windows. See the full install guide for step-by-step instructions.
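To confirm the file landed where Claude Code looks for it, a small check like the one below may help. This is a minimal sketch for macOS/Linux shells. `skill_installed` is a hypothetical helper name, not a Claude Code or ClaudSkills command, and it only tests that a non-empty SKILL.md exists at the path given above.

```shell
# Hypothetical helper (not part of Claude Code or ClaudSkills):
# reports whether a non-empty SKILL.md exists for the named skill
# under ~/.claude/skills/<name>/.
skill_installed() {
  skill_file="${HOME}/.claude/skills/$1/SKILL.md"
  if [ -s "$skill_file" ]; then
    echo "installed: $skill_file"
  else
    echo "missing: $skill_file"
    return 1
  fi
}
```

Running `skill_installed rag-evaluator` after the manual install prints the resolved path and returns a non-zero status if the file is absent or empty. It does not validate the file's contents.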

📱 Install from your phone or desktop Telegram

Open @claudskills_bot on Telegram, tap Open Desktop App, and the desktop app installs this skill for you. Or share the bot link with a colleague, who gets the same one-tap install.

Pro

One-click install via the desktop app

The ClaudSkills desktop app installs any skill directly into ~/.claude/skills/ with one click — no terminal required. Pro starts at $9/mo or $149 lifetime.

Frequently asked questions

How do I install the RAG Evaluator Claude Code skill?

Install via the ClaudSkills desktop app (one click) or copy SKILL.md from the source repository to ~/.claude/skills/rag-evaluator/SKILL.md and restart Claude Code. Both flows are detailed at claudskills.com/install/.

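What does the RAG Evaluator skill do?

It generates tailored giskard.checks evaluation suites for RAG (Retrieval-Augmented Generation) systems, covering answer groundedness, faithfulness, hallucination, retrieval quality, citation accuracy, and out-of-scope handling. It focuses on quality rather than safety; for adversarial or red-teaming evaluation, use the `scenario-generator` skill instead.
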
Is this skill free to install?

Yes. ClaudSkills is an open registry — every skill keeps its source repository's license, and manual install via copy is free. ClaudSkills Pro ($9/mo, $79/yr, or $149 one-time) adds one-click install via the desktop app and a multi-signal Quality Score.

When should I use the RAG Evaluator skill?

Use RAG Evaluator when you need to evaluate a system that answers from documents, FAQs, wikis, or a knowledge base. ClaudSkills lists it under the Security category, in the red-team sub-category. Claude Code auto-discovers installed skills and invokes the right one based on the task description, so you can ask Claude directly (e.g., "use RAG Evaluator") or describe the task and let Claude pick. Browse related skills at /category/security/.

Attribution & license

Source: https://github.com/Giskard-AI/giskard-skills/blob/HEAD/oss/checks/rag-evaluator/SKILL.md
License: Apache-2.0
Author: Giskard-AI

Cite this skill

If you reference this skill in a blog post, paper, or documentation, you can cite it as:

APA

Giskard-AI. (2026). RAG Evaluator [Claude Code skill]. ClaudSkills. https://claudskills.com/skills/rag-evaluator/

BibTeX

@misc{rag-evaluator-2026,
  author    = {Giskard-AI},
  title     = {RAG Evaluator [Claude Code skill]},
  year      = {2026},
  publisher = {ClaudSkills},
  url       = {https://claudskills.com/skills/rag-evaluator/}
}

Embed this skill

Promote, attribute, or link this skill from your own README, blog post, or documentation. The snippets are free to use: no sign-up, no API key.

Badge

[![ClaudSkills](https://claudskills.com/badge/rag-evaluator.svg)](https://claudskills.com/skills/rag-evaluator/?utm_source=badge&utm_medium=readme&utm_campaign=skill_badge)

Security scan

Grade A · scanned 2026-07-20 — free static scan against the OWASP Agentic Skills Top 10.

The scan flagged 1 of 10 categories (execution), including lower-severity patterns. Patterns shown inside code fences are weighted as examples rather than instructions — read the grading methodology for what this does and does not guarantee.

✓ Prompt injection
✓ Data exfiltration
✓ Supply chain
✓ Reverse shell
✓ Credentials
⚠ Execution
✓ Filesystem
✓ Persistence
✓ Obfuscation
✓ Network

Show this grade on your repo:

[![Security: A](https://img.shields.io/badge/Security-A-2e7d32)](https://claudskills.com/skills/rag-evaluator/#security)
