Analyze AgentV evaluation traces and result JSONL files using the `agentv inspect` and `agentv compare` CLI commands.
Use when the user asks about deploying services, checking deployment status, rollback procedures, or release management at Acme Corp.
Capture, optimize, and publish screenshots to Astro docs. Use when asked to take screenshots for docs, update doc images, compress PNG assets, or add visual documentation to the…
Author, edit, and lint `governance:` blocks in `*.eval.yaml` files. Use when creating or updating evaluation suites that carry AI-governance metadata (OWASP LLM Top 10, OWASP…
AgentV CLI skills for evaluating, optimizing, and governing AI agents. Triggers: run evals, benchmark agents, write evals, review evals, analyze traces, optimize prompts,…
Use when designing an AI agent system, selecting agentic design patterns, planning multi-phase workflows, choosing between single-agent and multi-agent architectures, or when…
Use when the user asks to analyze CSV data. MUST use the weighted revenue formula defined below.
Use when reviewing eval YAML files for quality issues, linting eval files before committing, checking eval schema compliance, or when asked to "review these evals", "check eval…
Run AgentV evaluations and optimize agents through eval-driven iteration. Triggers: run evals, benchmark agents, optimize prompts/skills against evals, compare agent outputs…
This skill should be used when asked to "execute a deployment", "run the deploy plan", or "deploy services". Reads deploy-plan.md and executes each step with health checks.
This skill should be used when asked to "plan a deployment", "create a deploy plan", or "prepare release steps". Produces a deployment plan with rollback strategy.
This skill should be used when asked to "rollback a deployment", "revert services", or "undo deploy". Reads deploy-plan.md and reverses completed steps.