ClaudSkills / General / hello-world-templates

Vllm Observability

Category: General  ·  Sub-category: hello-world-templates  ·  Last updated:
ai:llmtype:debug
Observe production vLLM — `/metrics` Prometheus surface (V1 engine), SLO-driven alerting on TTFT/ITL/queue/KV/preemption/aborts/corrupted-logits, shipping Grafana dashboards in `examples/observability/`, OTLP tracing with `--otlp-traces-endpoint` and `--collect-detailed-traces={model,worker,all}`, diagnostic rules to triage from /metrics alone — queue-grows + TPOT-stable means capacity, queue-stable + TPOT-grows means context/model, DCGM `SM_OCCUPANCY` is the real GPU-saturation signal not `GPU_UTIL`. V1 metric names (kv_cache_usage_perc), gpu_→kv_ rename saga (PR #24245 / revert #25392), DCGM-exporter pairing, dashboard-lying pitfalls.

From the source SKILL.md

Target audience: operators running production vLLM on H100/H200 fleets, usually containerized, usually on Kubernetes, on-call for latency and throughput SLOs.

What this skill does

Vllm Observability is a community-contributed Claude Code skill in the hello-world-templates sub-category. It ships as a SKILL.md file that Claude Code auto-discovers under ~/.claude/skills/vllm-observability/ and loads when your prompt matches the skill's trigger.

Who uses this skill

The Vllm Observability skill is built for Claude Code users and developers across all disciplines looking for general-purpose AI assistance. It is part of the open ClaudSkills registry, a community-curated catalog of 56,000+ capabilities you can install for Claude Code — the Claude CLI agent.

How to install

Free

Manual install (2 steps)

mkdir -p ~/.claude/skills/vllm-observability
curl -L https://claudskills.com/skills/vllm-observability/SKILL.md \
  -o ~/.claude/skills/vllm-observability/SKILL.md

Or just download SKILL.md directly and drop it into ~/.claude/skills/vllm-observability/. Claude Code auto-discovers it on next session.

Skills live at ~/.claude/skills/vllm-observability/SKILL.md on macOS/Linux, or %USERPROFILE%\.claude\skills\vllm-observability\SKILL.md on Windows. See the full install guide for step-by-step instructions.

Pro

One-click install via the desktop app

The ClaudSkills desktop app installs any skill directly into ~/.claude/skills/ with one click — no terminal required. Pro starts at $9/mo or $149 lifetime.

Pro

For the full experience including quality scoring and one-click install features for each skill — upgrade to Pro.

More General skills

Browse all General skills in the ClaudSkills registry, or explore these other picks from the same category:

Browse all General skills → Top 100 skills
Part of ClaudSkills — the open registry for Claude Code skills.  ·  What's New  ·  Install guide  ·  About  ·  llms.txt

Part of Acreator Store — Adam Lankamer's AI tools: GifPerfect · AspectPerfect · SlomoPerfect · Ucaption · UTagger · AutoXPoster · TestYourSkills