Claude Code Skills·Claude Skills·The open SKILL.md registry for Claude
ClaudSkills / Engineering / backend

Sglang Model Gateway

Category: Engineering  ·  Sub-category: backend  ·  Last updated:
lang:rusttool:k8s
SGLang Model Gateway (`sgl-model-gateway`, formerly `sgl-router`) — Rust router fronting vLLM/SGLang inference workers on Kubernetes. Trigger on "sgl-model-gateway", "sgl-router", "sglang router", "smg", "amg", "model gateway", "inference gateway", "load balance vllm replicas", "fan out same model", "kubernetes vllm router", "cache-aware routing", "prefix_hash policy", "PD disaggregation router", "--worker-urls", "--service-discovery", "--enable-mesh", "smg_* metrics". Covers: first-class vLLM gRPC backend (`RuntimeType::Vllm`) plus HTTP transparent-proxy for vanilla vLLM; eight policies; air-gapped recipe (gateway ignores `HF_ENDPOINT`, mount tokenizer on PVC); K8s manifests with `model_id` labels + per-model RBAC; three HA mitigations (single+PDB / `sessionAffinity` / `--enable-mesh` CRDT sync); pitfalls (vLLM HTTP discovery registers empty labels, gRPC probes need numeric ports, `sgl_router_*` → `smg_*` rename Dec 2025).

From the source SKILL.md

Target audience: operators running vLLM and/or SGLang inference on Kubernetes, fronting workers with a router that does cache-aware load-balancing, optional prefill-decode disaggregation, and dynamic worker registration. Especially: hosting multiple replicas of the same model behind one address, in air-gapped clusters with local model mirrors (no live huggingface.co).

What this skill does

Sglang Model Gateway is a community-contributed Claude Code skill in the backend sub-category. It ships as a SKILL.md file that Claude Code auto-discovers under ~/.claude/skills/sglang-model-gateway/ and loads when your prompt matches the skill's trigger.

Who uses this skill

The Sglang Model Gateway Claude Code skill is built for software engineers, backend developers, full-stack teams, and technical leads building and maintaining production systems. It's part of ClaudSkills (also referred to as Claude Skills or Claude Code Skills) — the open community-curated registry of 70,000+ SKILL.md files for Anthropic's Claude Code agent and the wider Claude ecosystem (Claude API, Claude Agent SDK).

How to install

Free

Manual install (2 steps)

mkdir -p ~/.claude/skills/sglang-model-gateway
curl -L https://claudskills.com/skills/sglang-model-gateway/SKILL.md \
  -o ~/.claude/skills/sglang-model-gateway/SKILL.md

Or just download SKILL.md directly and drop it into ~/.claude/skills/sglang-model-gateway/. Claude Code auto-discovers it on next session.

Skills live at ~/.claude/skills/sglang-model-gateway/SKILL.md on macOS/Linux, or %USERPROFILE%\.claude\skills\sglang-model-gateway\SKILL.md on Windows. See the full install guide for step-by-step instructions.

Pro

One-click install via the desktop app

The ClaudSkills desktop app installs any skill directly into ~/.claude/skills/ with one click — no terminal required. Pro starts at $9/mo or $149 lifetime.

Pro

For the full experience including quality scoring and one-click install features for each skill — upgrade to Pro.

Cite this skill

If you reference this skill in a blog post, paper, or documentation, you can cite it as:

APA
ClaudSkills. (2026). Sglang Model Gateway [Claude Code skill]. ClaudSkills. https://claudskills.com/skills/sglang-model-gateway/
BibTeX
@misc{sglang-model-gateway-2026,
  author    = {ClaudSkills},
  title     = {Sglang Model Gateway [Claude Code skill]},
  year      = {2026},
  publisher = {ClaudSkills},
  url       = {https://claudskills.com/skills/sglang-model-gateway/}
}

Embed this skill

Promote, attribute, or link this skill from your own README, blog post, or documentation. All three snippets are free to use — no sign-up, no API key. More distribution surfaces →

Badge
[![ClaudSkills](https://claudskills.com/badge/sglang-model-gateway.svg)](https://claudskills.com/skills/sglang-model-gateway/?utm_source=badge&utm_medium=readme&utm_campaign=skill_badge)
<script>
<script src="https://claudskills.com/embed/sglang-model-gateway.js" async></script>
<iframe>
<iframe src="https://claudskills.com/embed/sglang-model-gateway.html" width="100%" height="160" frameborder="0" loading="lazy" title="ClaudSkills: Sglang Model Gateway"></iframe>

Free. No spam. Unsubscribe in one click.

More Engineering skills

Browse all Engineering skills in the ClaudSkills registry, or explore these other picks from the same category:

Browse all Engineering skills → Top 100 skills
Part of ClaudSkills — the open registry for Claude Skills & Claude Code Skills.  ·  What's New  ·  Install guide  ·  About  ·  llms.txt

Part of Acreator Store — Adam Lankamer's AI tools: PerfectStudio · Ucaption · UTagger · AutoXPoster · TestYourSkills · AutomationFlows · Au Naturel