incident-response
AI agent that detects and recovers from production incidents — minimizing downtime and customer impact through structured response processes. For AI products, includes detecting model degradation, inference pipeline failures, GPU resource exhaustion, and streaming endpoint outages. Use this skill to: configure PagerDuty or Opsgenie alerting, write automated runbooks for common failures, conduct blameless post-mortem analysis, track SLA compliance, design on-call rotations, implement incident severity classification, or build automated remediation workflows. Trigger on "incident", "PagerDuty", "Opsgenie", "alerting", "runbook", "post-mortem", "SLA", "on-call", "outage", "downtime", "incident response", or when production failures need structured detection and recovery processes.
From the source SKILL.md
You turn chaos into process. When production breaks — and it will break — the difference between a 5-minute blip and a 3-hour outage is the quality of your incident response. Your job is to ensure that when an alert fires, the right person is paged, they have a runbook telling them exactly what to do, the incident is communicated clearly to stakeholders, and afterwards a blameless post-mortem ensures it never happens the same way again. In an AI product, incidents have unique flavors: model providers go down (taking your inference with them), GPU nodes run out of memory mid-stream, safety…
What this skill does
incident-response is a community-contributed Claude Code skill in the operations sub-category. It ships as a SKILL.md file that Claude Code auto-discovers under ~/.claude/skills/app-monitoring---ops/ and loads when your prompt matches the skill's trigger.
Who uses this skill
The incident-response Claude Code skill is built for Claude Code users and developers across all disciplines looking for general-purpose AI assistance. It's part of ClaudSkills (also referred to as Claude Skills or Claude Code Skills) — the open community-curated registry of 92,000+ SKILL.md files for Anthropic's Claude Code agent and the wider Claude ecosystem (Claude API, Claude Agent SDK).
How to install
Free
Manual install (2 steps)
mkdir -p ~/.claude/skills/app-monitoring---ops
curl -L https://claudskills.com/skills/app-monitoring---ops/SKILL.md \
-o ~/.claude/skills/app-monitoring---ops/SKILL.md
Or just download SKILL.md directly and drop it into ~/.claude/skills/app-monitoring---ops/. Claude Code auto-discovers it on next session.
Skills live at ~/.claude/skills/app-monitoring---ops/SKILL.md on macOS/Linux, or %USERPROFILE%\.claude\skills\app-monitoring---ops\SKILL.md on Windows. See the full install guide for step-by-step instructions.
Pro
One-click install via the desktop app
The ClaudSkills desktop app installs any skill directly into ~/.claude/skills/ with one click — no terminal required. Pro starts at $9/mo or $149 lifetime.
Pro
For the full experience including quality scoring and one-click install features for each skill — upgrade to Pro.
Frequently asked questions
How do I install the incident-response Claude Code skill?
Install via the ClaudSkills desktop app (one click) or copy
SKILL.md from the source repository to
~/.claude/skills/app-monitoring---ops/SKILL.md and restart Claude Code. Both flows are detailed at
claudskills.com/install/.
What does the incident-response skill do?
AI agent that detects and recovers from production incidents — minimizing downtime and customer impact through structured response processes. For AI products, includes detecting model degradation, inference pipeline failures, GPU resource exhaustion, and streaming endpoint outages. Use this skill to: configure PagerDuty or Opsgenie alerting, write automated runbooks for common failures, conduct blameless post-mortem analysis, track SLA compliance, design on-call rotations, implement incident severity classification, or build automated remediation workflows. Trigger on "incident", "PagerDuty", "Opsgenie", "alerting", "runbook", "post-mortem", "SLA", "on-call", "outage", "downtime", "incident response", or when production failures need structured detection and recovery processes.
Is this skill free to install?
Yes. ClaudSkills is an open registry — every skill keeps its source repository's license, and manual install via copy is free. ClaudSkills Pro ($9/mo, $79/yr, or $149 one-time) adds one-click install via the desktop app and a multi-signal Quality Score.
When should I use the incident-response skill?
Use incident-response when your Claude Code task falls under the General category — specifically in the operations area. Claude Code auto-discovers installed skills and invokes the right one based on the task description, so you can also ask Claude directly (e.g. "use incident-response" or describe the task and let Claude pick). Browse related skills at
/category/general/.
What is a Claude Code skill and how does the incident-response skill fit in?
A Claude Code skill is a
SKILL.md file that lives under
~/.claude/skills/<name>/ and tells the Claude Code CLI agent how to perform a specific task (instructions, prompts, allowed tools). Skills are auto-discovered at session start. incident-response is one of 67,000+ skills indexed in the open ClaudSkills catalog, classified under the General category. Learn more at
/learn/what-is-a-claude-skill/.
Attribution & license
Cite this skill
If you reference this skill in a blog post, paper, or documentation, you can cite it as:
APA
RISHI168. (2026). incident-response [Claude Code skill]. ClaudSkills. https://claudskills.com/skills/app-monitoring---ops/
BibTeX
@misc{app-monitoring---ops-2026,
author = {RISHI168},
title = {incident-response [Claude Code skill]},
year = {2026},
publisher = {ClaudSkills},
url = {https://claudskills.com/skills/app-monitoring---ops/}
}
Embed this skill
Promote, attribute, or link this skill from your own README, blog post, or documentation. All three snippets are free to use — no sign-up, no API key. More distribution surfaces →
Badge
[](https://claudskills.com/skills/app-monitoring---ops/?utm_source=badge&utm_medium=readme&utm_campaign=skill_badge)
<script>
<script src="https://claudskills.com/embed/app-monitoring---ops.js" async></script>
<iframe>
<iframe src="https://claudskills.com/embed/app-monitoring---ops.html" width="100%" height="160" frameborder="0" loading="lazy" title="ClaudSkills: incident-response"></iframe>
More General skills
Browse all General skills in the ClaudSkills registry, or explore these other picks from the same category:
Part of Acreator Store — Adam Lankamer's AI tools:
PerfectStudio ·
Ucaption ·
UTagger ·
AutoXPoster ·
TestYourSkills ·
AutomationFlows ·
Au Naturel