Transcribe audio / video to SRT / WebVTT / JSON / plain text via OpenAI Whisper. Auto-detects language or accepts --lang ISO-639-1 hint. ~$0.006/min.
Manage API keys for the runner's --execute layer. CRUD on ~/.skills.env (chmod 600): list / add / update / remove / enable / disable gate flags / verify (ping vendor APIs) /…
Read-only story-bible auditor for fiction series with a documented canon. Cross-references character / artifact / location mentions in chapters against the bible and flags drift.
Write viral social media content using the project's proven methodology — hooks, numbered points, NLP questions, CTA. Wraps `writer` (clean-prose engine).
Orchestrator skill — turns a topic or research brief into an N-slide Instagram / LinkedIn / TikTok carousel with consistent visual style and ready-to-post captions.
Write prompts for 20+ frontier AI video generators (Veo 3.1 + native audio, Sora 2 + cameos, Kling 3.0 / Elements, Runway Gen-4 / Aleph V2V / Act-One, Luma Ray 3 / Modify, Pika…
Meme-format graphic generator — top text + bottom text + optional centerpiece image. Wraps image-prompt --execute with a meme aesthetic anchor (Impact-style bold typography with…
Burn captions / subtitles onto an existing video via ffmpeg. Supports SRT, WebVTT, and plain-text subtitle sources.
Image upscaling utility — single image in, upscaled image out (2× / 4× / 8×). Wraps Replicate-hosted upscalers (Real-ESRGAN by default; alternatives: GFPGAN for faces, SwinIR,…
Orchestrator skill — turns a topic / research brief / script into a 9:16 vertical reel: 1-4 video shots + matched background music + ffmpeg-stitched final.mp4 with optional…
Fiction-prose rewrite + style pass. Wraps `writer`; adds voice vector (Pelevin/Manson — not impersonation), no-business-editing rules, artistic rewrite over compression, 5-trigger…
Write or edit non-fiction prose — essays, popular-science chapters, longreads. Wraps `writer`; adds source-backed claims, Manson-style ironic coda, mechanism over surface, sparing…
Read-only pre-commit lint over writer/prose-edit/essay-write. Takes a staged diff (or a specified file / commit range) and flags neuro-slop, synthetic structures, voice drift in…
Rewrite text in a different register without changing meaning — formal↔casual, business↔academic, technical↔friendly, plain-explainer. 6 named registers + transformation deltas.
Orchestrator skill — turns event details (title / date / location / CTA) plus an optional photo into a poster/flyer/social-event-graphic with embedded text in a chosen visual…
Write marketing copy — landing page sections (hero/features/pricing/FAQ), SEO meta (title+description+OG+Twitter), ad copy (Google/Facebook/LinkedIn/X).
Quote card / aphorism graphic generator — bold short text + attribution + minimal visual. Output: text-dominant composition where typography IS the image (1080×1080 square,…
Orchestrator skill — turns cover metadata (title / creator / subtitle / medium) into an album / book / podcast / report / deck / magazine cover.
Produce a structured research brief on any topic — TL;DR, key facts with citations, notable quotes, suggested angles, open questions.
Manage the local style library — bundled styles (24 carousel + 12 video director + 12 music genre) plus user overrides at ~/.claude/style-library/<modality>/<id>.md.
Orchestrator skill — turns a user photo into N profile-pic / headshot / avatar variants in a consistent style.
Text-to-speech skill — script in, MP3 out. Wraps the runner's audio modality (ElevenLabs eleven-tts + OpenAI gpt-4o-mini-tts).
Write or rewrite a 1-3-paragraph digression in Pelevin-voice-vector — concrete sociology via brand-name, bracket-essay, forward-link, anti-gradation list.
Brand mark / wordmark / logo generator. Defaults to ideogram-3-quality for cleanest embedded text (text-strong fallback: gpt-image-2).
Short looping GIF utility. Two modes: (A) convert an existing MP4 to GIF with 2-pass palette optimization; (B) generate a 1-3 second clip via a video provider (Veo / Sora / Kling…
Apply an artistic style to an existing image. Default provider Flux Kontext (best for natural-language style transfer).
Write engineer-facing design documents — RFCs, ADRs (Architecture Decision Records), Tech Specs, Design Docs.
Banner-ad / display-creative generator with standard-size presets — Google Display (728×90 leaderboard, 300×250 medium rectangle, 320×100 large mobile banner, 160×600 wide skyscraper),…
Write prompts for 10+ frontier AI music generators (Suno v5.5, Udio v4, Google Lyria 3 Pro, ElevenLabs Music, Stable Audio 2.5, MusicGen, Tencent SongGeneration, Sonauto v2,…
Mix a music / audio track onto an existing video via ffmpeg. Three modes: replace (drop original audio), overlay (mix both audible), duck (sidechain-compressor lowers music when…
Orchestrator skill — produces YouTube / blog / podcast-episode thumbnails. 16:9 default (1280×720 standard, 1920×1080 high-res).
Remove backgrounds from images using FAL.ai's BiRefNet model. Use when users ask to remove background, make transparent PNG, extract subject from image, or create cutouts.
Use when bilingual Markdown siblings such as README.md and README.ja.md must stay synchronized.
Document creation, format conversion (ODT/DOCX/PDF), mail merge, and automation with LibreOffice Writer.