You are the audio architecture expert ensuring Leavn's complex audio pipeline stays coherent.
You are the audio fingerprinting and pattern detection specialist for Modcaster's content analysis.
Identifies audio content using Chromaprint/AcoustID fingerprinting, Shazam API recognition, and ACRCloud monitoring.
통합 오디오 생성 스킬. ElevenLabs MCP 기반 TTS(32개국어), 보이스 클로닝(1분 샘플), 다국어 더빙(립싱크), 효과음 생성을 지원. "목소리 생성", "TTS", "음성 합성", "보이스 클로닝", "더빙", "나레이션", "효과음", "AI 음성" 요청 시 사용.
Use whenever the user asks to install, configure, uninstall, snooze, mute, test, troubleshoot, or change settings for the claude-code-audio-hooks audio notification system.
Test Bob The Skull with virtual audio injection instead of speaking. Use when testing wake word detection, STT accuracy, full conversation pipeline, or automated testing.
Audio generation skill — jingles, beds, voiceover, and sound effects. Routes music requests to Suno V5 / Udio / Lyria, speech to MiniMax TTS / FishAudio / ElevenLabs V3, and SFX…
Create memorable sonic logos using design principles from Intel, Netflix, and McDonald's—crafting 2-5 second audio signatures that achieve instant brand recognition.
You are the on-device audio ML specialist for Modcaster's AI-driven audio processing.
Use when writing songs, generating music or sound with AI, preparing Suno/HeartMuLa prompts, or analyzing audio features and spectrograms.
Use when asked to normalize audio volume, match loudness, or apply peak/RMS normalization to audio files.
Audio playback using Tone.js including players, transport, scheduling, and loading audio. Use when implementing background music, sound effects, audio synchronization, or timed…
Audio ingestion, analysis, transformation, and generation (Transcribe, TTS, VAD, Features).
Converts and processes audio files using ffmpeg. Supports format conversion, sample rate changes, mono/stereo conversion, and segment splitting.
Professional audio production for music, podcasts, and sound design. Use when working with audio recording, mixing, mastering, or sound design for any medium.
Analyze audio recording quality - echo detection, loudness, speech intelligibility, SNR, spectral analysis.
Analyze the WaveCap-SDR audio stream to assess tuning quality, detect silence, noise, proper audio, or distortion.
Binding audio analysis data to visual parameters including smoothing, beat detection responses, and frequency-to-visual mappings.
Generate audio replies using TTS. Trigger with "read it to me [URL]" to fetch and read content aloud, or "talk to me [topic]" to generate a spoken response.
Router for audio domain including playback, analysis, and audio-reactive visuals. Use when implementing any audio functionality including music, sound effects, visualizers, or…
Separates audio tracks into individual stems (vocals, drums, bass, other) using Meta's Demucs neural network model via the demucs Python package.
팟캐스트 대본작가(scriptwriter)와 쇼노트편집자(shownote-editor)가 사용하는 오디오 스토리텔링 전문 스킬. 귀로만 듣는 매체에서 청취자의 몰입을 극대화하는 서사 구조, 페이싱, 사운드 연출 방법론을 제공한다.
Implements audio systems including sound management, music systems, positional audio, and audio effects.
Game audio systems, music, spatial audio, sound effects, and voice implementation. Build immersive audio experiences with professional middleware integration.
Turn creator audio into clean text captions for ecommerce content and reuse. Use when teams need fast transcript-to-caption workflows.
End-to-end audio production workflow with stems, effects, archiving, and verification
Step-by-step audio production with per-stem verification, timing alignment, and incremental quality gates
Incremental audio production with duration alignment handling, per-stem verification, and adaptive extension strategies
Incremental audio production with duration mismatch handling, adaptive stem extension, and pre-mix alignment verification
Audio production with diagnostic analysis, timecode parsing from documents, and verified export workflow
使用 Whisper 将音频/视频转换为文字,支持词级别时间戳。Use when user wants to 语音转文字, 音频转文字, 视频转文字, 字幕生成, transcribe audio, speech to text, generate subtitles, 识别语音.
Transform audio recordings into professional Markdown documentation with intelligent summaries using LLM integration
Build audio transcription pipelines with Whisper, Deepgram, and AssemblyAI including speaker diarization and real-time streaming.
Cut, trim, and edit audio segments with fade effects, speed control, concatenation, and basic audio manipulations.
Audio and video processing with FFmpeg, WebRTC, and streaming. Covers transcoding, format conversion, real-time communication, and media pipelines.
Audio and video processing with FFmpeg, transcoding, and streaming. Process media files, generate thumbnails, and build streaming solutions.
PyTorch library for audio generation including text-to-music (MusicGen) and text-to-sound (AudioGen).
audioFlux is a deep learning tool library for audio and music analysis and feature extraction, supporting dozens of time-frequency transforms and hundreds of feature combinations…
Use AudioPod AI's API for audio processing tasks including AI music generation (text-to-music, text-to-rap, instrumentals, samples, vocals), stem separation, text-to-speech, noise…
Transcribe audio verbatim with speaker attribution and chronological visual context
Migrating Aura components to LWC: feature mapping, interoperability wrappers, event translation, navigation patterns, and Aura-LWC coexistence strategies.
Rules for writing Playwright *.spec.ts files that don't flake — selector hygiene, auto-wait discipline, idempotent fixtures, readable trace output.
Scaffold or draft new Teku documentation to editorial standards. Use when creating a new page, writing a first draft, or helping a contributor structure content correctly.
Create author profiles via questionnaire or transcript analysis for consistent article voice
Implements automated data deletion workflows for GDPR Article 17 right to erasure and retention period expiry.
Auto-Editor is a command-line application that automatically edits video and audio by analyzing loudness, motion, and other signals to cut dead space.
Automatic internal link insertion — scan content for opportunities to link to your other articles, improving SEO and reader retention.
AI-powered security blog automation system (identical to github.com/rebugui/intelligence-agent). Collects news from Google News, arXiv, HackerNews → generates blog posts with…
The auto-subtitle-generator-online skill transcribes and embeds accurate subtitles into your videos using AI-powered speech recognition.
AI-powered video generator using XLXAI Sora2 API. Create professional videos from text prompts or images in seconds.
Auto Video Editor — analyseert long-form video transcripts en stelt overlay/effect plannen voor met Sempertex Europe branding.
Manages GDPR Article 22 rights related to solely automated decision-making and profiling, including identification of automated decisions, meaningful human oversight…
Composes week-over-week automation coverage narratives. Use when /report:automation-coverage is running.
Search, filter, and retrieve Claude/Codex history indexed by the automem CLI. Use when the user wants to index history, run lexical/semantic/hybrid search, fetch full transcripts,…
Programmatically apply UGC (User-Generated Content) to characters including clothing, accessories, and avatar customization.
Create media playback experiences using AVKit. Use when adding video players with AVPlayerViewController, enabling Picture-in-Picture, routing media with AirPlay, using SwiftUI…
Write compelling award submissions, grant applications, and competition entries. Maps achievements to selection criteria using evidence-based narratives.
14 skill opinionated per generare diagrammi e visualizzazioni direttamente in Markdown, senza strumenti esterni.
Build and maintain a dense semantic knowledge graph about search and IR inside an Obsidian vault. Triggers on article URLs, article titles or names (from Clippings/,…
公众号素材|业务资料库|预设包|.aws 预设包|主题包|品牌包|aiworkskills.cn — 用户业务资料库与预设包管理:业务资料按产品名组织在 `.aws-article/products/{产品名}/`(介绍 .md 直挂产品根 + 配图归 `images/` 子目录含同名说明 .md),AI…