Audio Podcast — Content Claude Skills (Page 4 of 9)

deepgram-hello-world

Create a minimal working Deepgram transcription example. Use when starting a new Deepgram integration, testing your setup, or learning basic Deepgram API patterns.

deepgram-migration-deep-dive

Deep dive into migrating to Deepgram from other transcription providers. Use when migrating from AWS Transcribe, Google Cloud STT, Azure Speech, OpenAI Whisper, AssemblyAI, or…

deepgram-nova-stt-pipeline

Real-time speech-to-text using Deepgram Nova-2 API with streaming WebSocket connections. Supports diarization, punctuation, and language detection via the Deepgram Python SDK for…

deepgram-performance-tuning

Optimize Deepgram API performance for faster transcription and lower latency. Use when improving transcription speed, reducing latency, or optimizing audio processing pipelines.

deepgram-realtime-transcription-connector

Streams live audio to Deepgram's WebSocket API at wss://api.deepgram.com/v1/listen for real-time speech-to-text.

deepgram-transcribe

Transcribe audio via Deepgram Nova-3 API (5.26% WER, 40x faster than Whisper, built-in speaker diarization).

deepgram-webhooks

Receive and verify Deepgram webhooks (callbacks). Use when setting up Deepgram webhook handlers, processing transcription callbacks, or handling asynchronous transcription results.

deepgram-webhooks-events

Implement Deepgram callback and webhook handling for async transcription. Use when implementing callback URLs, processing async transcription results, or handling Deepgram event…

delive-transcript-analyzer

Analyze, summarize, and extract insights from DeLive transcription sessions. Use when: user mentions DeLive, transcription, meeting transcripts, live captions, audio…

demoscene-coding

Specialist in creating size-optimized real-time audio-visual demos and procedural art. Use when "demoscene, size coding, 64k intro, 4k intro, 1k intro, tiny code, shader golf,…

demucs-music-source-separation-for-vocal-and-stem-extraction

Demucs is Meta's open-source music source separation project for splitting songs into stems such as vocals, drums, bass, and accompaniment.

deviation-score-computation-and-interpretation

Use when you have filtered ATAC-seq peak counts, matched motifs to those peaks, and want to measure which transcription factor motifs show elevated or reduced accessibility…

douyin-transcribe

Extract audio from Douyin (抖音/TikTok China) videos and transcribe to text using Whisper. Trigger when user sends a Douyin link (v.douyin.com or www.douyin.com/video/) and asks for…

edge-tts-english

Generate high-quality English (and multilingual) audio using Microsoft Edge TTS. Use when the user asks to "speak this", "pronounce", "read aloud", "say this in English",…

elevenlabs-music-generation

Generate full songs and instrumental tracks with ElevenLabs Music on RunComfy via the `runcomfy` CLI.

elevenlabs-webhooks

Receive and verify ElevenLabs webhooks. Use when setting up ElevenLabs webhook handlers, debugging signature verification, or handling call transcription events.

elevenlabs-webhooks-events

Implement ElevenLabs webhook HMAC signature verification and event handling. Use when setting up webhook endpoints for transcription completion, call recording, or agent…

elna

Elna Company MPN encoding patterns, suffix decoding, and handler guidance. Use when working with Elna audio-grade aluminum electrolytic capacitors and supercapacitors.

eu-china-trade-risk-dashboard

Building an EU-China trade risk dashboard for companies: data sources (Eurostat, BMWK, BAFA), dependency indicators, sector risk scores, de-risking progress KPIs,…

extract-bilibili-video

Extract evidence from public Bilibili videos for accurate summaries, analysis, research, or downstream article planning.

ezviz-audio-broadcast

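EZVIZ voice broadcast skill. Supports uploading a local audio file or using text-to-speech to deliver voice content to a device for playback. Use when: you need to send voice notifications, broadcasts, reminders, or other audio content to EZVIZ devices. ⚠️ Security requirement: the EZVIZ_APP_KEY and EZVIZ_APP_SECRET environment variables must be set, using least-privilege credentials. (from ndesv21/openclaw-master-skills)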

face-swap

Swap a face / character into video or images on RunComfy via the `runcomfy` CLI. Routes across community Wan 2-2 Animate (audio-driven character animation + identity swap — from…

faster-whisper

Local speech-to-text using faster-whisper. 4-6x faster than OpenAI Whisper with identical accuracy; GPU acceleration enables ~20x realtime transcription.

faster-whisper-high-performance-speech-transcription

faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2 that delivers up to 4x faster transcription with lower memory usage.

faster-whisper-high-performance-speech-transcription-library

faster-whisper is SYSTRAN’s high-performance reimplementation of OpenAI Whisper on top of CTranslate2.

ffmpeg-animation-timing-reference

Definitive reference for FFmpeg and ASS/SSA animation timing units, optimal durations, and best practices.

ffmpeg-audio-normalization-pipeline

Normalizes audio loudness to broadcast standards using FFmpeg loudnorm filter with EBU R128 two-pass analysis.

ffmpeg-audio-processing

Complete audio encoding and normalization system. PROACTIVELY activate for: (1) Audio codec selection (AAC, MP3, Opus, FLAC), (2) Loudness normalization (EBU R128, loudnorm), (3)…

ffmpeg-audio-transcoder

Transcodes and processes audio files using the FFmpeg CLI and libavcodec library. Supports batch format conversion, loudness normalization via EBU R128, and metadata extraction…

ffmpeg-captions-subtitles

Complete subtitle and caption system for FFmpeg 7.1 LTS and 8.0.1 (latest stable, released 2025-11-20).

fieldy-analysis

Analyze Field Labs coaching transcription data, calculate session metrics, and generate daily summaries.

fireflies-migration-deep-dive

Migrate to Fireflies.ai from other meeting transcription platforms or legacy recording systems. Use when switching from Otter.ai, Rev, or custom transcription to Fireflies, or…

fitts-law

Apply Fitts's Law to size and position interactive targets for fast, accurate interaction.

bio-atac-seq-footprinting

Detect transcription factor binding sites through footprinting analysis in ATAC-seq data using TOBIAS.

french-meeting-minutes

Generates structured meeting minutes in French from a raw transcript or an audio file.

full-stack-bootstrap

One-time bootstrap for Kokoro TTS engine, Telegram bot, and BotFather setup. TRIGGERS - setup tts, install kokoro, botfather, bootstrap tts-tg-sync, configure telegram bot, full…

Game Audio Engineer

Interactive audio specialist. Masters FMOD/Wwise integration, adaptive music systems, spatial audio, and audio performance budgeting across all game engines.

mastering-gcloud-commands

Expert-level Google Cloud CLI (gcloud) skill for managing GCP resources. Use when working with "gcloud commands", "cloud run deploy", "alloydb", "cloud sql", "workload identity…

gemini-audio-tts-music

Generate music (Lyria 3) or synthesize speech (Gemini TTS, single or multi-speaker). Use for soundtracks, voiceovers, demo narration, notification sounds, or audio branding.

mastering-gemini-cli

Build headless automation and agentic workflows with Google's Gemini CLI. Covers approval modes (default, auto_edit, yolo), file permission model, Edit vs WriteFile tool…

gemini-tts-cli

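Usage guide for the Gemini TTS command-line tool, covering single-sentence and batch text-to-speech, listing voices, merging WAV files, stdout output, API key setup, caching, and concurrency. Use when the user asks about gemini-tts, Gemini TTS CLI, list-voices, merge, GEMINI_API_KEY, text-to-speech, or related parameters.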

gemini-video

Invoke Google Gemini for video understanding and analysis using the Python google-genai SDK. Supports gemini-3-pro-preview and gemini-2.5-flash for video analysis, transcription,…

generate-tts-audio

Converts the scripts in narration-scripts.json to MP3 files with edge-tts, measures playback duration with mutagen, and updates durations.json. When to use: called when TTSAgent generates the narration audio file for each slide.

genomic-region-annotation-integration

Use after bias-correcting ATAC-seq cutsite signal (via ATACorrect), when you have a bias-corrected bigWig file and need to compute per-position footprint scores within defined…

gettr-transcribe

Download audio from a GETTR post or streaming page and transcribe it locally with MLX Whisper on Apple Silicon (with timestamps via VTT).

gladia-audio-intelligence

Configure and use Gladia audio intelligence features: speaker diarization, translation, sentiment analysis, named entity recognition (NER), PII redaction, subtitles (SRT/VTT),…

glsl-shader

Create audio-reactive GLSL visualizers for Bice-Box. Provides templates, audio uniforms (iRMSOutput, iRMSInput, iAudioTexture), coordinate patterns, and common shader functions.

godot-setup-audio-buses

Use when configuring audio bus hierarchies in Godot, setting up AudioStreamPlayer pooling for performance, implementing 3D spatial audio, adding audio effect chains like reverb…

granola-common-errors

Troubleshoot common Granola errors — audio capture failures, transcription issues, calendar sync problems, and integration errors. Platform-specific fixes for macOS and Windows.

granola-incident-runbook

Incident response procedures for Granola meeting capture failures and outages. Use when meetings aren't recording, transcription fails mid-meeting, integrations stop syncing, or…

granola-performance-tuning

Optimize Granola transcription accuracy, note quality, and processing speed. Use when improving transcription quality, reducing processing time, optimizing templates for better AI…

groq-core-workflow-b

Execute Groq secondary workflows: audio transcription (Whisper), vision, text-to-speech, and batch model evaluation.

haus-literaturrecherche-leitfaden

Guide to legal literature research: commentaries, textbooks, journal articles, and case law via dejure.org / openjur.de. Review checklist for cross-cutting and specialist topics.

higgsfield-audio

Use when the user asks about audio in Higgsfield videos, needs to add dialogue or lip-sync, wants sound effects or ambient sound in generated video, asks about music or BGM in…

howlerjs-cross-browser-javascript-audio-library

Howler.js is a JavaScript audio library for the modern web that defaults to the Web Audio API with an HTML5 Audio fallback.

hyperframes

Create video compositions, animations, title cards, overlays, captions, voiceovers, audio-reactive visuals, and scene transitions in HyperFrames HTML.
