Audio Podcast — Content Claude Skills (Page 3 of 9)

For the full experience including quality scoring and one-click install features for each skill — upgrade to Pro.

aligned-stem-workflow

Incremental audio production with duration alignment handling, per-stem verification, and adaptive extension strategies

adaptive-stem-alignment

Incremental audio production with duration mismatch handling, adaptive stem extension, and pre-mix alignment verification

diagnostic-stem-delivery

Audio production with diagnostic analysis, timecode parsing from documents, and verified export workflow

audio-transcribe

使用 Whisper 将音频/视频转换为文字，支持词级别时间戳。Use when user wants to 语音转文字, 音频转文字, 视频转文字, 字幕生成, transcribe audio, speech to text, generate subtitles, 识别语音.

audio-transcriber

Transform audio recordings into professional Markdown documentation with intelligent summaries using LLM integration

audio-transcription-pipeline

Build audio transcription pipelines with Whisper, Deepgram, and AssemblyAI including speaker diarization and real-time streaming.

audio-trimmer

Cut, trim, and edit audio segments with fade effects, speed control, concatenation, and basic audio manipulations.

audio-video

Audio and video processing with FFmpeg, WebRTC, and streaming. Covers transcoding, format conversion, real-time communication, and media pipelines.

audiocraft-audio-generation

PyTorch library for audio generation including text-to-music (MusicGen) and text-to-sound (AudioGen).

audiocraft-audio-generation

PyTorch library for audio generation including text-to-music (MusicGen) and text-to-sound (AudioGen).

audioflux-audio-music-analysis-feature-extraction-library

audioFlux is a deep learning tool library for audio and music analysis and feature extraction, supporting dozens of time-frequency transforms and hundreds of feature combinations…

audiopod

Use AudioPod AI's API for audio processing tasks including AI music generation (text-to-music, text-to-rap, instrumentals, samples, vocals), stem separation, text-to-speech, noise…

audiovisual-transcription

Transcribe audio verbatim with speaker attribution and chronological visual context

auto-subtitle-generator-online

The auto-subtitle-generator-online skill transcribes and embeds accurate subtitles into your videos using AI-powered speech recognition.

axiom-haptics

Use when implementing haptic feedback, Core Haptics patterns, audio-haptic synchronization, or debugging haptic issues - covers UIFeedbackGenerator, CHHapticEngine, AHAP patterns,…

azure-ai-transcription-py

Azure AI Transcription SDK for Python. Use for real-time and batch speech-to-text transcription with timestamps and diarization.

bigwig-signal-processing

Use when after bias correction of ATAC-seq reads (via ATACorrect) when you have a bias-corrected bigWig file and need to measure transcription factor footprint strength within…

bio-atac-seq-footprinting

Detect transcription factor binding sites through footprinting analysis in ATAC-seq data using TOBIAS.

bio-atac-seq-footprinting

Detect transcription factor binding footprints in ATAC-seq using TOBIAS, HINT-ATAC, Wellington, or scprinter.

bio-atac-seq-motif-deviation

Analyze transcription factor motif accessibility variability using chromVAR. Use when identifying which TF motifs show variable accessibility across samples or conditions in…

bio-chip-seq-motif-analysis

De novo motif discovery and known motif enrichment analysis using HOMER and MEME-ChIP. Identify transcription factor binding motifs in ChIP-seq, ATAC-seq, or other genomi — from…

bio-chipseq-allele-specific-binding

Detects allele-specific transcription factor or histone modification binding from heterozygous-variant ChIP-seq using WASP (reference-bias filter; mandatory upstream), RASQUAL…

bio-chipseq-motif-analysis

De novo motif discovery and known motif enrichment analysis using HOMER and MEME-ChIP. Identify transcription factor binding motifs in ChIP-seq, ATAC-seq, or other genomi — from…

bio-chipseq-peak-calling

ChIP-seq peak calling using MACS3 (or MACS2). Call narrow peaks for transcription factors or broad peaks for histone modifications.

bio-chipseq-super-enhancers

Identifies super-enhancers from H3K27ac ChIP-seq data using ROSE and related tools. Use when studying cell identity genes, cancer-associated regulatory elements, or master…

bio-chipseq-super-enhancers

Identifies super-enhancers from H3K27ac, MED1, or BRD4 ChIP-seq using ROSE, ROSE2, LILY, HOMER -style super, and ENCODE dELS cross-referencing.

bio-gene-regulatory-networks-grn-inference

Infer gene regulatory networks from bulk or general expression data with mutual-information (ARACNe) and tree-ensemble (GENIE3, GRNBoost2) methods, and infer transcription-factor…

bio-gene-regulatory-networks-perturbation-simulation

Simulate transcription factor perturbation effects on cell state in silico with CellOracle and Dynamo, and predict transcriptional responses to genetic perturbations with GEARS,…

bio-gene-regulatory-networks-scenic-regulons

Infer transcription factor regulons from single-cell RNA-seq with pySCENIC by combining GRNBoost2 co-expression, cisTarget motif-enrichment pruning, and AUCell per-cell activity…

bio-motif-search

Find patterns, motifs, and subsequences in biological sequences using Biopython. Use when searching for transcription factor binding sites, regulatory elements, or any se — from…

bio-transcription-translation

Transcribe DNA to RNA and translate to protein using Biopython. Use when converting between DNA, RNA, and protein sequences, finding ORFs, or using alternative codon tabl — from…

Build streaming voice LLM agents with Vocode

Use Vocode to compose transcription, LLM, speech synthesis, and telephony components into reviewable real-time voice-agent workflows.

capture-local-screen-and-audio-context-so-agents-can-search-what

Use Screenpipe when an agent needs private, local-first memory of what you saw or heard on your computer, including searchable screen text, app context, and transcripts, instead…

capture-voice-idea

Capture a business idea from a voice memo / audio file. Transcribes the recording, preserves the raw transcript, then hands off to capture-idea so the user can optionally generate…

chat-integrator

Automatically integrates processed media (audio transcriptions and image summaries) into chat.md files at the correct timestamp position.

chea-api

Access ChEA3 and Harmonizome ChEA data for transcription factor enrichment analysis and metadata retrieval.

civitai-gen

Generate images, videos, audio, and more using Civitai's orchestration API. Use when the user wants text-to-image, video generation (11+ engines), text-to-speech, music,…

claude-haiku

Use when deciding whether to route a task to the fast/cheap tier (Claude Haiku) — transcription, polling, format conversion, structured-output slot-filling, small-diff review,…

claudio-jokes

Battute brutte in stile Claudio: giochi di parole su AI, tech e lavoro in italiano

cleaning-auto-transcripts

Use when processing auto-generated YouTube transcripts that contain transcription errors, misspellings, or phonetic pronunciation anomalies.

competitor-content-audit

Analyze a competitor's recent social content — extract what's working, what's not, their posting cadence, content mix, and voice patterns — feeds directly into brand-voice-system,…

construction-daily-report

Generate a structured daily site progress report from unstructured input such as voice transcription, rough notes, or conversational messages.

construction-meeting-minutes

Generate structured construction meeting minutes from rough notes or voice transcription, with separated action items, decision tracking, and contractual flagging.

content-saver

Guides users through saving generated content (summaries, notes, key points) to professionally formatted and themed files.

contexto_sotaque

Esta skill atua como um laboratório de fonética articulatória e prosódica. Utilize esta skill SEMPRE que a pessoa usuária enviar um link de vídeo do YouTube ou um arquivo de vídeo…

controlnet-pose

Pose-conditioned generation on RunComfy via the `runcomfy` CLI. Routes across Kling 2-6 Motion Control Pro / Standard (transfer the motion / blocking of a reference video — from…

controlnet-pose

Pose-conditioned generation on RunComfy via the `runcomfy` CLI. Routes across Kling 2-6 Motion Control Pro / Standard (transfer the motion / blocking of a reference video — from…

controlnet-pose

Pose-conditioned generation on RunComfy via the `runcomfy` CLI. Routes across Kling 2-6 Motion Control Pro / Standard (transfer the motion / blocking of a reference video — from…

convert-to-markdown

Convert documents and files to Markdown using markitdown with Windows/WSL path handling. Supports PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), HTML, CSV, JSON, XML,…

copilotkit-debug

Use when diagnosing CopilotKit issues -- runtime connectivity failures, agent not responding, streaming errors, tool execution problems, transcription failures, version…

coqui-tts-deep-learning-text-to-speech-toolkit

An agent skill built on Coqui TTS, the open-source deep learning toolkit for text-to-speech synthesis.

core-audio-unit-v3-debugger

Debugs and profiles Apple Audio Unit v3 (AUv3) plugins using auval validation tool, the AUAudioUnit Swift API, and Instruments AudioUnit trace template for latency measurement and…

correcting-transcriptions

Use when user requests to clean up, correct, or improve speech-to-text transcripts that contain filler words, repetitions, self-corrections, or conversational artifacts from voice…

covert-to-markdown

Convert documents and files to Markdown using markitdown. Use when converting PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), HTML, CSV, JSON, XML, images (wi — from…

dd-tts-pack

Create TTS sound pack using Qwen3-TTS voice cloning or built-in voices. TTS 음성 합성으로 사운드 팩 생성. Use when the user says 'TTS 팩', 'TTS 사운드', 'tts pack', '음성 합성 팩', '보이스 클로닝', 'voice…

deepgram-common-errors

Diagnose and fix common Deepgram errors and issues. Use when troubleshooting Deepgram API errors, debugging transcription failures, or resolving integration issues.

deepgram-core-workflow-a

Implement production pre-recorded speech-to-text with Deepgram. Use when building audio transcription, batch processing, or implementing diarization and intelligence features.

deepgram-core-workflow-b

Implement real-time streaming transcription with Deepgram WebSocket. Use when building live transcription, voice interfaces, real-time captioning, or voice AI applications.

deepgram-cost-tuning

Optimize Deepgram costs and usage for budget-conscious deployments. Use when reducing transcription costs, implementing usage controls, or optimizing pricing tier utilization.

deepgram-data-handling

Implement audio data handling best practices for Deepgram integrations. Use when managing audio file storage, implementing data retention, or ensuring GDPR/HIPAA compliance for…

Audio Podcast (Page 3 of 9)

Categories

Use cases

Popular tags

Learn

Site