Voice agents represent the frontier of AI interaction - humans speaking naturally with AI systems. The challenge isn't just speech recognition and synthesis, it's achieving…
AI voice agent for handling incoming calls, appointment scheduling, lead qualification, and 24/7 customer service without human intervention.
Expert in building voice AI applications - from real-time voice agents to voice-enabled apps. Covers OpenAI Realtime API, Vapi for voice agents, Deepgram for transcription,…
Build real-time conversational AI voice engines using async worker pipelines, streaming transcription, LLM agents, and TTS synthesis with interrupt handling and multi-provider…
High-quality voice synthesis with 9 personas, 11 languages, streaming, and voice cloning using Voice.ai API.
Reverse-engineer voice profiles from sample content by analyzing writing patterns
Voice and tone guidelines for technical documentation. Ensures consistent, clear, and human writing across all documentation.
Use when shaping brand voice — voice attributes, tone-by-context matrix, consistency review. Triggers on 'define our voice', 'why does our copy sound different on every surface'.
Applies a voice profile to transform content. Use when user asks to write in a specific voice, match a tone, apply a style, or transform content to sound like a particular voice…
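Voice-dictation-driven article writing workflow. The user supplies material by sending voice recordings; the AI transcribes, organizes, and iterates over multiple rounds, then outputs a structured Markdown article. Supports screenshot-annotation review (red = delete, yellow = modify) and cover image generation. Suited to conversational agents such as OpenClaw…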
Full voice-to-voice interaction: transcribe user speech, process request, and respond with synthesized speech
Expert in voice synthesis, TTS, voice cloning, podcast production, speech processing, and voice UI design via ElevenLabs integration.
Learns a user's personal writing and speaking style through interactive writing prompts, then generates a reusable voice profile.
Clone a voice locally with Qwen3-TTS Base using reference audio and the corresponding transcript.
Designs and attaches voices to characters on the filmmaking canvas via the local generate_voice.js CLI, following canvas-aware conventions for voice description, sample text, and…
Generate custom voice profiles from natural language descriptions by mapping tone, formality, and domain to voice dimensions
Select and create the perfect AI voice for your content using ElevenLabs, Qwen3-TTS, and other platforms—matching voice characteristics to brand personality and audience.
Enforce the Orchestrator Voice Constitution during text generation. Provides voice constraints, anti-pattern awareness, and scoring guidance.
Extract and document someone's authentic writing voice from samples. Use when someone needs a "voice guide," wants to capture their writing DNA, or needs to train AI to write in…
Run the mechanical voice-fingerprint scanner across all chapters to identify outliers in sentence rhythm, vocabulary domain balance, dialogue ratio, abstract noun density, and…
Line-editor pass on any content draft against a client voice guide. Flags cadence issues, signature-move overuse, voice-guide violations, receipt gaps — things a rule-based gate…
Clean up and structure rough Japanese voice-dictated text into a clear, actionable prompt — strip fillers, fix homophone/STT errors, normalize punctuation, restate intent for…
Learn from manual edits to improve voice profile. Compares pre-review, post-review, and edited text.
Scale your brand voice across multiple languages using AI voice synthesis, maintaining consistent character and quality for global content.
Apply the user's voice when writing or editing content. Activate when the user says "use the voice-master skill", "write this in my voice", "make this sound like me", or any…
Use when writing in a creator's voice, analyzing a creator's style, or maintaining consistency across content.
Sync, transcribe, and intelligently organize voice memos, audio/video files, and URLs.
Build and evolve your personal writing voice model. Captures word choices, sentence patterns, formality levels, and anti-patterns so AI output is indistinguishable from your own…
Ingest a voice note with exact-phrasing preservation (never paraphrased). Routes content to originals/, concepts/, people/, companies/, ideas/, personal/, or voice-notes/ based on…
Voice of Customer program - multi-source signal collection, theming and prioritization, evidence-to-action routing, and product / CS / marketing feedback loop closure.
Aggregate customer feedback from multiple sources — support tickets, NPS comments, Slack messages, G2 reviews, call transcripts, survey responses — into a unified VoC report with…
AI voice-over and dubbing assistant: script, tone guide, character voice, pace marking, and recording brief for animation, advertising, documentary, e-learning, and audiobooks
Route to Mars (introspective thought partner / demo showman voice persona). Used when the operator wants depth, meaning, or impressive social demos rather than logistics.
Route to Venus (sharp executive-assistant voice persona). Used for logistics — calendar, tasks, recent messages, brain lookups — at sub-second phone-call latency.
Post-call handling for a voice session — turn the transcript into a brain page, post the summary to the operator's messaging surface, archive the audio.
Audition Kokoro TTS voices to compare quality and grade. TRIGGERS - audition voices, kokoro voices, voice comparison, tts voice, voice quality, compare voices.
Stores a static voice profile and 2-3 past writing samples so long-form-writing, case-study, column-editorial, thought-leadership, interview-storytelling, and ai-slop-reviewer can…
Transform verbose voice input into structured, token-efficient Claude prompts. Use when cleaning up voice memos, dictation output, or speech-to-text transcriptions that contain…
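Voice reply mode. Use /voiceMode to toggle voice reply mode. When enabled, all replies are automatically converted to voice messages; when disabled, text replies resume. Supports sending voice messages over channels such as Telegram and iMessage.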
Convert written content to voice/podcast scripts — transform note articles, X threads, and blog posts into natural spoken-word scripts for stand.fm, Voicy, podcasts, and YouTube…
Complete voice configuration in chat - PTT key, microphone permissions, ElevenLabs TTS, and troubleshooting
Voice, style, and register in writing across all forms. Covers voice (authorial presence, authenticity, persona, tone), style as choice (diction, syntax, rhythm, density,…
Prepare audio-optimized content for text-to-speech rendering. Generate recording scripts, pronunciation guides, and pacing-marked text for podcasts, video voiceovers, and audio…
Use when working with Quetrex's voice interface, OpenAI Realtime API, WebRTC, or echo cancellation. Knows Quetrex's specific voice architecture decisions and patterns.
MailChimp Voice & Tone framework for UX copy, microcopy, error messages, and interface text — calibrates tone to user emotional state and context.
Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).
Self-evolving voice assistant UI. Talk to your AI, ask it to improve itself, and watch the code update in real-time.
Speak responses aloud on macOS using the built-in `say` command when user input indicates Voice Wake/voice recognition (for example, messages starting with "User talked via voice…
Sync and access voice notes from Voicenotes.com. Use when the user wants to retrieve their voice recordings, transcripts, and AI summaries from Voicenotes.
Install and use the voiceover-pipeline CLI to generate voiceovers from Markdown scripts, TTS audio, Whisper timings, SRT, manifest.json, and Remotion-ready artifacts.
Image generation workflow on Volcengine AI services. Use when users need text-to-image, style variants, prompt refinement, or deterministic image generation parameters and…
Generate or validate Volcengine Ark image requests for text-to-image and single-reference image workflows.
Generate video with the volcengine video_generate.py script. Requires a filename and a prompt; optionally accepts a first-frame image (URL or local path).
Perform offline speech recognition across 20+ languages with Vosk. Provides compact models, zero-latency streaming transcription, and bindings for Python, Node.js, Java, C#, and…
Guide users through creating Agent Skills for Claude Code. Use when the user wants to create, write, author, or design a new Skill, or needs help with SKILL.md files, from…
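Use when bootstrapping a reusable VRChat UdonSharp module with a Builder First approach. Use when you need separation of Runtime and Editor responsibilities, project-owned documentation, a structure that assumes scene generation, and a basic skeleton that can be carried over to external projects. Distribution of the document templates themselves…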
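Use when assembling, in distributable form, a project-owned AGENTS.md, README, SYSTEM_SPEC, SCENE_OBJECT_SPEC, ENVIRONMENT_SETUP, and OFFICIAL_REFERENCE_MAP for an external VRChat UdonSharp project. Documentation groundwork before implementation, a document set for reproducing the environment, U#…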
Use when working with CoplayDev's unity-mcp in a VRChat UdonSharp project that targets Unity 2022.3.22f1. When you want to read or assist with Scene / Prefab / Builder / Editor state from Codex, Unity MCP not as a Runtime dependency but as an Editor…
Provides Visual Studio Code keyboard shortcuts and commands for efficient code editing. Use when looking up VS Code keybindings, explaining editor shortcuts, or configuring…
Guidelines for writing and reviewing Insiders and Stable release notes for Visual Studio Code.