Migrating Visualforce pages and components to Lightning Web Components: controller-to-Apex-method translation, viewstate replacement, custom URL parameter handling,…
Assistant for Viviana Poveda's psychology blog. Helps with Supabase implementation, database queries, post management, and admin panel development.
vLLM non-chat inference surfaces — text embeddings (`/v1/embeddings`, `/v2/embed`), reranking/scoring (`/rerank`, `/score`), speech-to-text (`/v1/audio/transcriptions`,…
vLLM-Omni output-side multimodal generation — image (FLUX.1/2, Qwen-Image, GLM-Image, BAGEL, SD3.5, HunyuanImage-3.0), video (Wan2.1/2.2, LTX-2, HunyuanVideo-1.5), TTS (Qwen3-TTS,…
Use when the user has recorded vocals and wants a processing chain set up. Examples - "set up my vocal chain", "process this vocal", "make the vocal sit in the mix", "give me a…
Create detailed vocal specifications including style, register, techniques, emotional delivery, and influence references for AI music generation
Building conversational voice agents with STT, TTS, and dialogue management. Triggers on "voice agent", "agent vocal", "téléphone IA", "voice bot", "STT", "TTS",…
Voice agents represent the frontier of AI interaction - humans speaking naturally with AI systems. The challenge isn't just speech recognition and synthesis, it's achieving…
Expert in building voice AI applications - from real-time voice agents to voice-enabled apps. Covers OpenAI Realtime API, Vapi for voice agents, Deepgram for transcription,…
Build real-time conversational AI voice engines using async worker pipelines, streaming transcription, LLM agents, and TTS synthesis with interrupt handling and multi-provider…
Reverse-engineer voice profiles from sample content by analyzing writing patterns
Applies a voice profile to transform content. Use when user asks to write in a specific voice, match a tone, apply a style, or transform content to sound like a particular voice…
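Voice-dictation-driven article writing workflow. The user supplies material by sending voice recordings; the AI transcribes, organizes, and iterates over multiple rounds, then outputs a structured Markdown article. Supports screenshot-annotation review (red = delete, yellow = modify) and cover image generation. Suited to conversational agents such as OpenClaw…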
Full voice-to-voice interaction: transcribe user speech, process request, and respond with synthesized speech
Learns a user's personal writing and speaking style through interactive writing prompts, then generates a reusable voice profile.
Clone a voice locally with Qwen3-TTS Base using reference audio and the corresponding transcript.
Generate custom voice profiles from natural language descriptions by mapping tone, formality, and domain to voice dimensions
Enforce the Orchestrator Voice Constitution during text generation. Provides voice constraints, anti-pattern awareness, and scoring guidance.
Extract and document someone's authentic writing voice from samples. Use when someone needs a "voice guide," wants to capture their writing DNA, or needs to train AI to write in…
Build and evolve your personal writing voice model. Captures word choices, sentence patterns, formality levels, and anti-patterns so AI output is indistinguishable from your own…
Aggregate customer feedback from multiple sources — support tickets, NPS comments, Slack messages, G2 reviews, call transcripts, survey responses — into a unified VoC report with…
Audition Kokoro TTS voices to compare quality and grade. TRIGGERS - audition voices, kokoro voices, voice comparison, tts voice, voice quality, compare voices.
Convert written content to voice/podcast scripts — transform note articles, X threads, and blog posts into natural spoken-word scripts for stand.fm, Voicy, podcasts, and YouTube…
Complete voice configuration in chat - PTT key, microphone permissions, ElevenLabs TTS, and troubleshooting
Voice, style, and register in writing across all forms. Covers voice (authorial presence, authenticity, persona, tone), style as choice (diction, syntax, rhythm, density,…
Prepare audio-optimized content for text-to-speech rendering. Generate recording scripts, pronunciation guides, and pacing-marked text for podcasts, video voiceovers, and audio…
Use when working with Quetrex's voice interface, OpenAI Realtime API, WebRTC, or echo cancellation. Knows Quetrex's specific voice architecture decisions and patterns.
Self-evolving voice assistant UI. Talk to your AI, ask it to improve itself, and watch the code update in real-time.
Install and use the voiceover-pipeline CLI to generate voiceover from Markdown scripts, along with TTS audio, Whisper timings, SRT, manifest.json, and Remotion-ready artifacts.
Perform offline speech recognition across 20+ languages with Vosk. Provides compact models, zero-latency streaming transcription, and bindings for Python, Node.js, Java, C#, and…
Guide users through creating Agent Skills for Claude Code. Use when the user wants to create, write, author, or design a new Skill, or needs help with SKILL.md files, frontmatter,…
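Use when bootstrapping a reusable VRChat UdonSharp module with a Builder First approach. Use when you need separation of Runtime and Editor responsibilities, project-owned documentation, a structure that assumes scene generation, and a basic skeleton that can be carried over to external projects. Distribution of the document templates themselves…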
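Use when assembling, in distributable form, a project-owned AGENTS.md, README, SYSTEM_SPEC, SCENE_OBJECT_SPEC, ENVIRONMENT_SETUP, and OFFICIAL_REFERENCE_MAP for an external VRChat UdonSharp project. The pre-implementation documentation foundation, the document set for reproducing the environment, U#…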
Use when working with CoplayDev's unity-mcp in a VRChat UdonSharp project that assumes Unity 2022.3.22f1. For reading and assisting with Scene / Prefab / Builder / Editor state from Codex, with Unity MCP treated not as a Runtime dependency but as an Editor…
Provides Visual Studio Code keyboard shortcuts and commands for efficient code editing. Use when looking up VS Code keybindings, explaining editor shortcuts, or configuring…
Guidelines for writing and reviewing Insiders and Stable release notes for Visual Studio Code.
Wagtail is an open source CMS built on Django for teams that need structured content, flexible page models, and a polished editor experience.
Write a specgraph evidence waiver for a spec requirement that cannot currently be satisfied, with a clear justification and expiry plan.
Generate text-to-video with Wan 2.7 (Wan-AI's flagship motion model) on RunComfy. Documents Wan 2.7's strengths (multi-reference conditioning, audio-driven lip-sync via…
Audit warehouse flow design and material movement -- evaluate pick path optimization algorithms, ABC velocity-based slotting compliance, zone configuration and boundary balancing,…
Audit warehouse management system operations -- evaluate WMS platform architecture (Manhattan, Blue Yonder, SAP EWM, Korber, Infor), inbound receiving and ASN processing, putaway…
Fetches web content with intelligent content extraction, converting HTML to clean markdown. Use for documentation, articles, and reference pages at http/https URLs.
Editorial-minimalist web prototype. Warm monochrome canvas, serif display + grotesque body, 1px hairline borders, muted pastel chips, generous macro-whitespace, ambient…
Execute the primary Webflow workflow — CMS content management: list collections, CRUD items, publish items, and manage content lifecycle via the Data API v2.
Reference WebGPU APIs, TSL syntax, node materials, WGSL integration, compute shaders, and post-processing in Three.js.
Transform webinar recordings into multiple content assets including blog post series, social media snippets, infographic ideas, email sequences, and sales one-pagers.
Convert webinar recordings into blog posts, social snippets, email series. Extract key quotes, statistics, and soundbites.
Manage your author website — auto-publish books on completion, draft blog posts from your projects, render and deploy with one command
Crawl WeChat official account articles and export full content (Markdown/HTML) plus local assets (images, videos, audio) into per-article directories.
Create cinematic wedding montage videos from photos and songs using Remotion. Features act-based narrative structure (5 acts), Ken Burns photo animations, multi-song audio with…
Generate narrative weekly chronicle entry for svaib project. Use ONLY when user invokes /weekly-progress.
Choose majors, minors, classes, and transcript direction by following aliveness, proof, and future options instead of prestige scripts, GPA superstition, or family autopilot.
Comprehensive GitHub release orchestration with AI swarm coordination for automated versioning, testing, deployment, and rollback management.
OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification.
Processes audio files from an S3 bucket using Whisper large-v3, splitting recordings into 30-second chunks with ffmpeg before transcription.
Runs OpenAI Whisper models locally via whisper.cpp with GGML quantized weights for CPU-efficient transcription.
Streams audio from PulseAudio or ALSA devices into whisper.cpp for real-time speech-to-text with word-level timestamps.
Enhances OpenAI Whisper transcription output with speaker diarization using pyannote.audio pipeline and speechbrain embeddings.
Generates accurate subtitles and captions using OpenAI Whisper API with word-level timestamps. Outputs SRT, VTT, and ASS formats with configurable line length and speaker…
Use when the user wants to transcribe, caption, subtitle, batch process, or convert speech to text from local audio/video files using faster-whisper.