ClaudSkills / Content / audio-podcast

ai-multimodal

Quality score: 85/100  ·  Category: Content  ·  Sub-category: audio-podcast
ai:gemini
Process and generate multimedia content using Google Gemini API. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis up to 9.5 hours), understand images (captioning, object detection, OCR, visual Q&A, segmentation), process videos (scene detection, Q&A, temporal analysis, YouTube URLs, up to 6 hours), extract from documents (PDF tables, forms, charts, diagrams, multi-page), generate images (text-to-image, editing, composition, refinement). Use when working with audio/video files, analyzing images or screenshots, processing PDF documents, extracting structured data from media, creating images from text prompts, or implementing multimodal AI features. Supports multiple models (Gemini 2.5/2.0) with context windows up to 2M tokens.

What this skill does

ai-multimodal is a well-rated Claude Code skill (quality score 85/100) in the audio-podcast sub-category. It ships as a SKILL.md file that Claude Code auto-discovers under ~/.claude/skills/ai-multimodal-mrgoonie/ and loads when your prompt matches the skill's trigger.

When to invoke it: Use when working with audio/video files, analyzing images or screenshots, processing PDF documents, extracting structured data from media, creating images from text prompts, or implementing multimodal AI features. Supports multiple models (Gemini 2.

Who uses this skill

The ai-multimodal skill is built for content creators, marketers, copywriters, SEO professionals, and editorial teams. It is part of the open ClaudSkills registry, a community-curated catalog of 15,000+ capabilities you can install for Claude Code — the Claude CLI agent.

How to install

Free

Manual install (2 steps)

mkdir -p ~/.claude/skills/ai-multimodal-mrgoonie
curl -L https://claudskills.com/skills/ai-multimodal-mrgoonie/SKILL.md \
  -o ~/.claude/skills/ai-multimodal-mrgoonie/SKILL.md

Or just download SKILL.md directly and drop it into ~/.claude/skills/ai-multimodal-mrgoonie/. Claude Code auto-discovers it on next session.

Skills live at ~/.claude/skills/ai-multimodal-mrgoonie/SKILL.md on macOS/Linux, or %USERPROFILE%\.claude\skills\ai-multimodal-mrgoonie\SKILL.md on Windows. See the full install guide for step-by-step instructions.

Pro

One-click install via the desktop app

The ClaudSkills desktop app installs any skill directly into ~/.claude/skills/ with one click — no terminal required. Pro starts at $9/mo or $149 lifetime.

More Content skills

Browse all Content skills in the ClaudSkills registry, or explore these top-rated picks from the same category:

Browse all Content skills → Top 100 skills
Part of ClaudSkills — the open registry for Claude Code skills.  ·  What's New  ·  Install guide  ·  About  ·  llms.txt