--- name: carousel-builder description: "Orchestrator skill — turns a topic or research brief into an N-slide Instagram / LinkedIn / TikTok carousel with consistent visual style and ready-to-post captions. Wraps essay-write + viral-text (for content) + image-prompt --execute (for slides) + common style library (24 visual styles). Modes: --topic / --research; --style auto||--style-ref ; --slides 3-12; --platform instagram|linkedin|tiktok; --aspect portrait|square|story; --text-mode embedded|overlay|none; --execute; --resume. Outputs: ./generated/carousel//slide-{1..N}.png + captions.md + manifest.json. Use when the user says 'make a carousel about X', 'turn this research into a post', '8 slides on Y', 'carousel for LinkedIn'." license: MIT allowed-tools: - Read - Write - Edit - Bash - Grep - Glob --- End-to-end carousel generator. Input: topic OR research brief. Output: N image files with consistent visual style + per-slide caption + final post copy + manifest for --resume. This skill orchestrates four lower-level skills: 1. `essay-write` or `viral-text` → drafts the content 2. `image-prompt` style anchor + per-slide prompts 3. `common/runners` execute layer → batch generation via the chosen provider 4. `common/style-library/carousel/` → style anchor (24 bundled styles + user overrides) Use when the user wants a finished carousel, not just prompts. Without `--execute`, returns the 8 prompts + captions for manual paste; with `--execute`, generates and saves the actual PNG slides. This skill does NOT: - Compose the slides into a single tall image — Instagram / LinkedIn handle multi-image posts natively. - Add text overlays via a design tool — text either gets generated INSIDE the image (gpt-image-2 / Ideogram / Imagen) via `--text-mode embedded`, or is left to the user's editor (`--text-mode overlay`). - Generate animated carousels (those are reels — use `reel-builder`). - Post to platforms — output is files you upload via the platform's UI / API. ## ROLE Topic / research → split content into N slides → pick style + model → assemble 8 per-slide prompts (style anchor + slide content + composition hint) → batch execute via image provider (one provider for all slides for consistency) → write slides + captions + manifest → print final paths. ## PIPELINE 1. **Resolve input source**: - `--research `: read the brief. Use TL;DR as hero, Key facts as slide content, Suggested angles to inform tone. - `--topic ""`: invoke `essay-write` (long-form) or `viral-text` (hook-driven) to produce 200-400 word source content first. Choose based on `--platform`: - `instagram` / `tiktok` → `viral-text` - `linkedin` → `essay-write` 2. **Split into slides** — see `references/slide-roles.md` (preferred) or `references/slide-split.md` (legacy): - **9 supported roles** (v2.12.0+): `hook`, `point`, `framework`, `data`, `steps`, `comparison`, `quote`, `myth-vs-truth`, `cta`. Each role has its own composition template and info-density expectation (see `references/slide-roles.md`). - Default deck shapes: - 3 slides: hook → point → cta - 5 slides: hook → point → framework-OR-data → point → cta - 6 slides: hook → point → framework → data → quote → cta - 7 slides: hook → point → framework → data → quote → comparison → cta - 8 slides: hook → point → framework → data → comparison → quote → steps → cta - **Information discipline**: middle slides MUST be informative — use `framework` / `data` / `steps` / `comparison` / `quote` / `myth-vs-truth` roles to force real content density. A deck of all-`hook`/`point` slides is hollow and reads as "atmospheric image dump with captions". 3. **Resolve style** — see `references/style-resolution.md`: - `--style `: load from `common/style-library/carousel/.md`. Use the `Style anchor (carousel)` block; if `--text-mode embedded`, use the `Style anchor (text-in-image mode)` block instead. - `--style auto`: examine topic + tone → narrow candidates to 3-5 from library → pick first, log alternatives. - `--style-ref `: skip library; use the user's image as multi-ref. Requires a model that supports image-ref (Nano Banana Pro / Flux Kontext / Seedream / Ideogram ref-mode). - `--style auto` + `--style-ref `: BOTH — library style anchor TEXT + user reference IMAGE. Provider gets both. 4. **Pick model** — see `references/model-picker.md`: - `--model auto`: text-heavy slide AND `--text-mode embedded` → gpt-image-2 or Ideogram 3 Quality. Photo-realistic style → Flux 2 Pro / Imagen 4 Ultra. Illustration / 3D style → Nano Banana Pro / Flux 2 Pro. Multi-ref present → Nano Banana Pro. - `--model `: override. Validate that the model is registered + env var is set. - ONE model for all slides. Mixing models breaks consistency. 5. **Build per-slide prompts** — STRONGLY PREFER `common.runners.carousel_prompt_builder.build_slide_prompt()` over hand-rolling. The builder produces figma-rigor prompts that combine: (a) the style's text-in-image anchor, (b) per-role composition template from `references/slide-roles.md`, (c) static carousel elements (page indicator + swipe arrow OR end marker + slide marker), (d) anti-AI-tells closing modifiers, (e) universal rules from `common/style-library/carousel/_universal-rules.md`. Skill side only provides STRUCTURED CONTENT via the role-specific dataclasses (HookContent, FrameworkContent, DataContent, StepsContent, ComparisonContent, QuoteContent, MythTruthContent, PointContent, CtaContent). Each non-hook slide MUST carry real information (framework boxes, data points, steps, comparison columns, quote with attribution) — not just atmospheric "hook + sentence". Avoid the magazine-with-text-overlay failure mode. Legacy manual prompt assembly is supported for back-compat but produces weaker carousels. Legacy manual format (NOT recommended — use the builder): ```