**Rule:** **Execution:** **Example:** **Edge cases:** (optional) ``` **Sections that are inherently lists** (ranked tactics, build wedges, roadmaps, scorecards): keep the natural list structure but apply the same terseness — bullet points carry the load, prose is minimised. No padding sentences, no "in practice", no "the key thing is". **Strategic, reflective, or observation-style sections** (Strategic Frames, Open Questions, "What's likely vs hype", "The shift to internalise"): do NOT force Rule / Execution / Example onto these — the scaffolding makes them feel mechanical. Use bulleted observations or short prose, each item one to two terse lines. Apply operator-first only where there's an actual action to take. A reader should be able to scan a section in 15 seconds and know: what to do, how to do it, what it looks like in the wild, what could go wrong. #### Visual formatting rules - **Do NOT use markdown tables.** They don't render reliably across all markdown viewers (Obsidian, Notion exports, some terminal renderers). Use one of: - Bulleted key/value lines: `- **Signal name** — value (notes)` - Definition-style: `**Term:** definition` - YAML or JSON code blocks for genuinely structured data - **Use ASCII diagrams when a concept benefits from visualisation.** Don't force them — but when content involves any of these, a diagram beats prose: - **Architectures / pipelines** — show the components and their flow - **Hierarchies** — three-tier systems, layers, taxonomies - **Sequences with branches** — request → check → branch → result - **Ranking visualisations** — when relative weights or sizes matter more than exact numbers - **State machines** — when something has clear states with transitions Example ASCII patterns the skill should use freely: Pipeline / flow: ``` Input ──► [ Stage 1 ] ──► [ Stage 2 ] ──► Output │ └──► (side effect / log) ``` Hierarchy / layers: ``` ┌─────────────────────────────────────────┐ │ Layer 3 — Analytics feedback │ ├─────────────────────────────────────────┤ │ Layer 2 — Agent pipeline │ ├─────────────────────────────────────────┤ │ Layer 1 — Research swipefile │ └─────────────────────────────────────────┘ ``` Ranking by magnitude: ``` Signal A (high-leverage) ████████████████████████ ~24× baseline Signal B ██████████████████ ~15× Signal C ███████████ ~10× Signal D ██ low ``` Tree (for file structures or org charts) — already covered by the standard `├── └──` tree pattern, use freely. Keep diagrams compact: ≤ 12 lines per diagram, ≤ 60 chars wide. If a diagram needs more, the concept is probably overloaded — split it. ### Step 7 — Two-job test (split proposal) Quality is enforced by the strict keep gate (Step 4) and the in-playbook dedup pass (Step 5), not by an arbitrary byte cap. After writing the playbook, the only structural question left is **whether the content covers more than one distinct operational job**. Apply the **two-job test**: > *Does the playbook cover more than one distinct operational job — such that a user pursuing one job would have to skim past content for the other?* Examples: - **Two-job folder** → Job A: how to *do* a thing (tactical playbook for an active practice). Job B: how to *automate* doing that thing (the system or pipeline that makes Job A repeatable). A user pursuing Job A won't read Job B; a user building Job B won't re-read Job A. **Two-job test passes → propose split.** - **Single-domain folder** → one job: build or operate one specific system. The user wants the full picture in one place — even at 20 KB. **Test fails → keep as one.** - **Sub-lane folder** → one job with internal sub-lanes (e.g., two product variants in the same category). The sub-lanes inform the same operator decision, so they belong together. **Test fails → keep as one.** If the two-job test **passes**: - Surface a split proposal in the chat: name the candidate playbooks, give a one-sentence boundary rule per playbook, ask the user to confirm. - Wait for user approval. Do **not** split silently. - If approved, write split files; if user prefers single, write a single playbook. If the two-job test **fails**: - Write a single playbook regardless of size. `playbook_size_bytes` is recorded in the manifest as informational, not as a gate. #### Single vs split playbook — output convention - **Single playbook (default)**: write `_distilled/playbook.md`. - **Split playbooks (only when user confirms the two-job test)**: write `_distilled/playbook-.md`, `_distilled/playbook-.md`, etc. Each filename uses a short kebab-case slug naming the operational job (e.g., `playbook-growth-tactics.md`, `playbook-content-os.md`). Do **not** create a `_distilled/playbook.md` file in this mode; the manifest's `playbooks` array is the index. - **Manifest schema in split mode**: see [`references/manifest-schema.md`](references/manifest-schema.md) for the split-mode shape. The point of the two-job test: size doesn't determine whether a split is right — *operational separability* does. A 25 KB single-job playbook is fine. A 12 KB two-job playbook is worth splitting. ### Step 8 — Move files (every file gets a home) After every run, the top level should hold only the four `_` folders. Every file gets moved into one of them based on its verdict: - **kept** → `mv` into `_sources/` (full canonical reference) - **partial** → see "partial routing" below - **archived** → `mv` into `_archive/` - **quarantined** → `mv` into `_quarantine/` #### Partial routing — second test A `partial` verdict means some sections were extracted into the playbook. The question is whether the *file as a whole* is still worth keeping accessible: - **`partial` → `_sources/`** only if what remains after extraction is **itself worth re-reading** — e.g., additional context, narrative, supporting examples, or unique framing not in the playbook. - **`partial` → `_archive/`** if the extracted sections were the only valuable content; the rest is consensus restatement or generic framing. The manifest still records the extracted sections in `sections_kept`, so traceability is preserved. This prevents `_sources/` from drifting into "any file we touched". `_sources/` should stay reserved for files the user might actually re-open. Create `_sources/`, `_archive/`, and `_quarantine/` if they don't exist. Never overwrite — if a same-name file already exists in the target, append `-{timestamp}` to the incoming file. **Migration**: any file at the top level that the manifest already classifies as kept (from a pre-iter-3 run) should be moved into `_sources/` without re-classifying. Files previously classified as partial should be re-evaluated against the partial routing rule above. **Manifest paths**: after step 8, every file entry's `path` should reflect its final location (`_sources/...`, `_archive/...`, or `_quarantine/...`). ### Step 9 — Write the manifest Update `_distilled/manifest.json`. Full schema in [`references/manifest-schema.md`](references/manifest-schema.md). ### Step 10 — Report to user Output the end-of-run summary (format below). ## Output structure ``` ~/Research/[topic]/ ├── _sources/ ← canonical kept files (was top-level pre-iter-3) │ ├── perplexity-2026-04-12.md │ └── chatgpt-deep-research-3.md ├── _distilled/ │ ├── playbook.md ← operator-first, no inline citations │ ├── manifest.json ← traceability + contribution types │ └── (on --rebuild: backup-{date}/) ← previous _distilled/ snapshot ├── _archive/ │ └── chatgpt-generic-overview.md └── _quarantine/ └── ambiguous-mixed-notes.md ``` The top level holds **only** the four `_` folders after a run. New files dropped at the top level get processed and routed on the next `/distil research` invocation. ## Incremental vs rebuild **Default: incremental.** Only process new and changed files. Merge into existing playbook with compression bias. Re-run the dedup pass (step 5) over the combined section set, not just the new sections. **`--rebuild`**: full re-do. Process every file as if new. Snapshot the current `_distilled/` to `_distilled/backup-{YYYY-MM-DD-HHMM}/` first. `_archive/` and `_quarantine/` are not reset — those represent user-curated state. **Auto-recommend rebuild** when incremental would restructure >30% of sections. Don't force it — surface as a recommendation. ## Housekeeping mode workflow Target: `~/Research/` (the parent). ### Step 1 — Inventory List all topic subfolders. For each, read `_distilled/playbook.md` and `_distilled/manifest.json` if present. If no `_distilled/` exists, work from raw files at the top level and (if it exists) `_sources/`. ### Step 2 — Cross-topic overlap detection For each pair of topic folders, compare playbook sections (or raw file sections if no playbook). Identify: - **Duplicate content** — same idea covered in two playbooks - **Misfiled content** — a section in topic A that belongs in topic B - **Adjacent topics that could merge** — two folders on the same domain ### Step 3 — Grade each topic with confidence Assign A–F per topic on: - Signal density (proportion of active content that's tier 1/2/3) - Playbook quality (compressed, operator-first, scannable) - Source health (ratio of active to archived files; internal duplication) Attach a confidence label to each grade: `high` / `medium` / `low`. A folder with rich playbooks lets you grade with high confidence; a folder with only raw files (no playbook yet) typically yields medium. ### Step 4 — Filename normalisation suggestions While scanning files, flag any whose filename is the literal Perplexity / ChatGPT prompt (e.g., `"I came accross this post . Please can you extract.md"`, `"What does this mean for X.md"`). For each, propose a structured filename using this pattern: ``` ---.md ``` Where `` is the LLM/tool that produced the export (`chatgpt`, `perplexity`, `xai`, `gemini`), or the author/origin if a specific person or post is the source. Examples: - `industry-trends-key-findings-perplexity-2026-05.md` - `tools-evaluation-comparison-matrix-chatgpt-2026-05.md` - `workflow-patterns-from-author-name-chatgpt-2026-05.md` **Propose only. Never auto-rename — even at high confidence.** Filename changes can break links, scripts, and manifests. The user decides which to apply and when. ### Step 5 — Action or propose moves **Auto-move only if ALL hold:** 1. Confidence ≥ 95% the file belongs in another topic 2. The destination topic clearly exists (subfolder present) 3. The file is not already referenced by an existing playbook For everything else, **list in the overlap report with a confidence label** (`high` / `medium` / `low`) and wait for confirmation. The default behaviour is "propose, don't action". Auto-moves should be exceedingly rare. Zero auto-moves is a fine outcome when everything is correctly placed. ### Step 6 — Write the overlap report `~/Research/_overlap-report.md` with sections: 1. **Topics inventoried** — bulleted list: topic, file count, brief note (no markdown tables) 2. **Grades** — letter grade + confidence label + one-line justification per topic (bullets, not tables) 3. **Cross-topic overlap detected** — distinguish real overlaps from filename-pattern matches with different content 4. **Auto-actioned moves** — count + list (typically zero) 5. **Proposed moves** — with confidence label per item 6. **Filename normalisation suggestions** — list of proposed renames; user reviews and applies 7. **Recommendations** — non-move next steps (e.g., "run topic-mode on folder X", "consider splitting folder Y") 8. **Low-confidence calls** — surfaced explicitly so the user can override ## End-of-run summary format After topic mode runs: ``` ✓ Distilled ~/Research/[topic] ([first-run | incremental | rebuild]) Changes - N files processed [if cluster mode used: "(X directly read, Y classified by pattern)"] - N files kept (X full, Y partial) - N files archived (one-line reasons) - N files quarantined - Playbook: N sections (size N KB) - [if two-job test passed and split was confirmed] Split: () + () - [if hallucination_warnings non-empty] Flagged N source(s) for potentially fabricated technical claims — see manifest hallucination_warnings File tree ~/Research/[topic]/ ├── ... Playbook: ~/Research/[topic]/_distilled/playbook.md Manifest: ~/Research/[topic]/_distilled/manifest.json Next best action: ``` After housekeeping mode runs, append the same one-line **Next best action** to the chat summary. Pick the action that delivers the most value next, based on the run's state: - `review quarantine` — when files were quarantined (need user judgement before they're archived or restored) - `split topic` — when playbook size triggered the split recommendation - `run housekeeping` — when this was a topic-mode run and the user hasn't done a cross-topic pass in a while - `accept rename suggestions` — when housekeeping produced high-confidence filename normalisation proposals - `ship` — when nothing else is pending: the playbook is ready to use Keep the rest of the summary terse. The detail lives in playbook and manifest. ## Safety rules - **Never delete files.** Move to `_archive/` or `_quarantine/` only. - **Never overwrite files in `_archive/` or `_quarantine/`.** Append `-{timestamp}` on collision. - **Never auto-rename files** based on normalisation suggestions. Propose only. - **Before large batch moves** (>10 files in a single run), confirm in chat. - **On rebuild, always snapshot** the prior `_distilled/` first. - **Read manifest before writing it.** Never blow away prior decisions silently. ## Triggering reminders When the user says any of these, trigger this skill: - `/distil research`, `distil research`, `distil my research` - "clean up my research folder", "organise my research" - "build a playbook from [folder]", "make a playbook out of these" - "what's in my research folder?", "any overlap in my research?" - Any reference to processing files in `~/Research/` If the user mentions distilling research informally during a longer conversation, surface this skill and confirm before running.