--- name: deduplicate-security-issue description: | Merge two tracking issues that describe the same root-cause vulnerability (typically discovered independently by two reporters, arriving via different channels), preserving every reporter's credit, every mailing-list thread reference, and every independent attack-vector description. Updates the kept issue's body in place, closes the duplicate with the `duplicate` label, and regenerates the CVE JSON attachment so both finders land in `credits[]`. when_to_use: | Invoke when a security team member says "dedupe #NNN and #MMM", "merge #MMM into #NNN", "#MMM is a duplicate of #NNN", or when the import-security-issue skill's Step 2a surfaces a STRONG match (GHSA ID collision) between a new report and an existing tracker. Also appropriate as a periodic cleanup action when a triager spots two open trackers describing the same bug from different angles. --- # deduplicate-security-issue Merges two `` tracking issues that describe the same underlying vulnerability. The output is a single tracker ("the **kept** issue") that carries every reporter's credit, every mailing-list thread, and every independent report's body, with the other tracker ("the **dropped** issue") closed and labelled `duplicate`. This is **one of the few places in the security workflow** where a piece of reporter-supplied content (the dropped issue's body) moves from one tracker to another. Since the target tracker is private to ``, no confidentiality boundary is crossed, but the skill must still preserve every reporter's credit verbatim and surface the merge in a status comment on both trackers so the audit trail stays complete. **Golden rule — propose before applying.** Every merge is a proposal: the skill computes the merged body, the two status comments, the label/close-issue actions, and the CVE-JSON regen command, and shows all of them to the user. Nothing is applied until the user confirms. There is no fast-path. **Golden rule — never merge across scopes.** Two trackers with different **scope labels** (`airflow` vs. `providers`, `airflow` vs. `chart`, etc.) must not be merged. If an external reporter rediscovers the same bug in two different products' surfaces, that is a multi-scope report and the resolution is a **scope split** handled by the `sync-security-issue` skill, not a dedupe. This skill refuses to operate when the two candidate trackers have different scope labels, and the proposal says so explicitly. --- ## Inputs | Selector | Resolves to | |---|---| | `dedupe # ` | merge the `` tracker into ``; `` stays open, `` closes as duplicate | | `dedupe ` | same, without the `#` | | `dedupe #NNN` (single argument) | ambiguous — ask the user which one is kept; do not guess | Picking which is kept vs. dropped is a user decision; the skill does **not** auto-pick. Practical guidance to offer when asked: - If one tracker has a **CVE allocated** and the other does not, keep the one with the CVE (preserves the allocation). - If one tracker is older, keep the older one (preserves the audit-trail timestamp). - If one tracker has richer body content (more attack vectors, CVSS scoring, PoC code), merge *into* the one with the CVE but keep all the rich content via the "Second independent report" section described in Step 3 below. --- ## Prerequisites - **`gh` CLI authenticated** with collaborator access to `` — the skill reads both trackers, edits the kept tracker's body, closes the dropped tracker, and adds / removes labels. - **`uv` installed** — the Step 5 CVE-JSON regeneration is a `uv run` call. See [Prerequisites for running the agent skills](../../../README.md#prerequisites-for-running-the-agent-skills) in `README.md`. --- ## Step 0 — Pre-flight check 1. `gh api repos/ --jq .name` returns ``. 2. Both issue numbers resolve — `gh issue view --repo --json number` and the same for `` — before any write. 3. `uv --version` returns. If any check fails, stop. A partial dedup (body merged but dropped tracker left open, or CVE JSON not regenerated) is worse than no dedup. --- ## Step 1 — Fetch and classify both trackers ```bash gh issue view --repo --json number,title,state,body,labels,milestone,assignees,author,comments gh issue view --repo --json number,title,state,body,labels,milestone,assignees,author,comments ``` Verify: - Both trackers are in state `open` (merging into or out of a closed tracker is almost always a mistake; surface as a blocker if either side is already closed and ask the user to confirm). - Both have the **same scope label** — `airflow` vs. `airflow`, or `providers` vs. `providers`, or `chart` vs. `chart`. If the scope labels differ, refuse the merge and tell the user this is a multi-scope report to be handled by `sync-security-issue`'s scope-split flow instead. - Neither tracker is already labelled `duplicate` (that would indicate a partial-merge already happened and someone left it half-done; surface as a blocker and let the user decide how to recover). --- ## Step 2 — Extract the per-field values from both For each tracker, extract the template fields: - *The issue description* — typically the reporter's full message. In older trackers the field may not have an explicit heading (everything above *"Short public summary for publish"* is the description by convention). - *Short public summary for publish* - *Affected versions* - *Security mailing list thread* - *Public advisory URL* - *Reporter credited as* - *PR with the fix* - *CWE* - *Severity* - *CVE tool link* Also capture: - Each tracker's **labels** (scope, `cve allocated`, `pr *`, `announced - emails sent`, etc.). - Each tracker's **milestone** (Airflow version / Providers wave / Chart version). - Each tracker's **assignees**. - Whether each tracker has a **CVE JSON attachment** comment (from `generate-cve-json --attach`) — only the kept side's attachment will be regenerated in Step 5. --- ## Step 3 — Build the merged body proposal The output is a single body that preserves both reporters' content verbatim. The body-field schema (role names, empty-field convention, body-field-surgery pattern) is documented in [`tools/github/issue-template.md`](../../../tools/github/issue-template.md); the concrete field names for the adopting project live in [`/project.md`](../../..//project.md#issue-template-fields). Structure: ```markdown ### The issue description --- **Second independent report: [#](https://github.com//issues/) — merged on .**

Full report from (click to expand)

### Short public summary for publish ### Affected versions ### Security mailing list thread (): (): ``` (one line per reporter; keep them in chronological order of the original report, earliest first) ```markdown ### Public advisory URL ### Reporter credited as ``` (one line per credit; preserve the *exact* form each reporter confirmed, or the placeholder form when unconfirmed; the merge does not silently re-synthesize credits) ```markdown ### PR with the fix ### CWE ### Severity ### CVE tool link ``` The **Second independent report** block is the load-bearing part of the merge. It lets every future triager read both reports in one place without having to chase the closed duplicate's content. Append the drop side's body **verbatim** inside the `

` disclosure — preserve the reporter's wording, code blocks, and PoC text. Do not paraphrase; paraphrasing a security report is how credits get subtly wrong before publication. The short headline that stays visible at the top of the `

` block is a one-sentence summary for scroll-readers; clicking expands to the full verbatim report. This is the same short-headline-over-collapsed-details pattern the status-change comments use, applied to the body so a long secondary report does not push every other body field below the fold. If the drop-side body already had a *"Second independent report"* `

` block (chain-merge case — rare), nest its content inside the new outer block (or append as a sibling sub-block) so the chain of merges stays visible. Never flatten or rewrite earlier merges. --- ## Step 4 — Build the rollup-entry proposals Two rollup-comment entries, one per tracker — **not** two new top-level comments. The entries are appended to each tracker's existing status-rollup comment (created by `import-security-issue`) via the upsert recipe in [`tools/github/status-rollup.md`](../../../tools/github/status-rollup.md#upsert-recipe--append-to-an-existing-rollup-or-create-one). When either tracker does not yet carry a rollup (legacy tracker pre-dating the convention), the upsert recipe's Step 2b creates one and folds any pre-existing legacy bot comments in on the way. Each entry is a single `

` block. Follow the zero-whitespace rules from the shared spec — no leading spaces inside the block, one blank line after `

…

`, one blank line before `

`. ### Entry appended to the kept tracker's rollup ```markdown

· @ · Merge (kept) (from #)

**Merged [#](https://github.com//issues/) into this tracker.** - Body: 's original report preserved; 's report appended as *"Second independent report"*. - Credits: **** + ****. - Mailing threads: both listed. - CVE: [-](https://cveprocess.apache.org/cve5/-) stays allocated here; [#](...) being closed as duplicate. **Next:** . Full analysis of why the two reports are the same root-cause bug (same function, same file, same allowlist fix) but describe different attack vectors / affected processes / threat-model boundaries. Per-field hand-off details: - *Reporter credited as*: . - *Security mailing list thread*: . - *Short public summary for publish*: . - *CWE*: | kept as _No response_ | BLOCKER: conflict between and — triager to resolve>. - *Affected versions*: widened to . - CVE JSON attachment regenerated: .

``` ### Entry appended to the dropped tracker's rollup ```markdown

· @ · Merge (dropped) (into #)

**Closing as duplicate of [#](https://github.com//issues/).** Full content merged into [#](...) as *"Second independent report"*; credited alongside there. All triage and advisory work continues on [#](...). . Specific artifacts merged: . See [the merge entry on #](…) for the full hand-off record.

``` Both entries must render every cross-issue reference as a clickable markdown link per the *Linking `` issues and PRs* convention in [`AGENTS.md`](../../../AGENTS.md). No six-line visible cap — the entire entry is already collapsed inside `

`; write what the auditor needs. Do not pad. --- ## Step 5 — Confirm with the user, then apply sequentially Present the proposal: - Numbered items for the body update, each status comment, the `duplicate` label application on the dropped side, the close-issue action on the dropped side, and the CVE-JSON regen on the kept side. - The resulting merged body rendered in full (not a diff), so the user can proofread end to end before confirming. Confirmation forms: - `all` — apply every proposed action. - `1,3,5` — apply selected items only (for example, *"apply body update and status comment but don't close the duplicate yet — I want to triple-check"*). - `none` / `cancel` — bail. - Free-form edits — regenerate only the specified item and re-confirm. After confirmation, apply **sequentially** (never in parallel): 1. `gh issue edit --body-file ` — updated body 2. Rollup-comment upsert on the kept tracker per [`tools/github/status-rollup.md`](../../../tools/github/status-rollup.md#upsert-recipe--append-to-an-existing-rollup-or-create-one) — append the `Merge (kept)` entry (`gh api -X PATCH repos//issues/comments/ --input …`) or create the rollup if none exists yet. The same step folds any legacy bot comments on the kept tracker into the rollup first, per the fold-legacy sub-step in [`sync-security-issue`](../sync-security-issue/SKILL.md). 3. Rollup-comment upsert on the dropped tracker — append the `Merge (dropped)` entry (same recipe; fold legacy comments first when needed). 4. `gh issue edit --repo --add-label duplicate` 5. `gh issue close --repo --reason "not planned"` (GitHub's `duplicate` close-reason is not exposed by `gh` on all versions; `not planned` combined with the `duplicate` label carries the same signal) 6. `uv run --project /tools/vulnogram/generate-cve-json generate-cve-json --attach` — the *Remediation developer* body field is the source of truth for remediation-developer credits (populated by the `sync-security-issue` skill from the linked PR's author); no CLI flag needed 7. For each legacy bot comment folded in steps 2 / 3, delete the original with `gh api -X DELETE repos//issues/comments/` — only after the matching rollup PATCH succeeded. If any step fails, stop and ask the user how to proceed — do not guess. Partial merges are recoverable as long as the body update (step 1) succeeded; the rest is bookkeeping on top. --- ## Step 6 — Recap After the apply loop, print a short recap: - The kept tracker as a clickable [`#`](...) link with a short summary of its new state (label set, credit list, both threads). - The dropped tracker as a clickable link with its new closed state. - The regenerated CVE JSON attachment URL. - Any blockers surfaced during the merge (CWE conflict, unconfirmed credits, stale drafts, etc.) repeated here so the user does not have to scroll. Apply the `` link-form self-check to the entire recap before presenting. --- ## Hard rules - **Never merge across scopes.** Different scope labels → scope split (via `sync-security-issue`), not dedupe. - **Never re-synthesize credits.** Copy each reporter's credit line verbatim from their tracker. - **Never propagate a reporter-supplied CVSS** from the dropped tracker into the kept tracker's `Severity` field. The independent-scoring rule in [`AGENTS.md`](../../../AGENTS.md) applies to merged content. - **Never paraphrase a reporter's body.** Paraphrasing is how credits and vulnerability details go subtly wrong before publication; append verbatim under the *Second independent report* heading. - **Never close the wrong side.** The kept issue stays open; the dropped issue closes. Before running the `close` command, re-check the mapping one last time. - **Never delete the dropped tracker.** GitHub issues are effectively immutable audit trail; closing + labelling as `duplicate` is the right ending state. --- ## When dedupe is **not** appropriate - The two trackers are in **different scopes** → use the scope-split flow in `sync-security-issue` instead. - The two trackers describe the same code surface but **different bugs** with **different fixes** (for example, two separate allowlist gaps in the same file, each requiring its own advisory) → leave them as separate trackers and cross-link in comments, but do not merge. - One tracker has already moved past Step 13 (advisory sent) — the advisory has already gone out citing one reporter; retroactively merging a second reporter into the sent advisory requires an errata announcement via the missing-credits follow-up (Step 16 of the handling process), not a tracker-body merge. --- ## References - [`README.md`](../../../README.md) — the handling process; duplicates are resolved here at various steps rather than at a single numbered step. - [`import-security-issue`](../import-security-issue/SKILL.md) — Step 2a surfaces potential duplicates before a new tracker is even created, so in the ideal case this skill is never needed on a fresh import. - [`sync-security-issue`](../sync-security-issue/SKILL.md) — runs on the kept tracker after the merge to reconcile labels / milestone / credit-preference drafts for both reporters. - [`generate-cve-json`](../../../tools/vulnogram/generate-cve-json/SKILL.md) — regenerates the kept tracker's CVE JSON attachment so both finders land in `credits[]`.