` section from `$PLAYWRIGHT_DEBUG_REF`"**, do that lookup if `$PLAYWRIGHT_DEBUG_REF` is non-empty; skip the section augment if empty. ## Session Management Use a unique session name incorporating the worktree identifier to avoid collisions across concurrent worktrees: ```bash WORKTREE_ID="$(basename "$(git rev-parse --show-toplevel)")" SESSION_NAME="pw-debug-${WORKTREE_ID}" ``` All CLI commands below use `-s=$SESSION_NAME` for session persistence. Open the session once before Tier 2, and register a cleanup trap to ensure the session is closed even if the agent is interrupted: ```bash # Register cleanup trap BEFORE opening the session — ensures browser is closed # on normal exit, errors, SIGTERM, SIGINT, and SIGURG (Claude Code tool timeout) _pw_cleanup() { "$PW_CLI" close -s="$SESSION_NAME" 2>/dev/null || true; } trap _pw_cleanup EXIT TERM INT "$PW_CLI" open -s="$SESSION_NAME" ``` Close when done (the trap also handles this, but explicit close is preferred for clarity): ```bash "$PW_CLI" close -s="$SESSION_NAME" trap - EXIT TERM INT # Clear the trap after explicit close ``` ## When to Use - A UI element is missing, wrong, or broken on a deployed or local environment - A browser-visible feature is not behaving as expected - Playwright MCP tests are failing and the root cause is unclear - Debugging a rendering or routing issue before writing a fix ## When NOT to Use - You already know the root cause from a failing unit test — fix it directly - The bug is confirmed server-side (use logs or test output) - You need to validate a full end-to-end deployment workflow **PROJECT-SPECIFIC**: read `## When NOT to Use (Project-Specific)` from `$PLAYWRIGHT_DEBUG_REF` for additional exclusions. ## Visual Regression Gate (run before Tier 1) If visual regression baselines exist, run your project's visual regression test command first. This is a deterministic visual comparison against snapshots in ``. - **Pass**: No browser debugging needed — baselines confirm UI matches expectations. - **Fail**: The diff output identifies which pages/elements changed. Use the failed elements as your Tier 2 targets (skip Tier 1 hypothesis generation — the diff IS the hypothesis). Start at Tier 2 with a single `@playwright/cli run-code` call checking visibility, position, and computed styles of the flagged elements. - **No baselines**: Skip this gate and start at Tier 1 as normal. If called from `/dso:sprint` post-batch: on visual verification failure, the orchestrator reverts the task to open. Save screenshots to `.claude/screenshots/` (gitignored). --- ## The 3-Tier Process ``` Tier 1 (Code Analysis) → Generate hypotheses from source code, templates, CSS, routes → [hypothesis explains bug conclusively] → Fix directly, no browser needed → [hypothesis needs confirmation] → Tier 2 Tier 2 (Targeted Evidence) → @playwright/cli run-code (batched JS) + scoped snapshot → [evidence confirms hypothesis] → Fix directly → [evidence is inconclusive after ≤3 run-code calls] → Tier 3 Tier 3 (Full CLI Interaction) → goto, click, hover, screenshot via @playwright/cli → Fix based on observed behavior ``` **Never jump tiers.** Always complete Tier 1 before touching the browser. Always complete Tier 2 before using full CLI interaction. --- ## Tier 1: Code Analysis — Generate Hypotheses **Tools**: Read, Grep, Glob (no browser tools) **Goal**: Produce a ranked list of hypotheses about the bug's root cause, sourced entirely from reading code. ### Step 1: Identify the symptom's code path **PROJECT-SPECIFIC**: read `## Symptom-to-Code-Path Table` from `$PLAYWRIGHT_DEBUG_REF` for the project-specific symptom table. **Generic fallback (when no reference file configured):** Map the symptom to its source: | Symptom type | Where to look first | |---|---| | Element not visible | Template or view layer, CSS class, JS `display:none` toggle | | Wrong data displayed | Request handler → data layer → view variable | | Form submit fails | `