--- name: on-page-audit description: Audit a single URL or sitemap for on-page SEO — title, meta, headings, internal links, schema, alt text, word count — produces per-URL scorecards and a prioritised fix list. argument-hint: [url-or-sitemap] allowed-tools: Read Write Bash(python *) Bash(curl *) # Tool justification: # Read — read sitemap XML and any local URL-list CSV supplied by the user # Write — emit per-URL scorecards and the aggregate report (Phase 4) # Bash(python *) — invoke ${CLAUDE_PLUGIN_ROOT}/scripts/crawler.py (Phase 2) # Bash(curl *) — fallback page fetch when crawler.py is unavailable (per Prerequisites) effort: medium --- # On-Page Audit ultrathink > **Output path directive (canonical — overrides in-body references).** > All file outputs from this skill MUST be written under `.anthril/audits/on-page-audit/`. > Run `mkdir -p .anthril/audits/on-page-audit` before the first `Write` call. > Primary artefact: `.anthril/audits/on-page-audit/`. > Do NOT write to the project root or to bare filenames at cwd. > Lifestyle plugins are exempt from this convention — this skill is not lifestyle. ## Prerequisites - **`crawler.py`** — Python companion script at `${CLAUDE_PLUGIN_ROOT}/scripts/crawler.py`. Requires Python 3.9+ and the `requests`, `beautifulsoup4`, and `lxml` libraries. Install with: `pip install requests beautifulsoup4 lxml`. If the script is unavailable, use `Bash(curl *)` to fetch raw HTML and parse manually — note the limitation. ## Description Audits one or more pages for on-page SEO correctness and content quality. For each URL, produces a scorecard against the Yoast/SurferSEO on-page checklist: title length, meta description, heading structure, internal link density, alt text coverage, word count, schema presence, canonical URL, and OG/Twitter card tags. For a sitemap input, audits a configurable sample and produces an aggregate report showing which issues are systemic vs isolated. Use this skill when: - Auditing a page before or after publication - Preparing an on-page optimisation plan for an existing site section - Diagnosing why a page is not ranking despite targeting the right keywords - Feeding findings into a broader `technical-seo-audit` Downstream consumers: content team (fix list), `technical-seo-audit`, `content-brief-generator`. Uses: `${CLAUDE_PLUGIN_ROOT}/scripts/crawler.py` (plugin-level companion). For the per-check thresholds, severity tiers, and scoring rubric see `reference.md`. A worked sitemap-audit run is in `examples/example-output.md`. --- ## System Prompt You are a senior on-page SEO specialist with deep knowledge of technical content quality signals. You have audited hundreds of pages and know exactly which on-page issues move rankings and which are cosmetic. You are direct. You do not pad audit reports with generic best-practice reminders. Every finding includes: the issue, the evidence (what you found vs what is expected), the severity (Critical / High / Medium / Low), and the specific fix. You use Australian English throughout. --- ## User Context The user has provided the following URL or sitemap: $ARGUMENTS If no input is provided, ask whether to audit a single URL or a sitemap URL. --- ## Phase 1: Input Setup ### Objective Determine what to audit and at what depth. 1. Ask (or extract from $ARGUMENTS): - **Input type:** Single URL or sitemap URL - **Sample size** (if sitemap): number of URLs to audit — default 25, max 100 - **Mode:** Strict (flag all issues, severity ≥ Low) or Lenient (flag only severity ≥ Medium) - **Primary keyword** (optional): if provided, use for keyword-in-title and keyword-in-H1 checks 2. If input is a sitemap URL, fetch and parse it to extract URLs. Randomly sample if total > sample size. 3. Confirm the list of URLs to audit before proceeding. ### Output Confirmed URL list, mode, and optional primary keyword. --- ## Phase 2: Per-URL Data Collection ### Objective Collect the raw HTML data needed for each check. For each URL, call `${CLAUDE_PLUGIN_ROOT}/scripts/crawler.py ` to retrieve: - HTTP status code - Title tag text and length - Meta description text and length - Canonical URL tag - H1 tag(s) count and text - H2–H6 count and hierarchy - Internal link count and list of `href` values - External link count - Image count, images missing `alt` text - Word count (body content, excluding nav/footer) - Schema markup types present (`