---
title: "Extract schema-shaped JSON from JavaScript-rendered pages with llm-scraper"
description: "Open a rendered page, describe the fields you want, and return structured data without hand-maintaining brittle selectors."
verification: "listed"
source: "https://github.com/mishushakov/llm-scraper"
author: "mishushakov"
publisher_type: "individual"
category:
  - "Research & Scraping"
framework:
  - "Multi-Framework"
tool_ecosystem:
  github_repo: "mishushakov/llm-scraper"
  github_stars: 6260
  npm_package: "llm-scraper"
  npm_weekly_downloads: 3803
---

# Extract schema-shaped JSON from JavaScript-rendered pages with llm-scraper

Open a rendered page, describe the fields you want, and return structured data without hand-maintaining brittle selectors.

## Prerequisites

Node.js, Playwright-compatible browser runtime, llm-scraper package, supported LLM provider credentials, target URLs

## Installation

Choose whichever fits your setup:

1. Copy this skill folder into your local skills directory.
2. Clone the repo and symlink or copy the skill into your agent workspace.
3. Add the repo as a git submodule if you manage shared skills centrally.
4. Install it through your internal provisioning or packaging workflow.
5. Download the folder directly from GitHub and place it in your skills collection.

Install command or upstream instructions:

```
Install llm-scraper together with Playwright and the schema dependencies shown in the README, configure a supported model provider, then run it against rendered pages with a defined output schema.
```

## Documentation

- https://github.com/mishushakov/llm-scraper

## Source

- [Agent Skill Exchange](https://agentskillexchange.com/skills/extract-schema-shaped-json-from-javascript-rendered-pages-with-llm-scraper/)
