---
title: "Repair, split, merge, and normalize PDFs with qpdf before downstream processing"
description: "Preprocess messy PDFs into a stable form before extraction, review, packaging, or delivery workflows depend on them."
verification: "listed"
source: "https://github.com/qpdf/qpdf"
author: "qpdf"
publisher_type: "organization"
category:
  - "Data Extraction & Transformation"
framework:
  - "Multi-Framework"
tool_ecosystem:
  github_repo: "qpdf/qpdf"
  github_stars: 1479
---

# Repair, split, merge, and normalize PDFs with qpdf before downstream processing

Preprocess messy PDFs into a stable form before extraction, review, packaging, or delivery workflows depend on them.

## Prerequisites

- A working qpdf installation
- Source PDF files
- A writable output path
- Optional: a downstream extraction or delivery workflow
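
The prerequisites above can be checked before any PDF is touched. The following is a minimal sketch, not part of the skill itself: the function name `check_prerequisites` and its message strings are hypothetical, and it assumes qpdf is expected on `PATH` under the name `qpdf`.

```python
import os
import shutil
from pathlib import Path


def check_prerequisites(sources, output_dir):
    """Return a list of human-readable problems; an empty list means ready.

    Hypothetical helper for illustration only.
    """
    problems = []

    # Assumption: the qpdf executable is installed under the name "qpdf".
    if shutil.which("qpdf") is None:
        problems.append("qpdf not found on PATH")

    # Every source PDF must exist as a regular file.
    for src in sources:
        if not Path(src).is_file():
            problems.append(f"missing source PDF: {src}")

    # The output location must be an existing, writable directory.
    out = Path(output_dir)
    if not out.is_dir() or not os.access(out, os.W_OK):
        problems.append(f"output path not writable: {out}")

    return problems
```

Calling `check_prerequisites(["scan.pdf"], "cleaned")` returns an empty list when everything is in place, so a caller can stop early and print the problems otherwise.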

## Installation

Choose whichever fits your setup:

1. Copy this skill folder into your local skills directory.
2. Clone the repo and symlink or copy the skill into your agent workspace.
3. Add the repo as a git submodule if you manage shared skills centrally.
4. Install it through your internal provisioning or packaging workflow.
5. Download the folder directly from GitHub and place it in your skills collection.

To install and run qpdf itself:

1. Install qpdf from your package manager or by following the upstream build instructions.
2. Run the documented qpdf commands for repair, splitting, merging, or normalization on the target PDFs.
3. Pass the cleaned outputs into the downstream workflow.
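
As a rough illustration of the repair, split, merge, and normalize steps, the sketch below builds qpdf command lines and runs them through `subprocess`. The helper names (`repair_cmd`, `split_cmd`, `merge_cmd`, `normalize_cmd`, `run`) are hypothetical. Treating "normalize" as linearization is also an assumption, since the skill does not say which normalization it intends; check the qpdf documentation for the options that suit your files.

```python
import subprocess


def repair_cmd(src, dst):
    # Rewriting a file lets qpdf attempt recovery, e.g. rebuilding a damaged
    # cross-reference table, and write a clean copy to dst.
    return ["qpdf", str(src), str(dst)]


def split_cmd(src, pattern):
    # --split-pages writes one output file per page; "%d" in the pattern is
    # replaced by the page number.
    return ["qpdf", "--split-pages", str(src), str(pattern)]


def merge_cmd(sources, dst):
    # --empty starts from a blank document; the inputs listed between
    # --pages and -- are appended in order, all pages of each.
    return ["qpdf", "--empty", "--pages", *map(str, sources), "--", str(dst)]


def normalize_cmd(src, dst):
    # Assumption: "normalize" is taken to mean linearizing the file here.
    return ["qpdf", "--linearize", str(src), str(dst)]


def run(cmd):
    """Run a qpdf command and return its exit status.

    qpdf exits with 0 on success, 3 when it finished with warnings only,
    and 2 on errors, so anything other than 0 or 3 is raised as a failure.
    """
    result = subprocess.run(cmd, capture_output=True, text=True)
    if result.returncode not in (0, 3):
        raise RuntimeError(f"{' '.join(cmd)} failed: {result.stderr.strip()}")
    return result.returncode
```

For example, `run(repair_cmd("scan.pdf", "scan-clean.pdf"))` would produce a rewritten copy, and `run(merge_cmd(["a.pdf", "b.pdf"], "combined.pdf"))` would combine two files. Keeping command construction separate from execution makes it easy to log or review each command before it runs. The file names here are placeholders.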

## Documentation

- https://qpdf.readthedocs.io/

## Source

- [Agent Skill Exchange](https://agentskillexchange.com/skills/repair-split-merge-and-normalize-pdfs-with-qpdf-before-downstream-processing/)
