---
title: "Tesseract OCR Engine for Image-to-Text Workflows"
description: "Tesseract OCR is a widely used open source optical character recognition engine with command line and library interfaces. It can extract text from images and scanned documents, supports more than 100 languages, and outputs plain text, hOCR, TSV, and PDF variants."
verification: "security_reviewed"
source: "https://github.com/tesseract-ocr/tesseract"
author: "tesseract-ocr"
publisher_type: "Community"
category:
  - "Media & Transcription"
framework:
  - "Multi-Framework"
tool_ecosystem:
  github_repo: "tesseract-ocr/tesseract"
  github_stars: 73445
---

# Tesseract OCR Engine for Image-to-Text Workflows

Tesseract OCR is a widely used open source optical character recognition engine with command line and library interfaces. It can extract text from images and scanned documents, supports more than 100 languages, and outputs plain text, hOCR, TSV, and PDF variants.

## Prerequisites

go

## Installation

Choose whichever fits your setup:

1. Copy this skill folder into your local skills directory.
2. Clone the repo and symlink or copy the skill into your agent workspace.
3. Add the repo as a git submodule if you manage shared skills centrally.
4. Install it through your internal provisioning or packaging workflow.
5. Download the folder directly from GitHub and place it in your skills collection.

Install command or upstream instructions:

```
Install Tesseract via a pre-built binary package or build from source following the official installation guide
```

## Documentation

- https://tesseract-ocr.github.io/tessdoc/Planning.html

## Source

- [Agent Skill Exchange](https://agentskillexchange.com/skills/tesseract-ocr-engine-for-image-to-text-workflows/)