---
title: "RealtimeSTT Low-Latency Speech-to-Text Python Library"
description: "RealtimeSTT is a Python library for real-time speech-to-text with advanced voice activity detection, wake word activation, and instant transcription. It combines WebRTC VAD, Silero VAD, and Faster Whisper for production-grade voice input in agent applications."
verification: "security_reviewed"
source: "https://github.com/KoljaB/RealtimeSTT"
category:
  - "Media & Transcription"
framework:
  - "Custom Agents"
tool_ecosystem:
  github_repo: "koljab/realtimestt"
  github_stars: 9615
---

# RealtimeSTT Low-Latency Speech-to-Text Python Library

RealtimeSTT is a Python library for real-time speech-to-text with advanced voice activity detection, wake word activation, and instant transcription. It combines WebRTC VAD, Silero VAD, and Faster Whisper for production-grade voice input in agent applications.

## Installation

Choose whichever fits your setup:

1. Copy this skill folder into your local skills directory.
2. Clone the repo and symlink or copy the skill into your agent workspace.
3. Add the repo as a git submodule if you manage shared skills centrally.
4. Install it through your internal provisioning or packaging workflow.
5. Download the folder directly from GitHub and place it in your skills collection.

## Source

- [Agent Skill Exchange](https://agentskillexchange.com/skills/realtimestt-low-latency-speech-to-text-python/)