Top 23 Python speech-to-text Projects
-
2.3X speed improvement over WhisperX and a 3X speed boost compared to HuggingFace Pipeline with FlashAttention 2 (Insanely Fast Whisper)
-
Star the Speech Brain repository ⭐
-
RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
-
SpeechRecognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
-
voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
Project mention: Show HN: Likes/day as fake profile → built my own dating app in 100 days | news.ycombinator.com | 2025-12-16
-
LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
-
whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
Project mention: Show HN: Python Audio Transcription: Convert Speech to Text Locally | news.ycombinator.com | 2025-09-22
I like this version of Whisper which has diarization built in: https://github.com/Purfview/whisper-standalone-win
-
whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
-
airunner
Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows
-
StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
-
whisper-ctranslate2
Whisper command-line client, based on CTranslate2, that is compatible with the original OpenAI client.
Python speech-to-text related posts
-
Video to Text AI: The [2025 Guide] to Unlocking Revenue from Content
-
Show HN: Transcribe Your Voice in Terminal Locally
-
Making AI Models Faster, Cheaper, and Greener — Here’s How
-
Show HN: Python Audio Transcription: Convert Speech to Text Locally
-
FFmpeg 8.0 adds Whisper support
-
Anthropic teams use Claude Code
-
Ask HN: What Speaker Diarization tools should I look into?
- A note from our sponsor - Stream getstream.io | 22 Dec 2025
Index
What are some of the best open-source speech-to-text projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | faster-whisper | 19,503 |
| 2 | whisperX | 19,239 |
| 3 | pyvideotrans | 15,550 |
| 4 | speechbrain | 10,956 |
| 5 | RealtimeSTT | 9,043 |
| 6 | SpeechRecognition | 8,913 |
| 7 | SenseVoice | 7,202 |
| 8 | voice-pro | 5,202 |
| 9 | speech-to-speech | 4,251 |
| 10 | LLaMA-Omni | 3,103 |
| 11 | whisper-asr-webservice | 3,070 |
| 12 | lingvo | 2,856 |
| 13 | whisper-standalone-win | 2,740 |
| 14 | whisper-timestamped | 2,700 |
| 15 | kalliope | 1,750 |
| 16 | Dragonfire | 1,398 |
| 17 | airunner | 1,274 |
| 18 | StreamSpeech | 1,213 |
| 19 | quillman | 1,182 |
| 20 | whisper-ctranslate2 | 1,170 |
| 21 | dc_tts | 1,160 |
| 22 | AI-Waifu-Vtuber | 984 |
| 23 | whisper-writer | 974 |