Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure. Learn more →
Top 23 Python Asr Projects
-
2.3X speed improvement over WhisperX and a 3X speed boost compared to HuggingFace Pipeline with FlashAttention 2 (Insanely Fast Whisper)
-
Stream
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
-
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
git clone https://github.com/NVIDIA/NeMo.git nemo
-
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
-
Star the Speech Brain repository ⭐
-
-
youtube-transcript-api
This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headless browser, like other selenium based solutions do!
Project mention: Show HN: YouTubeTldw: ad‑free, login‑free YouTube summaries in a flash | news.ycombinator.com | 2025-07-29- en ("English")
If you are sure that the described cause is not responsible for this error and that a transcript should be retrievable, please create an issue at https://github.com/jdepoix/youtube-transcript-api/issues. Please add which version of youtube_transcript_api you are using and provide the information needed to replicate the error. Also make sure that there are no open issues which already describe your problem!
- Project mention: CosyVoice 2025 Complete Guide: The Ultimate Multi-lingual Text-to-Speech Solution | dev.to | 2025-12-15
WeNet - Speech Recognition Toolkit
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
-
-
whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
Project mention: Show HN: Python Audio Transcription: Convert Speech to Text Locally | news.ycombinator.com | 2025-09-22I like this version of Whisper which has diarization built in: https://github.com/Purfview/whisper-standalone-win
-
whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
-
-
StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
-
-
-
whisper.api
This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.
-
CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
-
-
-
- Project mention: Video to Text AI: The [2025 Guide] to Unlocking Revenue from Content | dev.to | 2025-12-10
1. Rev
- Project mention: How to Deploy a Voice AI Agent Using Railway for eCommerce Success | dev.to | 2025-12-20
Deepgram STT – Real-time speech-to-text alternative
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python Asr discussion
Python Asr related posts
-
Video to Text AI: The [2025 Guide] to Unlocking Revenue from Content
-
Making AI Models Faster, Cheaper, and Greener — Here’s How
-
FFmpeg 8.0 adds Whisper support
-
Ask HN: What Speaker Diarization tools should I look into?
-
Ask HN: What API or software are people using for transcription?
-
The Technology Behind YouTube’s Auto-Captioning System
-
Show HN: Mikey – No bot meeting notetaker for Windows
- A note from our sponsor - Stream getstream.io | 22 Dec 2025
Index
What are some of the best open-source Asr projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | whisperX | 19,239 |
| 2 | NeMo | 16,336 |
| 3 | PaddleSpeech | 12,449 |
| 4 | speechbrain | 10,956 |
| 5 | SenseVoice | 7,202 |
| 6 | youtube-transcript-api | 6,560 |
| 7 | wenet | 4,953 |
| 8 | whisper-asr-webservice | 3,070 |
| 9 | lingvo | 2,856 |
| 10 | whisper-standalone-win | 2,740 |
| 11 | whisper-timestamped | 2,700 |
| 12 | vosk-server | 1,214 |
| 13 | StreamSpeech | 1,213 |
| 14 | SincNet | 1,200 |
| 15 | pykaldi | 1,031 |
| 16 | whisper.api | 902 |
| 17 | CrisperWhisper | 880 |
| 18 | cheetah | 648 |
| 19 | pyannote-whisper | 647 |
| 20 | voicemode | 501 |
| 21 | leopard | 469 |
| 22 | reverb | 434 |
| 23 | deepgram-python-sdk | 375 |