Python Asr

Open-source Python projects categorized as Asr

Top 23 Python Asr Projects

  1. whisperX

    WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

    Project mention: Making AI Models Faster, Cheaper, and Greener — Here’s How | dev.to | 2025-11-03

    2.3X speed improvement over WhisperX and a 3X speed boost compared to HuggingFace Pipeline with FlashAttention 2 (Insanely Fast Whisper)

  2. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  3. NeMo

    A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

    Project mention: FFmpeg 8.0 adds Whisper support | news.ycombinator.com | 2025-08-13

    git clone https://github.com/NVIDIA/NeMo.git nemo

  4. PaddleSpeech

    Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

  5. speechbrain

    A PyTorch-based Speech Toolkit

    Project mention: 5 must know open-source repositories to build cool AI apps | dev.to | 2025-10-29

    Star the Speech Brain repository ⭐

  6. SenseVoice

    Multilingual Voice Understanding Model

  7. youtube-transcript-api

    This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headless browser, like other selenium based solutions do!

    Project mention: Show HN: YouTubeTldw: ad‑free, login‑free YouTube summaries in a flash | news.ycombinator.com | 2025-07-29

    - en ("English")

    If you are sure that the described cause is not responsible for this error and that a transcript should be retrievable, please create an issue at https://github.com/jdepoix/youtube-transcript-api/issues. Please add which version of youtube_transcript_api you are using and provide the information needed to replicate the error. Also make sure that there are no open issues which already describe your problem!

  8. wenet

    Production First and Production Ready End-to-End Speech Recognition Toolkit

    Project mention: CosyVoice 2025 Complete Guide: The Ultimate Multi-lingual Text-to-Speech Solution | dev.to | 2025-12-15

    WeNet - Speech Recognition Toolkit

  9. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  10. whisper-asr-webservice

    OpenAI Whisper ASR Webservice API

  11. lingvo

    Lingvo

  12. whisper-standalone-win

    Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

    Project mention: Show HN: Python Audio Transcription: Convert Speech to Text Locally | news.ycombinator.com | 2025-09-22

    I like this version of Whisper which has diarization built in: https://github.com/Purfview/whisper-standalone-win

  13. whisper-timestamped

    Multilingual Automatic Speech Recognition with word-level timestamps and confidence

  14. vosk-server

    WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

  15. StreamSpeech

    StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

  16. SincNet

    SincNet is a neural architecture for efficiently processing raw audio samples.

  17. pykaldi

    A Python wrapper for Kaldi

  18. whisper.api

    This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.

  19. CrisperWhisper

    Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

  20. cheetah

    On-device streaming speech-to-text engine powered by deep learning (by Picovoice)

  21. pyannote-whisper

  22. voicemode

    VoiceMode MCP brings natural conversations to Claude Code

    Project mention: Gemini CLI | news.ycombinator.com | 2025-06-25
  23. leopard

    On-device speech-to-text engine powered by deep learning

  24. reverb

    Open source inference code for Rev's model

    Project mention: Video to Text AI: The [2025 Guide] to Unlocking Revenue from Content | dev.to | 2025-12-10

    1. Rev

  25. deepgram-python-sdk

    Official Python SDK for Deepgram.

    Project mention: How to Deploy a Voice AI Agent Using Railway for eCommerce Success | dev.to | 2025-12-20

    Deepgram STT – Real-time speech-to-text alternative

  26. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Asr discussion

Python Asr related posts

  • Video to Text AI: The [2025 Guide] to Unlocking Revenue from Content

    1 project | dev.to | 10 Dec 2025
  • Making AI Models Faster, Cheaper, and Greener — Here’s How

    5 projects | dev.to | 3 Nov 2025
  • FFmpeg 8.0 adds Whisper support

    10 projects | news.ycombinator.com | 13 Aug 2025
  • Ask HN: What Speaker Diarization tools should I look into?

    1 project | news.ycombinator.com | 23 Jul 2025
  • Ask HN: What API or software are people using for transcription?

    10 projects | news.ycombinator.com | 9 Jun 2025
  • The Technology Behind YouTube’s Auto-Captioning System

    1 project | dev.to | 29 Apr 2025
  • Show HN: Mikey – No bot meeting notetaker for Windows

    6 projects | news.ycombinator.com | 12 Feb 2025
  • A note from our sponsor - Stream
    getstream.io | 22 Dec 2025
    Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure. Learn more →

Index

What are some of the best open-source Asr projects in Python? This list will help you:

# Project Stars
1 whisperX 19,239
2 NeMo 16,336
3 PaddleSpeech 12,449
4 speechbrain 10,956
5 SenseVoice 7,202
6 youtube-transcript-api 6,560
7 wenet 4,953
8 whisper-asr-webservice 3,070
9 lingvo 2,856
10 whisper-standalone-win 2,740
11 whisper-timestamped 2,700
12 vosk-server 1,214
13 StreamSpeech 1,213
14 SincNet 1,200
15 pykaldi 1,031
16 whisper.api 902
17 CrisperWhisper 880
18 cheetah 648
19 pyannote-whisper 647
20 voicemode 501
21 leopard 469
22 reverb 434
23 deepgram-python-sdk 375

Sponsored
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io

Did you know that Python is
the 2nd most popular programming language
based on number of references?