Python speech-to-text

Open-source Python projects categorized as speech-to-text

Top 23 Python speech-to-text Projects

speech-to-text
  1. faster-whisper

    Faster Whisper transcription with CTranslate2

  3. whisperX

    WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

    Project mention: Making AI Models Faster, Cheaper, and Greener — Here’s How | dev.to | 2025-11-03

    2.3X speed improvement over WhisperX and a 3X speed boost compared to HuggingFace Pipeline with FlashAttention 2 (Insanely Fast Whisper)

  4. pyvideotrans

    Translate the video from one language to another and embed dubbing & subtitles.

  5. speechbrain

    A PyTorch-based Speech Toolkit

    Project mention: 5 must know open-source repositories to build cool AI apps | dev.to | 2025-10-29

    Star the Speech Brain repository ⭐

  6. RealtimeSTT

    A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

  7. SpeechRecognition

    Speech recognition module for Python, supporting several engines and APIs, online and offline.

  8. SenseVoice

    Multilingual Voice Understanding Model

  10. voice-pro

    Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

    Project mention: Show HN: Likes/day as fake profile → built my own dating app in 100 days | news.ycombinator.com | 2025-12-16
  11. speech-to-speech

    Speech To Speech: an effort for an open-sourced and modular GPT-4o

  12. LLaMA-Omni

    LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

  13. whisper-asr-webservice

    OpenAI Whisper ASR Webservice API

  14. lingvo

    Lingvo

  15. whisper-standalone-win

    Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

    Project mention: Show HN: Python Audio Transcription: Convert Speech to Text Locally | news.ycombinator.com | 2025-09-22

    I like this version of Whisper which has diarization built in: https://github.com/Purfview/whisper-standalone-win

  16. whisper-timestamped

    Multilingual Automatic Speech Recognition with word-level timestamps and confidence

  17. kalliope

    Kalliope is a framework that will help you to create your own personal assistant.

  18. Dragonfire

    The open-source virtual assistant for Ubuntu-based Linux distributions

  19. airunner

    Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows

    Project mention: Real-time, offline, voice conversations with custom chatbots | dev.to | 2025-05-16

    Project page here.

  20. StreamSpeech

    StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

  21. quillman

    A voice chat app

  22. whisper-ctranslate2

    Whisper command line client compatible with original OpenAI client based on CTranslate2.

  23. dc_tts

    A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

  24. AI-Waifu-Vtuber

    AI Vtuber for Streaming on Youtube/Twitch

  25. whisper-writer

    💬📝 A small dictation app using OpenAI's Whisper speech recognition model.

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python speech-to-text related posts

  • Video to Text AI: The [2025 Guide] to Unlocking Revenue from Content

    1 project | dev.to | 10 Dec 2025
  • Show HN: Transcribe Your Voice in Terminal Locally

    1 project | news.ycombinator.com | 21 Nov 2025
  • Making AI Models Faster, Cheaper, and Greener — Here’s How

    5 projects | dev.to | 3 Nov 2025
  • Show HN: Python Audio Transcription: Convert Speech to Text Locally

    8 projects | news.ycombinator.com | 22 Sep 2025
  • FFmpeg 8.0 adds Whisper support

    10 projects | news.ycombinator.com | 13 Aug 2025
  • Anthropic teams use Claude Code

    4 projects | news.ycombinator.com | 24 Jul 2025
  • Ask HN: What Speaker Diarization tools should I look into?

    1 project | news.ycombinator.com | 23 Jul 2025

Index

What are some of the best open-source speech-to-text projects in Python? This list will help you:

#   Project                  Stars
1   faster-whisper           19,503
2   whisperX                 19,239
3   pyvideotrans             15,550
4   speechbrain              10,956
5   RealtimeSTT               9,043
6   SpeechRecognition         8,913
7   SenseVoice                7,202
8   voice-pro                 5,202
9   speech-to-speech          4,251
10  LLaMA-Omni                3,103
11  whisper-asr-webservice    3,070
12  lingvo                    2,856
13  whisper-standalone-win    2,740
14  whisper-timestamped       2,700
15  kalliope                  1,750
16  Dragonfire                1,398
17  airunner                  1,274
18  StreamSpeech              1,213
19  quillman                  1,182
20  whisper-ctranslate2       1,170
21  dc_tts                    1,160
22  AI-Waifu-Vtuber             984
23  whisper-writer              974
