Top 12 Jupyter Notebook speech-recognition Projects

DeepLearningExamples

1 7 14,577 4.0 Jupyter Notebook

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
InfluxDB

www.influxdata.com featured

InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
vosk-api

2 62 13,915 5.6 Jupyter Notebook

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Project mention: Audio-to-Text Transcriber Automated via Termux | dev.to | 2025-10-01

pip install vosk soundfile git clone https://github.com/alphacep/vosk-api.git # download a small model into ~/models # convert audio to 16k mono wav: ffmpeg -i input.mp3 -ar 16000 -ac 1 input_16k.wav # run a small python script: python transcribe_vosk.py input_16k.wav > output.txt
silero-models

3 34 5,504 4.7 Jupyter Notebook

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
whisper-diarization

4 10 5,271 4.6 Jupyter Notebook

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
whisper-jax

5 6 4,636 6.5 Jupyter Notebook

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Speech-Backbones

6 1 595 0.0 Jupyter Notebook

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
whisper-youtube

7 3 406 2.9 Jupyter Notebook

🔉 Youtube Videos Transcription with OpenAI's Whisper
Stream

getstream.io featured

Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
soxan

8 1 267 10.0 Jupyter Notebook

Wav2Vec for speech recognition, classification, and audio classification
vid2cleantxt

9 1 215 2.0 Jupyter Notebook

Python API & command-line tool to easily transcribe speech-based video files into clean text
Multimodal

10 1 11 0.0 Jupyter Notebook

Listen. Write. Speak. Read. Think. (by kritiksoman)
Bangla-Spoken-Number-Recognition

11 1 4 0.0 Jupyter Notebook

recognizing spoken Bangla numbers using MFCCs and CNN.
kwest-whisper-analysis

12 1 4 6.7 Jupyter Notebook

Conducted a comprehensive technical analysis of the Whisper model on M-series hardware, publishing performance benchmarks and implementation guidance for developers.

Project mention: Whisper Speech Recognition on Mac M4: Performance Analysis and Benchmarks | dev.to | 2025-07-01

Repository: https://github.com/theinsyeds/theinsyeds-whisper-analysis

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Jupyter Notebook speech-recognition discussion

Jupyter Notebook speech-recognition related posts

Audio-to-Text Transcriber Automated via Termux

1 project | dev.to | 1 Oct 2025
Ask HN: Using Voice Cloning and TTS to Fine-Tune Diarizer

1 project | news.ycombinator.com | 11 Jun 2024
SignWave: Program to transcribe text, audio files into a sign language animation

2 projects | news.ycombinator.com | 27 May 2024
Infini-Gram: Scaling unbounded n-gram language models to a trillion tokens

4 projects | news.ycombinator.com | 5 May 2024
VOSK Offline Speech Recognition API

1 project | news.ycombinator.com | 13 Apr 2024
Weird A.I. Yankovic, a cursed deep dive into the world of voice cloning

4 projects | news.ycombinator.com | 2 Oct 2023
Working Vosk model?

1 project | /r/learnpython | 29 May 2023
A note from our sponsor - InfluxDB
www.influxdata.com | 23 Dec 2025

InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →

Index

What are some of the best open-source speech-recognition projects in Jupyter Notebook? This list will help you:

#	Project	Stars
1	DeepLearningExamples	14,577
2	vosk-api	13,915
3	silero-models	5,504
4	whisper-diarization	5,271
5	whisper-jax	4,636
6	Speech-Backbones	595
7	whisper-youtube	406
8	soxan	267
9	vid2cleantxt	215
10	Multimodal	11
11	Bangla-Spoken-Number-Recognition	4
12	kwest-whisper-analysis	4

Jupyter Notebook speech-recognition

Top 12 Jupyter Notebook speech-recognition Projects

Jupyter Notebook speech-recognition discussion

Jupyter Notebook speech-recognition related posts

Audio-to-Text Transcriber Automated via Termux

Ask HN: Using Voice Cloning and TTS to Fine-Tune Diarizer

SignWave: Program to transcribe text, audio files into a sign language animation

Infini-Gram: Scaling unbounded n-gram language models to a trillion tokens

VOSK Offline Speech Recognition API

Weird A.I. Yankovic, a cursed deep dive into the world of voice cloning

Working Vosk model?

Index

Did you know that Jupyter Notebook is the 13th most popular programming language based on number of references?

Did you know that Jupyter Notebook is
the 13th most popular programming language
based on number of references?