Top 23 Python speech-synthesis Projects

TTS

1 244 43,441 8.1 Python

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Project mention: AI Twin — Voice Cloning with Text-to-Speech | dev.to | 2025-12-16

Coqui TTS - The amazing text-to-speech library that powers this project
Stream

getstream.io featured

Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
NeMo

2 31 16,336 9.9 Python

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Project mention: FFmpeg 8.0 adds Whisper support | news.ycombinator.com | 2025-08-13

git clone https://github.com/NVIDIA/NeMo.git nemo
PaddleSpeech

3 6 12,430 8.4 Python

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
espnet

4 15 9,647 9.9 Python

End-to-End Speech Processing Toolkit
edge-tts

5 9 9,620 7.9 Python

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Project mention: Show HN: Voice Cloning and Multilingual TTS in One Click (Windows) | news.ycombinator.com | 2025-01-26

There is a MIT license in the repo. In that sense it's open source.
It's using "Edge TTS", which I believe means use API keys stolen [1] from Microsoft Edge and hope Microsoft doesn't sue you, non jolly-roger flying internet users beware.
Can't speak to other models and their licenses, I stopped looking after I saw this since I don't feel the need to use this.
[1] https://github.com/rany2/edge-tts/blob/ac41fb85ab2b2b48fef8a...
Amphion

6 6 9,546 7.6 Python

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
so-vits-svc-fork

7 16 9,208 9.1 Python

so-vits-svc fork with realtime support, improved interface and more features.
InfluxDB

www.influxdata.com featured

InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
EmotiVoice

8 5 8,367 7.9 Python

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
vits

9 6 7,746 0.0 Python

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
StyleTTS2

10 7 6,033 7.7 Python

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
voice-pro

11 13 5,202 7.1 Python

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

Project mention: Show HN: Likes/day as fake profile → built my own dating app in 100 days | news.ycombinator.com | 2025-12-16
DiffSinger

12 1 4,678 2.1 Python

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
speech-to-speech

13 3 4,251 8.7 Python

Speech To Speech: an effort for an open-sourced and modular GPT4-o
metavoice-src

14 5 4,191 7.8 Python

Foundational model for human-like, expressive TTS
TensorFlowTTS

15 6 3,982 0.0 Python

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
abogen

16 2 3,964 9.6 Python

Generate audiobooks from EPUBs, PDFs and text with synchronized captions.

Project mention: Abogen – Generate audiobooks from EPUBs, PDFs and text | news.ycombinator.com | 2025-08-09

It's probably due to the unusual sound format, 24kHz PCM, and the fact that it was somehow forced into a WebM container, which only supports the Vorbis and Opus formats.
It looks like they created it using the "higher quality" ffmpeg command line, except for the "webm" final extension, producing the opposite of what's described as "an MP4 file that's compatible with more devices".
https://github.com/denizsafak/abogen/tree/main/demo#for-high...
RealtimeTTS

17 1 3,672 9.1 Python

Converts text to speech in realtime
VoxCPM

18 1 2,988 8.4 Python

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

Project mention: VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and Voice Cloning | news.ycombinator.com | 2025-12-05
tacotron

19 3 2,984 0.0 Python

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
lingvo

20 1 2,856 6.0 Python

Lingvo
Tacotron-2

21 1 2,315 0.0 Python

DeepMind's Tacotron-2 Tensorflow implementation
hifi-gan

22 5 2,243 0.0 Python

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
WaveRNN

23 5 2,167 0.0 Python

WaveRNN Vocoder + TTS
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python speech-synthesis discussion

Python speech-synthesis related posts

Kitten TTS: 25MB CPU-Only, Open-Source Voice Model

19 projects | news.ycombinator.com | 5 Aug 2025
Show HN: Voice Cloning and Multilingual TTS in One Click (Windows)

2 projects | news.ycombinator.com | 26 Jan 2025
Edge TTS

4 projects | news.ycombinator.com | 22 Jan 2025
Show HN: Voice-Pro – AI Voice Cloning Magic: Transform Any Voice in 15 Seconds

10 projects | news.ycombinator.com | 27 Nov 2024
Play 3.0 mini – A lightweight, reliable, cost-efficient Multilingual TTS model

5 projects | news.ycombinator.com | 14 Oct 2024
Show HN: Offline audiobook from any format with one CLI command

7 projects | news.ycombinator.com | 6 Oct 2024
Ask HN: What is the state of OSS voice cloning?

6 projects | news.ycombinator.com | 30 Sep 2024
A note from our sponsor - SaaSHub
www.saashub.com | 22 Dec 2025

SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source speech-synthesis projects in Python? This list will help you:

#	Project	Stars
1	TTS	43,441
2	NeMo	16,336
3	PaddleSpeech	12,430
4	espnet	9,647
5	edge-tts	9,620
6	Amphion	9,546
7	so-vits-svc-fork	9,208
8	EmotiVoice	8,367
9	vits	7,746
10	StyleTTS2	6,033
11	voice-pro	5,202
12	DiffSinger	4,678
13	speech-to-speech	4,251
14	metavoice-src	4,191
15	TensorFlowTTS	3,982
16	abogen	3,964
17	RealtimeTTS	3,672
18	VoxCPM	2,988
19	tacotron	2,984
20	lingvo	2,856
21	Tacotron-2	2,315
22	hifi-gan	2,243
23	WaveRNN	2,167

Python speech-synthesis

Top 23 Python speech-synthesis Projects

Python speech-synthesis discussion

Python speech-synthesis related posts

Kitten TTS: 25MB CPU-Only, Open-Source Voice Model

Show HN: Voice Cloning and Multilingual TTS in One Click (Windows)

Edge TTS

Show HN: Voice-Pro – AI Voice Cloning Magic: Transform Any Voice in 15 Seconds

Play 3.0 mini – A lightweight, reliable, cost-efficient Multilingual TTS model

Show HN: Offline audiobook from any format with one CLI command

Ask HN: What is the state of OSS voice cloning?

Index

Did you know that Python is the 2nd most popular programming language based on number of references?

Did you know that Python is
the 2nd most popular programming language
based on number of references?