-
So, I went digging and found an open-source project called Real-Time Voice Cloning. Game changer. I was able to dive into the code, mess with the pronunciation, pitch, timing—you name it. I even made the assistant reply in pirate speak (long story, rum was involved... 🏴☠️).
-
Stream
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
-
TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) (by mozilla)
Find a community-based project: Tools like Hugging Face or Mozilla TTS have super active communities that welcome newcomers and share tips constantly.
-
2. Coqui AI