feat: Real-time French-to-English translator for Apple Silicon #136

woziii · 2024-11-06T15:13:32Z

Replace STT with LightningWhisperMLX Medium for Apple Silicon
Switch LLM to Llama-3.2-3B-Instruct-8bit MLX format
Update TTS to Melo TTS
Keep Silero VAD for voice detection

Performance:

Optimize latency (~4s end-to-end)
Focus on French-to-English translation
Add video call platforms support (Teams, Zoom, FaceTime)
Test & validate on M2 chip with 22GB RAM

Changes:

Modify system prompt for translation tasks
Remove CUDA components
Streamline pipeline for Apple Silicon
Add real-time processing optimizations

Tested on MacBook Air M2, compatible with major video call platforms except Google Meet.

Based on original speech-to-speech project, inspired by Andrés Marafioti's work.

- Replace STT with LightningWhisperMLX Medium for Apple Silicon - Switch LLM to Llama-3.2-3B-Instruct-8bit MLX format - Update TTS to Melo TTS - Keep Silero VAD for voice detection Performance: - Optimize latency (~4s end-to-end) - Focus on French-to-English translation - Add video call platforms support (Teams, Zoom, FaceTime) - Test & validate on M2 chip with 22GB RAM Changes: - Modify system prompt for translation tasks - Remove CUDA components - Streamline pipeline for Apple Silicon - Add real-time processing optimizations Tested on MacBook Air M2, compatible with major video call platforms except Google Meet. Based on original speech-to-speech project, inspired by Andrés Marafioti's work.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Real-time French-to-English translator for Apple Silicon #136

feat: Real-time French-to-English translator for Apple Silicon #136

Uh oh!

woziii commented Nov 6, 2024

Labels

2 participants

feat: Real-time French-to-English translator for Apple Silicon #136

Are you sure you want to change the base?

feat: Real-time French-to-English translator for Apple Silicon #136

Uh oh!

Conversation

woziii commented Nov 6, 2024

Labels

2 participants