#

wavtokenizer

Here are 2 public repositories matching this topic...

jishengpeng / WavChat

A Survey of Spoken Dialogue Models (60 pages)

streaming duplex speech moshi speech-representation encodec gpt-4o speech-language-model spoken-dialogue-models modal-alignment intreaction mini-omni llama-omni wavtokenizer

Updated Nov 28, 2024

mbzuai-oryx / LLMVoX

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

audio text-to-speech streaming transformers tts codec omni voice-assistant neural-speech-synthesis mbzuai llm multimodal-large-language-models mini-omni wavtokenizer audiollm llmvox

Updated May 16, 2025
Python

Improve this page

Add a description, image, and links to the wavtokenizer topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the wavtokenizer topic, visit your repo's landing page and select "manage topics."