Features

Mylo edited this page Jun 16, 2023 · 1 revision

Features (including planned)

  • 🔊 Text-to-audio
    • 🗣 Text-to-speech
      • 🐶 Bark
        • 🗣 Speech generation
        • 🧬 Voice cloning
        • 🤣 Option to disable the stopping token, letting the AI decide how it wants to continue
    • 🎵 AudioLDM text-to-audio generation
    • 🎵 AudioCraft text-to-audio generation
  • 🔊 Audio-to-audio
    • 🐶 Bark audio-to-audio, using a custom quantizer to deconstruct audio for Bark input
    • 😎 RVC (Retrieval-based Voice Conversion)
      • 🧬 RVC training
      • 🐸 coqui-ai/TTS text-to-speech
  • 🎤 Automatic speech recognition
    • 🎤 Whisper speech recognition