This is a restructured and rewritten version of bshall/UniversalVocoding. The main difference is that the model is compiled into a TorchScript module during training, so it can be loaded for inference anywhere without a Python dependency.
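The TorchScript conversion mentioned above can be sketched roughly as follows. This is a minimal illustration, not the repo's actual training code: `TinyVocoder` is a hypothetical stand-in for the real model, and the real export happens inside the training loop.

```python
import torch
import torch.nn as nn


class TinyVocoder(nn.Module):
    """Hypothetical stand-in for the actual vocoder model."""

    def forward(self, mel: torch.Tensor) -> torch.Tensor:
        # Placeholder "synthesis": collapse the mel-bin dimension.
        return mel.mean(dim=-1)


model = TinyVocoder()

# torch.jit.script compiles the module so it can later be loaded
# with torch.jit.load, without the original Python class definition.
scripted = torch.jit.script(model)
scripted.save("vocoder.pt")
```

Once saved this way, the checkpoint can be deserialized from Python or C++ (via libtorch) without importing this project's code.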
Multiple directories containing audio files can be processed at the same time.
```
python preprocess.py VCTK-Corpus LibriTTS/train-clean-100 preprocessed
```

Then train the model on the preprocessed data:

```
python train.py preprocessed
```

You can load a trained model anywhere and generate multiple waveforms in parallel.
```python
import torch

vocoder = torch.jit.load("vocoder.pt")
mels = [
    torch.randn(100, 80),
    torch.randn(200, 80),
    torch.randn(300, 80),
]
with torch.no_grad():
    wavs = vocoder.generate(mels)
```

Empirically, with the default architecture you can generate 100 samples at the same time on an NVIDIA GTX 1080 Ti.