- Notifications
You must be signed in to change notification settings - Fork 2.9k
Closed
Description
Refactor the whole data preprocessor for DeepSpeech2 (e.g. re-design classes, re-organize dir, add augmentation interfaces etc.):
- Refactor the data preprocessor with newly added classes, e.g.
AudioSegment,SpeechSegment,TextFeaturizer,AudioFeaturizer,SpeechFeaturizeretc. - Add data augmentation interfaces and classes e.g.
AugmentorBase,AugmentationPipeline,VolumePerturbAugmentoretc., to make it easier to add more data augmentation models. - Separate normalizer's mean-std computing from
DataGenerator. AddFeatureNormalizer. - - Add an independent tool
compute_mean_std.pyfor users to create mean_std file before training. - Re-organize
datadirectory intodatasetsanddata_utils. - Test for convergence.
- Add module, class, function docs.
- Update README.md.
Metadata
Metadata
Assignees
Labels
No labels