Skip to content

Conversation

@david-ryan-snyder
Copy link
Contributor

@david-ryan-snyder david-ryan-snyder commented May 15, 2018

The PR adds an i-vector and an x-vector recipe for Speakers in the Wild (SITW) (http://www.speech.sri.com/projects/sitw/). The results using x-vectors are currently state-of-the-art, as far as I know.

The recipe is trained on VoxCeleb1 and VoxCeleb2 (http://www.robots.ox.ac.uk/~vgg/data/voxceleb/). Sixty speakers in VoxCeleb1 overlap with SITW. We remove those from VoxCeleb1 prior to training.

FYI @entn-at, @danpovey, @leibny

@david-ryan-snyder david-ryan-snyder changed the title [WIP] [egs] Add recipe for Speakers in the Wild (SITW) [egs] Add recipes for Speakers in the Wild (SITW) May 23, 2018
@david-ryan-snyder
Copy link
Contributor Author

@danpovey, when you get a chance, I think this is OK to merge. The v1 (i-vector) and v2 (x-vector) recipes are similar to the existing speaker recognition recipes.

@danpovey danpovey merged commit 447e964 into kaldi-asr:master May 24, 2018
dpriver pushed a commit to dpriver/kaldi that referenced this pull request Sep 13, 2018
Skaiste pushed a commit to Skaiste/idlak that referenced this pull request Sep 26, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

2 participants