You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
51 lines
2.0 KiB
51 lines
2.0 KiB
|
|
dataset info refer to [VoxCeleb](https://www.robots.ox.ac.uk/~vgg/data/voxceleb/index.html#about)
|
|
|
|
sv0 - speaker verfication with softmax backend etc, all python code
|
|
more info refer to the sv0/readme.txt
|
|
|
|
sv1 - dependence on kaldi, speaker verfication with plda/sc backend,
|
|
more info refer to the sv1/readme.txt
|
|
|
|
|
|
## VoxCeleb2 preparation
|
|
|
|
VoxCeleb2 audio files are released in m4a format. All the VoxCeleb2 m4a audio files must be converted in wav files before feeding them in PaddleSpeech.
|
|
Please, follow these steps to prepare the dataset correctly:
|
|
|
|
1. Download Voxceleb2.
|
|
You can find download instructions here: http://www.robots.ox.ac.uk/~vgg/data/voxceleb/
|
|
|
|
2. Convert .m4a to wav
|
|
VoxCeleb2 stores files with the m4a audio format. To use them in PaddleSpeech, you have to convert all the m4a audio files into wav files.
|
|
|
|
``` shell
|
|
ffmpeg -y -i %s -ac 1 -vn -acodec pcm_s16le -ar 16000 %s
|
|
```
|
|
|
|
You can do the conversion using ffmpeg https://gist.github.com/seungwonpark/4f273739beef2691cd53b5c39629d830). This operation might take several hours and should be only once.
|
|
|
|
3. Put all the wav files in a folder called `wav`. You should have something like `voxceleb2/wav/id*/*.wav` (e.g, `voxceleb2/wav/id00012/21Uxsk56VDQ/00001.wav`)
|
|
|
|
|
|
## voxceleb dataset summary
|
|
|
|
|
|
|dataset | vox1 - dev | vox1 - test |vox2 - dev| vox2 - test|
|
|
|---------|-----------|------------|-----------|----------|
|
|
|spks | 1211 |40 | 5994 | 118|
|
|
|utts | 148642 | 4874 | 1092009 |36273|
|
|
| time(h) | 340.4 | 11.2 | 2360.2 |79.9 |
|
|
|
|
|
|
## trial summary
|
|
|
|
| trial | filename | nums | positive | negative |
|
|
|--------|-----------|--------|-------|------|
|
|
| VoxCeleb1 | veri_test.txt | 37720 | 18860 | 18860 |
|
|
| VoxCeleb1(cleaned) | veri_test2.txt | 37611 | 18802 | 18809 |
|
|
| VoxCeleb1-H | list_test_hard.txt | 552536 | 276270 | 276266 |
|
|
|VoxCeleb1-H(cleaned) |list_test_hard2.txt | 550894 | 275488 | 275406 |
|
|
|VoxCeleb1-E | list_test_all.txt | 581480 | 290743 | 290737 |
|
|
|VoxCeleb1-E(cleaned) | list_test_all2.txt |579818 |289921 |289897 |
|