PaddleSpeech/examples/voxceleb/README.md


dataset info refer to [VoxCeleb](https://www.robots.ox.ac.uk/~vgg/data/voxceleb/index.html#about)

sv0 - speaker verfication with softmax backend etc, all python code
      more info refer to the sv0/readme.txt

sv1 - dependence on kaldi, speaker verfication with plda/sc backend, 
      more info refer to the sv1/readme.txt


## VoxCeleb2 preparation

VoxCeleb2 audio files are released in m4a format. All the VoxCeleb2 m4a audio files must be converted in wav files before feeding them in PaddleSpeech. 
Please, follow these steps to prepare the dataset correctly:

1. Download Voxceleb2.
You can find download instructions here: http://www.robots.ox.ac.uk/~vgg/data/voxceleb/

2. Convert .m4a to wav
VoxCeleb2 stores files with the m4a audio format. To use them in PaddleSpeech,  you have to convert all the m4a audio files into wav files.

``` shell
ffmpeg -y -i %s -ac 1 -vn -acodec pcm_s16le -ar 16000 %s
```

You can do the conversion using ffmpeg  https://gist.github.com/seungwonpark/4f273739beef2691cd53b5c39629d830). This operation might take several hours and should be only once.

3. Put all the wav files in a folder called `wav`. You should have something like `voxceleb2/wav/id*/*.wav` (e.g, `voxceleb2/wav/id00012/21Uxsk56VDQ/00001.wav`)


## voxceleb dataset summary


|dataset | vox1 - dev | vox1 - test |vox2 - dev| vox2 - test|
|---------|-----------|------------|-----------|----------|
|spks    |  1211       |40     |      5994        | 118|
|utts     | 148642    | 4874   | 1092009     |36273|
| time(h) | 340.4 | 11.2  | 2360.2  |79.9 |


## trial summary

| trial     | filename |  nums | positive | negative |
|--------|-----------|--------|-------|------|
| VoxCeleb1 | veri_test.txt | 37720 | 18860 | 18860 | 
| VoxCeleb1(cleaned) | veri_test2.txt | 37611 | 18802 | 18809 |
| VoxCeleb1-H | list_test_hard.txt | 552536 | 276270 | 276266 |
|VoxCeleb1-H(cleaned) |list_test_hard2.txt | 550894 | 275488 | 275406 |
|VoxCeleb1-E | list_test_all.txt | 581480 | 290743 | 290737 | 
|VoxCeleb1-E(cleaned) | list_test_all2.txt |579818 |289921 |289897 |
convert voxceleb trial to kaldi format trial 3 years ago
			`dataset info refer to [VoxCeleb](https://www.robots.ox.ac.uk/~vgg/data/voxceleb/index.html#about)`

			`sv0 - speaker verfication with softmax backend etc, all python code`
			`more info refer to the sv0/readme.txt`

			`sv1 - dependence on kaldi, speaker verfication with plda/sc backend,`
			`more info refer to the sv1/readme.txt`
add state 0 to prepare the voxcele data and augment data 3 years ago

			`## VoxCeleb2 preparation`

			`VoxCeleb2 audio files are released in m4a format. All the VoxCeleb2 m4a audio files must be converted in wav files before feeding them in PaddleSpeech.`
			`Please, follow these steps to prepare the dataset correctly:`

			`1. Download Voxceleb2.`
			`You can find download instructions here: http://www.robots.ox.ac.uk/~vgg/data/voxceleb/`

			`2. Convert .m4a to wav`
			`VoxCeleb2 stores files with the m4a audio format. To use them in PaddleSpeech, you have to convert all the m4a audio files into wav files.`

			``` shell
			`ffmpeg -y -i %s -ac 1 -vn -acodec pcm_s16le -ar 16000 %s`
			```

			`You can do the conversion using ffmpeg https://gist.github.com/seungwonpark/4f273739beef2691cd53b5c39629d830). This operation might take several hours and should be only once.`

			3. Put all the wav files in a folder called `wav`. You should have something like `voxceleb2/wav/id/.wav` (e.g, `voxceleb2/wav/id00012/21Uxsk56VDQ/00001.wav`)
add voxceleb dataset and trial info, test=doc 3 years ago

			`## voxceleb dataset summary`


			`\|dataset \| vox1 - dev \| vox1 - test \|vox2 - dev\| vox2 - test\|`
			`\|---------\|-----------\|------------\|-----------\|----------\|`
			`\|spks \| 1211 \|40 \| 5994 \| 118\|`
			`\|utts \| 148642 \| 4874 \| 1092009 \|36273\|`
			`\| time(h) \| 340.4 \| 11.2 \| 2360.2 \|79.9 \|`


			`## trial summary`

			`\| trial \| filename \| nums \| positive \| negative \|`
			`\|--------\|-----------\|--------\|-------\|------\|`
			`\| VoxCeleb1 \| veri_test.txt \| 37720 \| 18860 \| 18860 \|`
			`\| VoxCeleb1(cleaned) \| veri_test2.txt \| 37611 \| 18802 \| 18809 \|`
			`\| VoxCeleb1-H \| list_test_hard.txt \| 552536 \| 276270 \| 276266 \|`
			`\|VoxCeleb1-H(cleaned) \|list_test_hard2.txt \| 550894 \| 275488 \| 275406 \|`
			`\|VoxCeleb1-E \| list_test_all.txt \| 581480 \| 290743 \| 290737 \|`
			`\|VoxCeleb1-E(cleaned) \| list_test_all2.txt \|579818 \|289921 \|289897 \|`