[Hackathon 7th] fix voc1 readme.md in CSMSC (#3915)

* fix

* fix

* fix md

* fix
pull/3951/head
cyberslack_lee 2 weeks ago committed by GitHub
parent 2d7cf7f0e6
commit 73beb187da
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194

@@ -4,6 +4,18 @@ This example contains code used to train a [parallel wavegan](http://arxiv.org/a
### Download and Extract
Download CSMSC from its [official website](https://test.data-baker.com/data/index/TNtts/) and extract it to `~/datasets`. The dataset is then in the directory `~/datasets/BZNSYP`.
After processing the data, the ``BZNSYP`` directory will look like this:
```text
BZNSYP
├── Wave
│   └─ *.wav files (audio speech)
├── PhoneLabeling
│   └─ *.interval files (alignment between phonemes and durations)
└── ProsodyLabeling
    └─ 000001-010000.txt (text with prosodic labels and pinyin)
```
This experiment uses only the `*.wav` files in the `Wave` directory.
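As a quick sanity check after extraction, the layout above can be verified with a few lines of Python. This is only a minimal sketch: the helper names `missing_subdirs` and `list_wavs` are hypothetical and not part of the repo, and the root path is assumed to be `~/datasets/BZNSYP` as described above.

```python
from pathlib import Path

# Sub-directories named in the directory layout above.
EXPECTED_SUBDIRS = ("Wave", "PhoneLabeling", "ProsodyLabeling")


def missing_subdirs(root):
    """Return the expected sub-directories that are absent under `root`."""
    root = Path(root).expanduser()
    return [name for name in EXPECTED_SUBDIRS if not (root / name).is_dir()]


def list_wavs(root):
    """Return the sorted *.wav paths found directly under `root`/Wave."""
    wave_dir = Path(root).expanduser() / "Wave"
    return sorted(wave_dir.glob("*.wav"))
```

For example, `missing_subdirs("~/datasets/BZNSYP")` should return an empty list when the dataset is extracted as described, and `len(list_wavs("~/datasets/BZNSYP"))` gives the number of audio files the experiment will see.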
### Get MFA Result and Extract
We use [MFA](https://github.com/MontrealCorpusTools/Montreal-Forced-Aligner) results to trim silence at the edges of the audio.
You can download the alignments here: [baker_alignment_tone.tar.gz](https://paddlespeech.bj.bcebos.com/MFA/BZNSYP/with_tone/baker_alignment_tone.tar.gz), or train your own MFA model by following the [mfa example](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/other/mfa) in our repo.
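Once the archive is downloaded, extracting it could look like the following. This is a sketch under assumptions, not the repo's own tooling: `extract_archive` is a hypothetical helper, the destination directory is up to you, and the archive's internal layout is not asserted here (the function simply reports the top-level names it finds).

```python
import tarfile
from pathlib import Path


def extract_archive(archive_path, dest_dir):
    """Extract a .tar.gz archive into `dest_dir`; return its top-level names."""
    dest = Path(dest_dir).expanduser()
    dest.mkdir(parents=True, exist_ok=True)
    with tarfile.open(Path(archive_path).expanduser(), "r:gz") as tar:
        names = tar.getnames()
        # Use the safer "data" extraction filter where this Python provides it.
        if hasattr(tarfile, "data_filter"):
            tar.extractall(dest, filter="data")
        else:
            tar.extractall(dest)
    # Keep only the first path component of each member name.
    return sorted({name.split("/")[0] for name in names})
```

For example, `extract_archive("baker_alignment_tone.tar.gz", ".")` unpacks the downloaded file into the current directory and returns the names of whatever top-level entries the archive contains.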
