|
|
@ -46,6 +46,18 @@ The input of this demo should be a text of the specific language that can be pas
|
|
|
|
```bash
|
|
|
|
```bash
|
|
|
|
paddlespeech tts --am fastspeech2_vctk --voc pwgan_vctk --input "hello, boys" --lang en --spk_id 0
|
|
|
|
paddlespeech tts --am fastspeech2_vctk --voc pwgan_vctk --input "hello, boys" --lang en --spk_id 0
|
|
|
|
```
|
|
|
|
```
|
|
|
|
|
|
|
|
- Chinese English Mixed, multi-speaker
|
|
|
|
|
|
|
|
You can change `spk_id` here.
|
|
|
|
|
|
|
|
```bash
|
|
|
|
|
|
|
|
# The `am` must be `fastspeech2_mix`!
|
|
|
|
|
|
|
|
# The `lang` must be `mix`!
|
|
|
|
|
|
|
|
# The voc must be chinese datasets' voc now!
|
|
|
|
|
|
|
|
# spk 174 is csmcc, spk 175 is ljspeech
|
|
|
|
|
|
|
|
paddlespeech tts --am fastspeech2_mix --voc hifigan_csmsc --lang mix --input "热烈欢迎您在 Discussions 中提交问题,并在 Issues 中指出发现的 bug。此外,我们非常希望您参与到 Paddle Speech 的开发中!" --spk_id 174 --output mix_spk174.wav
|
|
|
|
|
|
|
|
paddlespeech tts --am fastspeech2_mix --voc hifigan_aishell3 --lang mix --input "热烈欢迎您在 Discussions 中提交问题,并在 Issues 中指出发现的 bug。此外,我们非常希望您参与到 Paddle Speech 的开发中!" --spk_id 174 --output mix_spk174_aishell3.wav
|
|
|
|
|
|
|
|
paddlespeech tts --am fastspeech2_mix --voc pwgan_csmsc --lang mix --input "我们的声学模型使用了 Fast Speech Two, 声码器使用了 Parallel Wave GAN and Hifi GAN." --spk_id 175 --output mix_spk175_pwgan.wav
|
|
|
|
|
|
|
|
paddlespeech tts --am fastspeech2_mix --voc hifigan_csmsc --lang mix --input "我们的声学模型使用了 Fast Speech Two, 声码器使用了 Parallel Wave GAN and Hifi GAN." --spk_id 175 --output mix_spk175.wav
|
|
|
|
|
|
|
|
```
|
|
|
|
Usage:
|
|
|
|
Usage:
|
|
|
|
|
|
|
|
|
|
|
|
```bash
|
|
|
|
```bash
|
|
|
@ -110,19 +122,29 @@ The input of this demo should be a text of the specific language that can be pas
|
|
|
|
Here is a list of pretrained models released by PaddleSpeech that can be used by command and python API:
|
|
|
|
Here is a list of pretrained models released by PaddleSpeech that can be used by command and python API:
|
|
|
|
|
|
|
|
|
|
|
|
- Acoustic model
|
|
|
|
- Acoustic model
|
|
|
|
| Model | Language
|
|
|
|
| Model | Language |
|
|
|
|
| :--- | :---: |
|
|
|
|
| :--- | :---: |
|
|
|
|
| speedyspeech_csmsc| zh
|
|
|
|
| speedyspeech_csmsc | zh |
|
|
|
|
| fastspeech2_csmsc| zh
|
|
|
|
| fastspeech2_csmsc | zh |
|
|
|
|
| fastspeech2_aishell3| zh
|
|
|
|
| fastspeech2_ljspeech | en |
|
|
|
|
| fastspeech2_ljspeech| en
|
|
|
|
| fastspeech2_aishell3 | zh |
|
|
|
|
| fastspeech2_vctk| en
|
|
|
|
| fastspeech2_vctk | en |
|
|
|
|
|
|
|
|
| fastspeech2_cnndecoder_csmsc | zh |
|
|
|
|
|
|
|
|
| fastspeech2_mix | mix |
|
|
|
|
|
|
|
|
| tacotron2_csmsc | zh |
|
|
|
|
|
|
|
|
| tacotron2_ljspeech | en |
|
|
|
|
|
|
|
|
|
|
|
|
- Vocoder
|
|
|
|
- Vocoder
|
|
|
|
| Model | Language
|
|
|
|
| Model | Language |
|
|
|
|
| :--- | :---: |
|
|
|
|
| :--- | :---: |
|
|
|
|
| pwgan_csmsc| zh
|
|
|
|
| pwgan_csmsc | zh |
|
|
|
|
| pwgan_aishell3| zh
|
|
|
|
| pwgan_ljspeech | en |
|
|
|
|
| pwgan_ljspeech| en
|
|
|
|
| pwgan_aishell3 | zh |
|
|
|
|
| pwgan_vctk| en
|
|
|
|
| pwgan_vctk | en |
|
|
|
|
| mb_melgan_csmsc| zh
|
|
|
|
| mb_melgan_csmsc | zh |
|
|
|
|
|
|
|
|
| style_melgan_csmsc | zh |
|
|
|
|
|
|
|
|
| hifigan_csmsc | zh |
|
|
|
|
|
|
|
|
| hifigan_ljspeech | en |
|
|
|
|
|
|
|
|
| hifigan_aishell3 | zh |
|
|
|
|
|
|
|
|
| hifigan_vctk | en |
|
|
|
|
|
|
|
|
| wavernn_csmsc | zh |
|
|
|
|