文本前端 |
|
tn / g2p
|
声学模型 |
Tacotron2 |
LJSpeech / CSMSC |
tacotron2-ljspeech / tacotron2-csmsc
|
Transformer TTS |
LJSpeech |
transformer-ljspeech
|
SpeedySpeech |
CSMSC |
speedyspeech-csmsc
|
FastSpeech2 |
LJSpeech / VCTK / CSMSC / AISHELL-3 / ZH_EN / finetune |
fastspeech2-ljspeech / fastspeech2-vctk / fastspeech2-csmsc / fastspeech2-aishell3 / fastspeech2-zh_en / fastspeech2-finetune
|
ERNIE-SAT |
VCTK / AISHELL-3 / ZH_EN |
ERNIE-SAT-vctk / ERNIE-SAT-aishell3 / ERNIE-SAT-zh_en
|
DiffSinger |
Opencpop |
DiffSinger-opencpop
|
声码器 |
WaveFlow |
LJSpeech |
waveflow-ljspeech
|
Parallel WaveGAN |
LJSpeech / VCTK / CSMSC / AISHELL-3 / Opencpop |
PWGAN-ljspeech / PWGAN-vctk / PWGAN-csmsc / PWGAN-aishell3 / PWGAN-opencpop
|
Multi Band MelGAN |
CSMSC |
Multi Band MelGAN-csmsc
|
Style MelGAN |
CSMSC |
Style MelGAN-csmsc
|
HiFiGAN |
LJSpeech / VCTK / CSMSC / AISHELL-3 / Opencpop |
HiFiGAN-ljspeech / HiFiGAN-vctk / HiFiGAN-csmsc / HiFiGAN-aishell3 / HiFiGAN-opencpop
|
WaveRNN |
CSMSC |
WaveRNN-csmsc
|
声音克隆 |
GE2E |
Librispeech, etc. |
GE2E
|
SV2TTS (GE2E + Tacotron2) |
AISHELL-3 |
VC0
|
SV2TTS (GE2E + FastSpeech2) |
AISHELL-3 |
VC1
|
SV2TTS (ECAPA-TDNN + FastSpeech2) |
AISHELL-3 |
VC2
|
GE2E + VITS |
AISHELL-3 |
VITS-VC
|
端到端 |
VITS |
CSMSC / AISHELL-3 |
VITS-csmsc / VITS-aishell3
|