diff --git a/README.md b/README.md index 809ffe6df..e0769720f 100644 --- a/README.md +++ b/README.md @@ -1,31 +1,302 @@ -# PaddlePaddle Speech toolkit +English | [简体中文](README_ch.md) +# PaddleSpeech + + + +
+
+
ASR Module Type | +Dataset | +Model Type | +Link | +
---|---|---|---|
Acoustic Model | +Aishell | +2 Conv + 5 LSTM layers with only forward direction | ++ Ds2 Online Aishell Model + | +
2 Conv + 3 bidirectional GRU layers | ++ Ds2 Offline Aishell Model + | +||
Encoder:Conformer, Decoder:Transformer, Decoding method: Attention + CTC | ++ Conformer Offline Aishell Model + | +||
Encoder:Conformer, Decoder:Transformer, Decoding method: Attention | ++ Conformer Librispeech Model + | +||
Librispeech | +Encoder:Conformer, Decoder:Transformer, Decoding method: Attention | +Conformer Librispeech Model | +|
Encoder:Transformer, Decoder:Transformer, Decoding method: Attention | ++ Transformer Librispeech Model + | +||
Language Model | +CommonCrawl(en.00) | +English Language Model | ++ English Language Model + | +
Baidu Internal Corpus | +Mandarin Language Model Small | ++ Mandarin Language Model Small + | +|
Mandarin Language Model Large | ++ Mandarin Language Model Large + | +
TTS Module Type | +Model Type | +Dataset | +Link | +
---|---|---|---|
Text Frontend | ++ | + chinese-fronted + | +|
Acoustic Model | +Tacotron2 | +LJSpeech | ++ tacotron2-vctk + | +
TransformerTTS | ++ transformer-ljspeech + | +||
SpeedySpeech | +CSMSC | ++ speedyspeech-csmsc + | +|
FastSpeech2 | +AISHELL-3 | ++ fastspeech2-aishell3 + | +|
VCTK | +fastspeech2-vctk | +||
LJSpeech | +fastspeech2-ljspeech | +||
CSMSC | ++ fastspeech2-csmsc + | +||
Vocoder | +WaveFlow | +LJSpeech | ++ waveflow-ljspeech + | +
Parallel WaveGAN | +LJSpeech | ++ PWGAN-ljspeech + | +|
VCTK | ++ PWGAN-vctk + | +||
CSMSC | ++ PWGAN-csmsc + | +||
Voice Cloning | +GE2E | +AISHELL-3, etc. | ++ ge2e + | +
GE2E + Tactron2 | +AISHELL-3 | ++ ge2e-tactron2-aishell3 + | + +