English | [简体中文](README_ch.md) # PaddleSpeech
| ASR Module Type | Dataset | Model Type | Link |
|---|---|---|---|
| Acoustic Model | Aishell | 2 Conv + 5 LSTM layers with only forward direction | Ds2 Online Aishell Model |
| 2 Conv + 3 bidirectional GRU layers | Ds2 Offline Aishell Model | ||
| Encoder:Conformer, Decoder:Transformer, Decoding method: Attention + CTC | Conformer Offline Aishell Model | ||
| Encoder:Conformer, Decoder:Transformer, Decoding method: Attention | Conformer Librispeech Model | ||
| Librispeech | Encoder:Conformer, Decoder:Transformer, Decoding method: Attention | Conformer Librispeech Model | |
| Encoder:Transformer, Decoder:Transformer, Decoding method: Attention | Transformer Librispeech Model | ||
| Language Model | CommonCrawl(en.00) | English Language Model | English Language Model |
| Baidu Internal Corpus | Mandarin Language Model Small | Mandarin Language Model Small | |
| Mandarin Language Model Large | Mandarin Language Model Large |
| TTS Module Type | Model Type | Dataset | Link |
|---|---|---|---|
| Text Frontend | chinese-fronted | ||
| Acoustic Model | Tacotron2 | LJSpeech | tacotron2-vctk |
| TransformerTTS | transformer-ljspeech | ||
| SpeedySpeech | CSMSC | speedyspeech-csmsc | |
| FastSpeech2 | AISHELL-3 | fastspeech2-aishell3 | |
| VCTK | fastspeech2-vctk | ||
| LJSpeech | fastspeech2-ljspeech | ||
| CSMSC | fastspeech2-csmsc | ||
| Vocoder | WaveFlow | LJSpeech | waveflow-ljspeech |
| Parallel WaveGAN | LJSpeech | PWGAN-ljspeech | |
| VCTK | PWGAN-vctk | ||
| CSMSC | PWGAN-csmsc | ||
| Voice Cloning | GE2E | AISHELL-3, etc. | ge2e |
| GE2E + Tactron2 | AISHELL-3 | ge2e-tactron2-aishell3 | |