PaddleSpeech/docs/source/introduction.md

# PaddleSpeech

## What is PaddleSpeech?
PaddleSpeech is an open-source toolkit on PaddlePaddle platform for two critical tasks in Speech -  Speech-to-Text (Automatic Speech Recognition, ASR) and Text-to-Speech Synthesis (TTS), with modules involving state-of-art and influential models.

## What can PaddleSpeech do?

### Speech-to-Text
PaddleSpeech ASR mainly consists of components below:
- Implementation of models and commonly used neural network layers.
- Dataset abstraction and common data preprocessing pipelines.
- Ready-to-run experiments.

PaddleSpeech ASR provides you with a complete ASR pipeline, including:
- Data Preparation
    - Build vocabulary
    - Compute Cepstral mean and variance normalization (CMVN)
    - Featrue extraction
        - linear
        - fbank (also support kaldi feature)
        - mfcc
- Acoustic Models
    - Deepspeech2 (Streaming and Non-Streaming)
    - Transformer (Streaming and Non-Streaming)
    - Conformer (Streaming and Non-Streaming)
- Decoder
    - ctc greedy search (used in DeepSpeech2, Transformer and Conformer)
    - ctc beam search (used in DeepSpeech2, Transformer and Conformer)
    - attention decoding (used in Transformer and Conformer)
    - attention rescoring (used in Transformer and Conformer)

Speech-to-Text helps you training the ASR model very simply.

### Text-to-Speech
TTS mainly consists of components below:
- Implementation of models and commonly used neural network layers.
- Dataset abstraction and common data preprocessing pipelines.
- Ready-to-run experiments.

PaddleSpeech TTS provides you with a complete TTS pipeline, including:
- Text FrontEnd
    - Rule based Chinese frontend.
- Acoustic Models
    - FastSpeech2
    - SpeedySpeech
    - TransformerTTS
    - Tacotron2
- Vocoders
    - Multi Band MelGAN
    - Parallel WaveGAN
    - WaveFlow
- Voice Cloning
    - Transfer Learning from Speaker Verification to Multispeaker Text-to-Speech Synthesis
    - GE2E

Text-to-Speech  helps you to train TTS models with simple commands.
refactor docs 3 years ago			`# PaddleSpeech`

			`## What is PaddleSpeech?`
Update introduction.md 3 years ago			`PaddleSpeech is an open-source toolkit on PaddlePaddle platform for two critical tasks in Speech - Speech-to-Text (Automatic Speech Recognition, ASR) and Text-to-Speech Synthesis (TTS), with modules involving state-of-art and influential models.`
refactor docs 3 years ago
			`## What can PaddleSpeech do?`

Update introduction.md 3 years ago			`### Speech-to-Text`
Add the Speech-To-Text in introduction.md 3 years ago			`PaddleSpeech ASR mainly consists of components below:`
			`- Implementation of models and commonly used neural network layers.`
			`- Dataset abstraction and common data preprocessing pipelines.`
			`- Ready-to-run experiments.`

			`PaddleSpeech ASR provides you with a complete ASR pipeline, including:`
			`- Data Preparation`
			`- Build vocabulary`
			`- Compute Cepstral mean and variance normalization (CMVN)`
			`- Featrue extraction`
Update introduction.md 3 years ago			`- linear`
Add the Speech-To-Text in introduction.md 3 years ago			`- fbank (also support kaldi feature)`
			`- mfcc`
			`- Acoustic Models`
fix doc of install 3 years ago			`- Deepspeech2 (Streaming and Non-Streaming)`
			`- Transformer (Streaming and Non-Streaming)`
			`- Conformer (Streaming and Non-Streaming)`
Add the Speech-To-Text in introduction.md 3 years ago			`- Decoder`
			`- ctc greedy search (used in DeepSpeech2, Transformer and Conformer)`
			`- ctc beam search (used in DeepSpeech2, Transformer and Conformer)`
			`- attention decoding (used in Transformer and Conformer)`
			`- attention rescoring (used in Transformer and Conformer)`
refactor docs 3 years ago
Update introduction.md 3 years ago			`Speech-to-Text helps you training the ASR model very simply.`
fix doc of install 3 years ago
Update introduction.md 3 years ago			`### Text-to-Speech`
refactor docs 3 years ago			`TTS mainly consists of components below:`
			`- Implementation of models and commonly used neural network layers.`
			`- Dataset abstraction and common data preprocessing pipelines.`
			`- Ready-to-run experiments.`

			`PaddleSpeech TTS provides you with a complete TTS pipeline, including:`
			`- Text FrontEnd`
			`- Rule based Chinese frontend.`
			`- Acoustic Models`
			`- FastSpeech2`
			`- SpeedySpeech`
			`- TransformerTTS`
			`- Tacotron2`
			`- Vocoders`
			`- Multi Band MelGAN`
			`- Parallel WaveGAN`
			`- WaveFlow`
			`- Voice Cloning`
update ipynb, add eval loss 3 years ago			`- Transfer Learning from Speaker Verification to Multispeaker Text-to-Speech Synthesis`
refactor docs 3 years ago			`- GE2E`

Update introduction.md 3 years ago			`Text-to-Speech helps you to train TTS models with simple commands.`