PaddleSpeech/docs/source/introduction.md

# PaddleSpeech

## What is PaddleSpeech?
PaddleSpeech is an open-source toolkit on PaddlePaddle platform for two critical tasks in Speech -  Speech-To-Text (Automatic Speech Recognition, ASR) and Text-To-Speech Synthesis (TTS), with modules involving state-of-art and influential models.

## What can PaddleSpeech do?

### Speech-To-Text
PaddleSpeech ASR mainly consists of components below:
- Implementation of models and commonly used neural network layers.
- Dataset abstraction and common data preprocessing pipelines.
- Ready-to-run experiments.

PaddleSpeech ASR provides you with a complete ASR pipeline, including:
- Data Preparation
    - Build vocabulary
    - Compute Cepstral mean and variance normalization (CMVN)
    - Featrue extraction
        - linear
        - fbank (also support kaldi feature)
        - mfcc
- Acoustic Models
    - Deepspeech2 (online and offline)
    - Transformer (online and offline)
    - Conformer (online and offline)
- Decoder
    - ctc greedy search (used in DeepSpeech2, Transformer and Conformer)
    - ctc beam search (used in DeepSpeech2, Transformer and Conformer)
    - attention decoding (used in Transformer and Conformer)
    - attention rescoring (used in Transformer and Conformer)

Speech-To-Text helps you training the ASR model very simply.
   
### Text-To-Speech
TTS mainly consists of components below:
- Implementation of models and commonly used neural network layers.
- Dataset abstraction and common data preprocessing pipelines.
- Ready-to-run experiments.

PaddleSpeech TTS provides you with a complete TTS pipeline, including:
- Text FrontEnd
    - Rule based Chinese frontend.
- Acoustic Models
    - FastSpeech2
    - SpeedySpeech
    - TransformerTTS
    - Tacotron2
- Vocoders
    - Multi Band MelGAN
    - Parallel WaveGAN
    - WaveFlow
- Voice Cloning
    - Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
    - GE2E

Text-To-Speech  helps you to train TTS models with simple commands.
refactor docs 3 years ago			`# PaddleSpeech`

			`## What is PaddleSpeech?`
			`PaddleSpeech is an open-source toolkit on PaddlePaddle platform for two critical tasks in Speech - Speech-To-Text (Automatic Speech Recognition, ASR) and Text-To-Speech Synthesis (TTS), with modules involving state-of-art and influential models.`

			`## What can PaddleSpeech do?`

			`### Speech-To-Text`
Add the Speech-To-Text in introduction.md 3 years ago			`PaddleSpeech ASR mainly consists of components below:`
			`- Implementation of models and commonly used neural network layers.`
			`- Dataset abstraction and common data preprocessing pipelines.`
			`- Ready-to-run experiments.`

			`PaddleSpeech ASR provides you with a complete ASR pipeline, including:`
			`- Data Preparation`
			`- Build vocabulary`
			`- Compute Cepstral mean and variance normalization (CMVN)`
			`- Featrue extraction`
Update introduction.md 3 years ago			`- linear`
Add the Speech-To-Text in introduction.md 3 years ago			`- fbank (also support kaldi feature)`
			`- mfcc`
			`- Acoustic Models`
			`- Deepspeech2 (online and offline)`
			`- Transformer (online and offline)`
			`- Conformer (online and offline)`
			`- Decoder`
			`- ctc greedy search (used in DeepSpeech2, Transformer and Conformer)`
			`- ctc beam search (used in DeepSpeech2, Transformer and Conformer)`
			`- attention decoding (used in Transformer and Conformer)`
			`- attention rescoring (used in Transformer and Conformer)`
refactor docs 3 years ago
Add the Speech-To-Text in introduction.md 3 years ago			`Speech-To-Text helps you training the ASR model very simply.`

refactor docs 3 years ago			`### Text-To-Speech`
			`TTS mainly consists of components below:`
			`- Implementation of models and commonly used neural network layers.`
			`- Dataset abstraction and common data preprocessing pipelines.`
			`- Ready-to-run experiments.`

			`PaddleSpeech TTS provides you with a complete TTS pipeline, including:`
			`- Text FrontEnd`
			`- Rule based Chinese frontend.`
			`- Acoustic Models`
			`- FastSpeech2`
			`- SpeedySpeech`
			`- TransformerTTS`
			`- Tacotron2`
			`- Vocoders`
			`- Multi Band MelGAN`
			`- Parallel WaveGAN`
			`- WaveFlow`
			`- Voice Cloning`
			`- Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis`
			`- GE2E`

			`Text-To-Speech helps you to train TTS models with simple commands.`