# Features

### Dataset
* AISHELL
* LibriSpeech
* THCHS30
* TIMIT

### Speech Recognition

* Non-Streaming
  * [Baidu's DeepSpeech2](http://proceedings.mlr.press/v48/amodei16.pdf)
  * [Transformer](https://arxiv.org/abs/1706.03762)
  * [Conformer](https://arxiv.org/abs/2005.08100)

* Streaming
  * [Baidu's DeepSpeech2](http://proceedings.mlr.press/v48/amodei16.pdf)
  * [U2](https://arxiv.org/pdf/2012.05481.pdf)

### Language Model

* N-gram

### Decoder

* CTC greedy
* CTC prefix beam search
* greedy
* beam search
* attention rescore
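
As an illustration of the simplest decoder in the list above, CTC greedy decoding picks the most likely token per frame, merges consecutive repeats, and drops the blank symbol. The following is a minimal sketch in plain Python, not this project's implementation; the function name `ctc_greedy_decode` and the default `blank_id=0` are assumptions made for the example.

```python
from typing import List, Sequence


def ctc_greedy_decode(frame_scores: Sequence[Sequence[float]],
                      blank_id: int = 0) -> List[int]:
    """Greedy (best-path) CTC decoding.

    frame_scores: one score vector per frame (probabilities or log-probs).
    blank_id: index of the CTC blank token (assumed to be 0 here).
    """
    # Step 1: take the arg-max token for every frame.
    best_path = [
        max(range(len(frame)), key=frame.__getitem__) for frame in frame_scores
    ]

    # Step 2: collapse consecutive repeats, then remove blanks.
    decoded: List[int] = []
    prev = None
    for token in best_path:
        if token != prev and token != blank_id:
            decoded.append(token)
        prev = token
    return decoded


if __name__ == "__main__":
    scores = [
        [0.1, 0.8, 0.1],    # token 1
        [0.1, 0.8, 0.1],    # token 1 (repeat, merged)
        [0.9, 0.05, 0.05],  # blank
        [0.1, 0.8, 0.1],    # token 1 (kept: separated by a blank)
        [0.1, 0.1, 0.8],    # token 2
    ]
    print(ctc_greedy_decode(scores))
```

A blank between two identical tokens is what allows a genuinely repeated token to survive the merge step, which is why the example keeps token 1 twice.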

### Deployment

* Paddle Inference

### Alignment

* MFA
* CTC Alignment

### Speech Frontend

* Audio
  * Auto Gain
* Feature
  * Kaldi fbank
  * Kaldi MFCC
  * linear
  * delta-delta
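
For readers unfamiliar with delta-delta features, they are usually computed by applying a regression-based delta twice and appending both results to the static features. The sketch below uses the common formula d_t = Σ n·(c_{t+n} − c_{t−n}) / (2·Σ n²) with edge frames repeated as padding. It is an illustration only: the function names `delta` and `add_deltas` and the default `window=2` are assumptions, and the project's own feature pipeline may differ in windowing and padding.

```python
from typing import List, Sequence


def delta(frames: Sequence[Sequence[float]], window: int = 2) -> List[List[float]]:
    """Regression-based delta over time; edge frames are repeated as padding."""
    n_frames = len(frames)
    if n_frames == 0:
        return []
    denom = 2.0 * sum(n * n for n in range(1, window + 1))
    out: List[List[float]] = []
    for t in range(n_frames):
        row = []
        for k in range(len(frames[t])):
            acc = 0.0
            for n in range(1, window + 1):
                ahead = frames[min(t + n, n_frames - 1)][k]   # clamp at last frame
                behind = frames[max(t - n, 0)][k]             # clamp at first frame
                acc += n * (ahead - behind)
            row.append(acc / denom)
        out.append(row)
    return out


def add_deltas(frames: Sequence[Sequence[float]], window: int = 2) -> List[List[float]]:
    """Return [static, delta, delta-delta] concatenated per frame."""
    d1 = delta(frames, window)
    d2 = delta(d1, window)
    return [list(f) + a + b for f, a, b in zip(frames, d1, d2)]


if __name__ == "__main__":
    ramp = [[0.0], [1.0], [2.0], [3.0], [4.0]]  # one feature rising linearly
    for row in add_deltas(ramp):
        print(row)
```

Each output frame is three times as wide as the input frame, so a 13-dimensional MFCC vector would become 39-dimensional.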

### Speech Augmentation

* Audio
  * Volume Perturbation
  * Speed Perturbation
  * Shifting Perturbation
  * Online Bayesian Normalization
  * Noise Perturbation
  * Impulse Response
* Spectrum
  * SpecAugment
  * Adaptive SpecAugment
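
To give a rough idea of what spectrum-level augmentation does, the sketch below applies one frequency mask and one time mask to a spectrogram stored as a list of frames. It covers only the masking part of SpecAugment (no time warping, no adaptive mask sizing) and is not this project's implementation; the function name `spec_augment` and all parameter names and defaults are assumptions chosen for the example.

```python
import random
from typing import List, Optional, Sequence


def spec_augment(spec: Sequence[Sequence[float]],
                 max_freq_mask: int = 2,
                 max_time_mask: int = 2,
                 rng: Optional[random.Random] = None,
                 mask_value: float = 0.0) -> List[List[float]]:
    """Apply one frequency mask and one time mask to a [frames][bins] spectrogram."""
    rng = rng or random.Random()
    n_frames = len(spec)
    n_bins = len(spec[0]) if n_frames else 0
    out = [list(row) for row in spec]  # copy so the input is left untouched

    # Frequency mask: blank out `f` consecutive bins in every frame.
    f = rng.randint(0, min(max_freq_mask, n_bins))
    f0 = rng.randint(0, n_bins - f)
    for row in out:
        for j in range(f0, f0 + f):
            row[j] = mask_value

    # Time mask: blank out `t` consecutive frames entirely.
    t = rng.randint(0, min(max_time_mask, n_frames))
    t0 = rng.randint(0, n_frames - t)
    for i in range(t0, t0 + t):
        out[i] = [mask_value] * n_bins

    return out


if __name__ == "__main__":
    spec = [[1.0] * 4 for _ in range(6)]  # 6 frames, 4 frequency bins
    for row in spec_augment(spec, rng=random.Random(0)):
        print(row)
```

Passing a seeded `random.Random` makes the masks reproducible, which is convenient when debugging a data pipeline.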

### Tokenizer

* Chinese/English Character
* English Word
* SentencePiece

### Word Segmentation

* [mmseg](http://technology.chtsai.org/mmseg/)

### Grapheme To Phoneme

* syllable
* phoneme