commit
8697d422b9
@ -0,0 +1,61 @@
|
|||||||
|
# Featrues
|
||||||
|
|
||||||
|
### Speech Recognition
|
||||||
|
|
||||||
|
* Offline
|
||||||
|
* [Baidu's DeepSpeech2](http://proceedings.mlr.press/v48/amodei16.pdf)
|
||||||
|
* [Transformer](https://arxiv.org/abs/1706.03762)
|
||||||
|
* [Conformer](https://arxiv.org/abs/2005.08100)
|
||||||
|
|
||||||
|
* Online
|
||||||
|
* [U2](https://arxiv.org/pdf/2012.05481.pdf)
|
||||||
|
|
||||||
|
### Language Model
|
||||||
|
|
||||||
|
* Ngram
|
||||||
|
|
||||||
|
### Decoder
|
||||||
|
|
||||||
|
* ctc greedy
|
||||||
|
* ctc prefix beam search
|
||||||
|
* greedy
|
||||||
|
* beam search
|
||||||
|
* attention rescore
|
||||||
|
|
||||||
|
### Speech Frontend
|
||||||
|
|
||||||
|
* Audio
|
||||||
|
* Auto Gain
|
||||||
|
* Feature
|
||||||
|
* kaldi fbank
|
||||||
|
* kaldi mfcc
|
||||||
|
* linear
|
||||||
|
* delta detla
|
||||||
|
|
||||||
|
### Speech Augmentation
|
||||||
|
|
||||||
|
* Audio
|
||||||
|
- Volume Perturbation
|
||||||
|
- Speed Perturbation
|
||||||
|
- Shifting Perturbation
|
||||||
|
- Online Bayesian normalization
|
||||||
|
- Noise Perturbation
|
||||||
|
- Impulse Response
|
||||||
|
* Spectrum
|
||||||
|
- SpecAugment
|
||||||
|
- Adaptive SpecAugment
|
||||||
|
|
||||||
|
### Tokenizer
|
||||||
|
|
||||||
|
* Chinese/English Character
|
||||||
|
* English Word
|
||||||
|
* Sentence Piece
|
||||||
|
|
||||||
|
### Word Segmentation
|
||||||
|
|
||||||
|
* [mmseg](http://technology.chtsai.org/mmseg/)
|
||||||
|
|
||||||
|
### Grapheme To Phoneme
|
||||||
|
|
||||||
|
* syallable
|
||||||
|
* phoneme
|
@ -1,14 +1,16 @@
|
|||||||
# Aishell
|
# Aishell
|
||||||
|
|
||||||
## Conformer
|
## Conformer
|
||||||
|
|
||||||
| Model | Config | Augmentation| Test set | Decode method | Loss | WER |
|
| Model | Config | Augmentation| Test set | Decode method | Loss | WER |
|
||||||
| --- | --- | --- | --- | --- | --- |
|
| --- | --- | --- | --- | --- | --- | --- |
|
||||||
| conformer | conf/conformer.yaml | spec_aug + shift | test | attention | - | 0.059858 |
|
| conformer | conf/conformer.yaml | spec_aug + shift | test | attention | - | 0.059858 |
|
||||||
| conformer | conf/conformer.yaml | spec_aug + shift | test | ctc_greedy_search | - | 0.062311 |
|
| conformer | conf/conformer.yaml | spec_aug + shift | test | ctc_greedy_search | - | 0.062311 |
|
||||||
| conformer | conf/conformer.yaml | spec_aug + shift | test | ctc_prefix_beam_search | - | 0.062196 |
|
| conformer | conf/conformer.yaml | spec_aug + shift | test | ctc_prefix_beam_search | - | 0.062196 |
|
||||||
| conformer | conf/conformer.yaml | spec_aug + shift | test | attention_rescoring | - | 0.054694 |
|
| conformer | conf/conformer.yaml | spec_aug + shift | test | attention_rescoring | - | 0.054694 |
|
||||||
|
|
||||||
## Transformer
|
## Transformer
|
||||||
|
|
||||||
| Model | Config | Augmentation| Test set | Decode method | Loss | WER |
|
| Model | Config | Augmentation| Test set | Decode method | Loss | WER |
|
||||||
| --- | --- | --- | --- | --- | --- |
|
| --- | --- | --- | --- | --- | --- | ---|
|
||||||
| transformer | conf/transformer.yaml | spec_aug + shift | test | attention | - | - |
|
| transformer | conf/transformer.yaml | spec_aug + shift | test | attention | - | - |
|
||||||
|
Loading…
Reference in new issue