commit
8697d422b9
@ -0,0 +1,61 @@
# Features

### Speech Recognition

* Offline
  * [Baidu's DeepSpeech2](http://proceedings.mlr.press/v48/amodei16.pdf)
  * [Transformer](https://arxiv.org/abs/1706.03762)
  * [Conformer](https://arxiv.org/abs/2005.08100)
* Online
  * [U2](https://arxiv.org/pdf/2012.05481.pdf)

### Language Model

* N-gram

### Decoder

* CTC greedy
* CTC prefix beam search
* greedy
* beam search
* attention rescore

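As an illustration of the simplest decoder in the list above, CTC greedy decoding can be sketched as: take the best-scoring token in each frame, collapse consecutive repeats, then drop the blank token. This is a minimal sketch and not this repository's implementation; the function name `ctc_greedy_decode`, the plain-list input format, and `blank_id=0` are assumptions made for the example.

```python
from itertools import groupby


def ctc_greedy_decode(frame_scores, blank_id=0):
    """Greedy CTC decoding over per-frame scores.

    frame_scores: list of frames, each a list with one score per token id.
    blank_id: id of the CTC blank token (assumed to be 0 in this sketch).
    """
    # Step 1: pick the highest-scoring token id in every frame.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]
    # Step 2: collapse runs of identical consecutive token ids.
    collapsed = [token for token, _ in groupby(best_path)]
    # Step 3: remove blanks, which only separate repeated tokens.
    return [token for token in collapsed if token != blank_id]


frames = [
    [0.1, 0.8, 0.1],    # best: 1
    [0.1, 0.7, 0.2],    # best: 1 (repeat, collapsed)
    [0.9, 0.05, 0.05],  # best: 0 (blank)
    [0.2, 0.7, 0.1],    # best: 1 (kept, separated by blank)
    [0.1, 0.1, 0.8],    # best: 2
]
decoded = ctc_greedy_decode(frames)
```

Prefix beam search differs in that it keeps several candidate prefixes per frame instead of committing to the single best token.
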
### Speech Frontend

* Audio
  * Auto Gain
* Feature
  * Kaldi fbank
  * Kaldi mfcc
  * linear
  * delta delta

### Speech Augmentation

* Audio
  - Volume Perturbation
  - Speed Perturbation
  - Shifting Perturbation
  - Online Bayesian normalization
  - Noise Perturbation
  - Impulse Response
* Spectrum
  - SpecAugment
  - Adaptive SpecAugment

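To make the spectrum-level augmentation above more concrete, the masking part of SpecAugment can be sketched as zeroing one random band of frequency bins and one random span of time frames. This is a simplified illustration under stated assumptions, not the repository's code: the function name `spec_augment`, its parameters, and the list-of-lists spectrogram layout are hypothetical, and the time-warping step of the original method is omitted.

```python
import random


def spec_augment(spectrogram, max_freq_width=2, max_time_width=2,
                 mask_value=0.0, rng=None):
    """Apply one frequency mask and one time mask to a spectrogram.

    spectrogram: list of frames, each a list of frequency-bin values.
    Returns a masked copy; the input is left unchanged.
    """
    rng = rng or random.Random()
    masked = [list(frame) for frame in spectrogram]
    num_frames = len(masked)
    num_bins = len(masked[0]) if num_frames else 0
    if num_frames == 0 or num_bins == 0:
        return masked

    # Frequency mask: a band of f_width consecutive bins.
    f_width = rng.randint(0, min(max_freq_width, num_bins))
    f_start = rng.randint(0, num_bins - f_width)
    # Time mask: a span of t_width consecutive frames.
    t_width = rng.randint(0, min(max_time_width, num_frames))
    t_start = rng.randint(0, num_frames - t_width)

    for t, frame in enumerate(masked):
        in_time_mask = t_start <= t < t_start + t_width
        for f in range(num_bins):
            in_freq_mask = f_start <= f < f_start + f_width
            if in_time_mask or in_freq_mask:
                frame[f] = mask_value
    return masked


clean = [[1.0] * 5 for _ in range(6)]  # 6 frames x 5 frequency bins
augmented = spec_augment(clean, rng=random.Random(0))
```

Passing a seeded `random.Random` makes the masks reproducible, which is convenient when debugging a training pipeline.
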
### Tokenizer

* Chinese/English Character
* English Word
* SentencePiece

### Word Segmentation

* [mmseg](http://technology.chtsai.org/mmseg/)

### Grapheme To Phoneme

* syllable
* phoneme

@ -1,14 +1,16 @@
# Aishell

## Conformer

| Model | Config | Augmentation | Test set | Decode method | Loss | WER |
| --- | --- | --- | --- | --- | --- | --- |
| conformer | conf/conformer.yaml | spec_aug + shift | test | attention | - | 0.059858 |
| conformer | conf/conformer.yaml | spec_aug + shift | test | ctc_greedy_search | - | 0.062311 |
| conformer | conf/conformer.yaml | spec_aug + shift | test | ctc_prefix_beam_search | - | 0.062196 |
| conformer | conf/conformer.yaml | spec_aug + shift | test | attention_rescoring | - | 0.054694 |

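The WER column in these tables appears to be reported as a fraction rather than a percentage (0.054694 would be about 5.47%). As a reminder of how such a figure is typically obtained, the sketch below computes an error rate as the edit distance between reference and hypothesis tokens divided by the reference length. The function names are hypothetical, and the token unit (words or characters) is an assumption, since the tables do not state it.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two token sequences."""
    # prev[j] holds the distance between the processed ref prefix and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_token in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_token in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_token == hyp_token else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def error_rate(ref_tokens, hyp_tokens):
    """Edit distance normalized by reference length, as a fraction."""
    if not ref_tokens:
        raise ValueError("reference must not be empty")
    return edit_distance(ref_tokens, hyp_tokens) / len(ref_tokens)


reference = "the cat sat on the mat".split()
hypothesis = "the cat sit on mat".split()
rate = error_rate(reference, hypothesis)  # one substitution, one deletion
```

For a whole test set, the usual convention is to sum the edit distances over all utterances and divide by the total number of reference tokens, rather than averaging per-utterance rates.
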
## Transformer

| Model | Config | Augmentation | Test set | Decode method | Loss | WER |
| --- | --- | --- | --- | --- | --- | --- |
| transformer | conf/transformer.yaml | spec_aug + shift | test | attention | - | - |
@ -1,7 +1,8 @@
# LibriSpeech

## DeepSpeech2

| Model | Config | Test set | WER |
| --- | --- | --- | --- |
| DeepSpeech2 | conf/deepspeech2.yaml | test-clean | 0.073973 |
| DeepSpeech2 | release 1.8.5 | test-clean | 0.074939 |
@ -1,16 +1,18 @@
# LibriSpeech

## Conformer

| Model | Config | Augmentation | Test set | Decode method | Loss | WER |
| --- | --- | --- | --- | --- | --- | --- |
| conformer | conf/conformer.yaml | spec_aug + shift | test-all | attention | test-all 6.35 | 0.057117 |
| conformer | conf/conformer.yaml | spec_aug + shift | test-clean | attention | test-all 6.35 | 0.030162 |
| conformer | conf/conformer.yaml | spec_aug + shift | test-clean | ctc_greedy_search | test-all 6.35 | 0.037910 |
| conformer | conf/conformer.yaml | spec_aug + shift | test-clean | ctc_prefix_beam_search | test-all 6.35 | 0.037761 |
| conformer | conf/conformer.yaml | spec_aug + shift | test-clean | attention_rescoring | test-all 6.35 | 0.032115 |

## Transformer

| Model | Config | Augmentation | Test set | Decode method | Loss | WER |
| --- | --- | --- | --- | --- | --- | --- |
| transformer | conf/transformer.yaml | spec_aug + shift | test-all | attention | test-all 6.98 | 0.066500 |
| transformer | conf/transformer.yaml | spec_aug + shift | test-clean | attention | test-all 6.98 | 0.036 |