commit
8697d422b9
@ -0,0 +1,61 @@
# Features

### Speech Recognition

* Offline
  * [Baidu's DeepSpeech2](http://proceedings.mlr.press/v48/amodei16.pdf)
  * [Transformer](https://arxiv.org/abs/1706.03762)
  * [Conformer](https://arxiv.org/abs/2005.08100)

* Online
  * [U2](https://arxiv.org/pdf/2012.05481.pdf)

### Language Model

* Ngram

### Decoder

* CTC greedy
* CTC prefix beam search
* greedy
* beam search
* attention rescore
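
To illustrate the simplest decoder in the list above, here is a minimal sketch of CTC greedy decoding in plain Python. It is not taken from this repository: the function name `ctc_greedy_decode`, the `blank_id` default of 0, and the toy score matrix are all assumptions made for illustration.

```python
from typing import List, Sequence


def ctc_greedy_decode(frame_scores: Sequence[Sequence[float]],
                      blank_id: int = 0) -> List[int]:
    """Pick the best token per frame, merge repeats, then drop blanks.

    frame_scores is a (frames x vocabulary) matrix of scores or
    probabilities; blank_id is the index of the CTC blank symbol
    (assumed to be 0 here).
    """
    # Step 1: best token index for every frame.
    best_path = [max(range(len(frame)), key=frame.__getitem__)
                 for frame in frame_scores]

    # Step 2: collapse consecutive repeats and remove blanks.
    decoded: List[int] = []
    previous = None
    for token in best_path:
        if token != previous and token != blank_id:
            decoded.append(token)
        previous = token
    return decoded


# Toy example: vocabulary {0: blank, 1, 2}, five frames.
scores = [
    [0.10, 0.80, 0.10],  # token 1
    [0.10, 0.70, 0.20],  # token 1 again (merged with the previous frame)
    [0.90, 0.05, 0.05],  # blank (separates the two occurrences of token 1)
    [0.20, 0.60, 0.20],  # token 1
    [0.10, 0.20, 0.70],  # token 2
]
print(ctc_greedy_decode(scores))  # [1, 1, 2]
```

The blank frame is what allows the same token to be emitted twice in a row. Prefix beam search keeps several candidate prefixes per frame instead of only the single best path.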

### Speech Frontend

* Audio
  * Auto Gain
* Feature
  * kaldi fbank
  * kaldi mfcc
  * linear
  * delta delta
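
As a rough illustration of the delta-delta feature listed above, the sketch below applies the common regression formula for delta coefficients twice. It assumes a window of 2 frames and edge padding by repeating the first and last frame. The function name `delta` and these defaults are illustrative assumptions, not this project's API.

```python
from typing import List


def delta(features: List[List[float]], window: int = 2) -> List[List[float]]:
    """Regression-based delta of a (frames x dims) feature matrix.

    d[t] = sum_{n=1..N} n * (c[t+n] - c[t-n]) / (2 * sum_{n=1..N} n^2)
    Frames beyond either edge are replaced by the nearest valid frame.
    """
    n_frames = len(features)
    if n_frames == 0:
        return []
    dim = len(features[0])
    denom = 2 * sum(n * n for n in range(1, window + 1))
    out = []
    for t in range(n_frames):
        row = []
        for d in range(dim):
            acc = 0.0
            for n in range(1, window + 1):
                ahead = features[min(t + n, n_frames - 1)][d]
                behind = features[max(t - n, 0)][d]
                acc += n * (ahead - behind)
            row.append(acc / denom)
        out.append(row)
    return out


# A linear ramp has slope 1 away from the edges, so its delta there is 1.
ramp = [[0.0], [1.0], [2.0], [3.0], [4.0]]
first_order = delta(ramp)
second_order = delta(first_order)  # the "delta delta" feature
print(first_order[2], len(second_order))
```

Delta and delta-delta rows are typically concatenated with the static features frame by frame before being fed to the model.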

### Speech Augmentation

* Audio
  - Volume Perturbation
  - Speed Perturbation
  - Shifting Perturbation
  - Online Bayesian normalization
  - Noise Perturbation
  - Impulse Response
* Spectrum
  - SpecAugment
  - Adaptive SpecAugment
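
The masking part of SpecAugment can be sketched in a few lines. The example below is a simplified illustration only: it applies one frequency mask and one time mask to a (frames x bins) matrix and omits time warping. The function name `spec_augment`, its parameters, and the choice of zero as the fill value are assumptions, not the configuration used in this project.

```python
import random
from typing import List, Optional


def spec_augment(spec: List[List[float]],
                 max_freq_mask: int = 2,
                 max_time_mask: int = 2,
                 seed: Optional[int] = None) -> List[List[float]]:
    """Return a copy of spec with one frequency band and one time span zeroed."""
    rng = random.Random(seed)
    out = [list(row) for row in spec]  # never modify the input in place
    n_frames = len(out)
    n_bins = len(out[0]) if out else 0
    if n_frames == 0 or n_bins == 0:
        return out

    # Frequency mask: zero `f` consecutive bins in every frame.
    f = rng.randint(0, min(max_freq_mask, n_bins))
    f0 = rng.randint(0, n_bins - f)
    for row in out:
        for j in range(f0, f0 + f):
            row[j] = 0.0

    # Time mask: zero `t` consecutive frames across all bins.
    t = rng.randint(0, min(max_time_mask, n_frames))
    t0 = rng.randint(0, n_frames - t)
    for i in range(t0, t0 + t):
        out[i] = [0.0] * n_bins
    return out


spectrogram = [[1.0] * 4 for _ in range(6)]  # 6 frames, 4 frequency bins
augmented = spec_augment(spectrogram, seed=0)
for row in augmented:
    print(row)
```

Mask widths are drawn uniformly between 0 and the configured maximum, so some calls leave the input unchanged. The adaptive variant scales the mask sizes with utterance length.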

### Tokenizer

* Chinese/English Character
* English Word
* Sentence Piece

### Word Segmentation

* [mmseg](http://technology.chtsai.org/mmseg/)

### Grapheme To Phoneme

* syllable
* phoneme
@ -1,14 +1,16 @@
# Aishell

## Conformer

| Model | Config | Augmentation | Test set | Decode method | Loss | WER |
| --- | --- | --- | --- | --- | --- | --- |
| conformer | conf/conformer.yaml | spec_aug + shift | test | attention | - | 0.059858 |
| conformer | conf/conformer.yaml | spec_aug + shift | test | ctc_greedy_search | - | 0.062311 |
| conformer | conf/conformer.yaml | spec_aug + shift | test | ctc_prefix_beam_search | - | 0.062196 |
| conformer | conf/conformer.yaml | spec_aug + shift | test | attention_rescoring | - | 0.054694 |

## Transformer

| Model | Config | Augmentation | Test set | Decode method | Loss | WER |
| --- | --- | --- | --- | --- | --- | --- |
| transformer | conf/transformer.yaml | spec_aug + shift | test | attention | - | - |
@ -1,7 +1,8 @@
# LibriSpeech

## Deepspeech2

| Model | Config | Test set | WER |
| --- | --- | --- | --- |
| DeepSpeech2 | conf/deepspeech2.yaml | test-clean | 0.073973 |
| DeepSpeech2 | release 1.8.5 | test-clean | 0.074939 |
@ -1,16 +1,18 @@
# LibriSpeech

## Conformer

| Model | Config | Augmentation | Test set | Decode method | Loss | WER |
| --- | --- | --- | --- | --- | --- | --- |
| conformer | conf/conformer.yaml | spec_aug + shift | test-all | attention | test-all 6.35 | 0.057117 |
| conformer | conf/conformer.yaml | spec_aug + shift | test-clean | attention | test-all 6.35 | 0.030162 |
| conformer | conf/conformer.yaml | spec_aug + shift | test-clean | ctc_greedy_search | test-all 6.35 | 0.037910 |
| conformer | conf/conformer.yaml | spec_aug + shift | test-clean | ctc_prefix_beam_search | test-all 6.35 | 0.037761 |
| conformer | conf/conformer.yaml | spec_aug + shift | test-clean | attention_rescoring | test-all 6.35 | 0.032115 |

## Transformer

| Model | Config | Augmentation | Test set | Decode method | Loss | WER |
| --- | --- | --- | --- | --- | --- | --- |
| transformer | conf/transformer.yaml | spec_aug + shift | test-all | attention | test-all 6.98 | 0.066500 |
| transformer | conf/transformer.yaml | spec_aug + shift | test-clean | attention | test-all 6.98 | 0.036 |
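
The WER columns in the result tables are fractions, so 0.0300 means 3.00%. As a reminder of how such a number is typically obtained, here is a minimal sketch that divides the Levenshtein distance between reference and hypothesis tokens by the reference length. The function names are hypothetical and this is not the scoring script that produced the tables.

```python
from typing import Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        curr = [i]
        for j, h in enumerate(hyp, 1):
            curr.append(min(prev[j] + 1,              # deletion
                            curr[j - 1] + 1,          # insertion
                            prev[j - 1] + (r != h)))  # substitution or match
        prev = curr
    return prev[-1]


def error_rate(ref_tokens: Sequence[str], hyp_tokens: Sequence[str]) -> float:
    """Edit distance normalised by the number of reference tokens."""
    if not ref_tokens:
        raise ValueError("reference must contain at least one token")
    return edit_distance(ref_tokens, hyp_tokens) / len(ref_tokens)


ref = "the cat sat on the mat".split()
hyp = "the cat sit on mat".split()
# One substitution (sat -> sit) and one deletion (the): 2 errors / 6 words.
print(round(error_rate(ref, hyp), 4))  # 0.3333
```

Passing lists of characters instead of words gives a character error rate, which is the usual choice for Chinese test sets. Whether the Aishell numbers above are word- or character-level is not stated here.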