947 B
947 B
Features
Speech Recognition
-
Offline
-
Online
Language Model
- Ngram
Decoder
- ctc greedy
- ctc prefix beam search
- greedy
- beam search
- attention rescore
Speech Frontend
- Audio
- Auto Gain
- Feature
- kaldi fbank
- kaldi mfcc
- linear
- delta detla
Speech Augmentation
- Audio
- Volume Perturbation
- Speed Perturbation
- Shifting Perturbation
- Online Bayesian normalization
- Noise Perturbation
- Impulse Response
- Spectrum
- SpecAugment
- Adaptive SpecAugment
Tokenizer
- Chinese/English Character
- English Word
- Sentence Piece
Word Segmentation
Grapheme To Phoneme
- syallable
- phoneme