Features
Dataset
- Aishell
- Librispeech
- THCHS30
- TIMIT
Speech Recognition
- Non-Streaming
- Streaming
Language Model
- Ngram
Decoder
- CTC greedy
- CTC prefix beam search
- greedy
- beam search
- attention rescore
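
To illustrate the simplest decoder in the list above: CTC greedy decoding takes the best-scoring label at each frame, merges consecutive repeats, and drops the blank symbol. The following is a minimal pure-Python sketch, not this project's API. The function name `ctc_greedy_decode`, the blank index `0`, and the toy score matrix are all assumptions made for illustration.

```python
from typing import List, Optional, Sequence


def ctc_greedy_decode(frame_scores: Sequence[Sequence[float]], blank: int = 0) -> List[int]:
    """Collapse per-frame argmax labels into a CTC output sequence."""
    decoded: List[int] = []
    previous: Optional[int] = None
    for scores in frame_scores:
        # Best label for this frame (the first index wins on ties).
        best = max(range(len(scores)), key=lambda i: scores[i])
        # Emit only when the label changes and is not the blank symbol.
        if best != previous and best != blank:
            decoded.append(best)
        previous = best
    return decoded


# Hypothetical toy input: 6 frames, 3 labels (index 0 is the assumed blank).
toy_scores = [
    [0.1, 0.8, 0.1],    # label 1
    [0.1, 0.7, 0.2],    # label 1 again, merged with the previous frame
    [0.9, 0.05, 0.05],  # blank
    [0.2, 0.7, 0.1],    # label 1, a new emission because a blank intervened
    [0.1, 0.2, 0.7],    # label 2
    [0.8, 0.1, 0.1],    # blank
]
print(ctc_greedy_decode(toy_scores))  # prints [1, 1, 2]
```

The prefix beam search and attention rescoring variants keep several hypotheses alive instead of a single argmax path, so they are not covered by this sketch.
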
Deployment
- Paddle Inference
Alignment
- MFA
- CTC Alignment
Speech Frontend
- Audio
  - Auto Gain
- Feature
  - Kaldi fbank
  - Kaldi MFCC
  - linear
  - delta delta
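
As a rough illustration of the last feature in the list above: delta features approximate the time derivative of each coefficient, and delta-delta features apply the same operation to the deltas. The sketch below uses the common regression formula with replicated edge frames. The function name `delta`, the default window of 2, and the toy input are assumptions; the project's own implementation may differ.

```python
from typing import List


def delta(features: List[List[float]], window: int = 2) -> List[List[float]]:
    """Regression-based delta over time; edge frames are replicated."""
    num_frames = len(features)
    # Normalizer of the regression formula: 2 * sum(n^2) for n = 1..window.
    denom = 2 * sum(n * n for n in range(1, window + 1))
    out: List[List[float]] = []
    for t in range(num_frames):
        row: List[float] = []
        for d in range(len(features[t])):
            acc = 0.0
            for n in range(1, window + 1):
                # Clamp indices so frames past the edges reuse the edge frame.
                ahead = features[min(t + n, num_frames - 1)][d]
                behind = features[max(t - n, 0)][d]
                acc += n * (ahead - behind)
            row.append(acc / denom)
        out.append(row)
    return out


# Hypothetical toy input: 5 frames of a single coefficient rising linearly.
ramp = [[0.0], [1.0], [2.0], [3.0], [4.0]]
first_order = delta(ramp)          # slope estimate per frame
second_order = delta(first_order)  # "delta delta": delta of the deltas
print(first_order[2], second_order[2])
```

In practice the static, delta, and delta-delta coefficients are usually concatenated per frame before being fed to the model.
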
Speech Augmentation
- Audio
  - Volume Perturbation
  - Speed Perturbation
  - Shifting Perturbation
  - Online Bayesian normalization
  - Noise Perturbation
  - Impulse Response
- Spectrum
  - SpecAugment
  - Adaptive SpecAugment
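
To give a feel for the spectrum-level augmentation listed above: SpecAugment masks random bands of a spectrogram along the frequency and time axes. The sketch below applies one frequency mask and one time mask to a frames-by-bins list of lists and omits time warping. The function name `spec_augment`, its parameters, and the zero fill value are assumptions for illustration, not this project's interface.

```python
import random
from typing import List, Optional


def spec_augment(
    spec: List[List[float]],
    max_freq_width: int = 2,
    max_time_width: int = 2,
    seed: Optional[int] = None,
) -> List[List[float]]:
    """Return a copy of `spec` with one frequency mask and one time mask."""
    rng = random.Random(seed)
    num_frames = len(spec)
    num_bins = len(spec[0]) if spec else 0
    masked = [row[:] for row in spec]  # copy so the input stays untouched

    # Frequency mask: zero a random band of bins in every frame.
    f_width = rng.randint(0, min(max_freq_width, num_bins))
    f_start = rng.randint(0, num_bins - f_width)
    for frame in masked:
        for b in range(f_start, f_start + f_width):
            frame[b] = 0.0

    # Time mask: zero a random run of consecutive frames.
    t_width = rng.randint(0, min(max_time_width, num_frames))
    t_start = rng.randint(0, num_frames - t_width)
    for t in range(t_start, t_start + t_width):
        masked[t] = [0.0] * num_bins

    return masked


# Hypothetical toy spectrogram: 6 frames, 4 frequency bins, all ones.
toy_spec = [[1.0] * 4 for _ in range(6)]
for row in spec_augment(toy_spec, seed=0):
    print(row)
```

Passing a fixed `seed` makes the masks reproducible, which is convenient for tests; during training the seed would normally be left unset.
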
Tokenizer
- Chinese/English Character
- English Word
- Sentence Piece
Word Segmentation
Grapheme To Phoneme
- syllable
- phoneme