Features

Dataset

  • Aishell
  • Librispeech
  • THCHS30
  • TIMIT

Speech Recognition

Language Model

  • Ngram

Decoder

  • CTC greedy
  • CTC prefix beam search
  • greedy
  • beam search
  • attention rescore
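
As an illustration of the simplest decoder listed above, CTC greedy decoding can be sketched in a few lines. This is a minimal, framework-free sketch and not PaddleSpeech's implementation: the function name `ctc_greedy_decode` and the choice of `blank_id=0` are assumptions made for the example.

```python
from typing import List, Sequence


def ctc_greedy_decode(frame_scores: Sequence[Sequence[float]],
                      blank_id: int = 0) -> List[int]:
    """Pick the best token per frame, merge repeats, then drop blanks.

    frame_scores: one row of per-token scores (or probabilities) per frame.
    blank_id: index of the CTC blank token (assumed to be 0 here).
    """
    # Step 1: best-scoring token index for every frame.
    best_path = [max(range(len(frame)), key=frame.__getitem__)
                 for frame in frame_scores]

    # Step 2: collapse consecutive repeats and remove blanks.
    decoded: List[int] = []
    previous = None
    for token in best_path:
        if token != previous and token != blank_id:
            decoded.append(token)
        previous = token
    return decoded


if __name__ == "__main__":
    scores = [
        [0.1, 0.8, 0.1],    # token 1
        [0.2, 0.7, 0.1],    # token 1 (repeat, merged)
        [0.9, 0.05, 0.05],  # blank
        [0.1, 0.8, 0.1],    # token 1 (new, because a blank separates it)
        [0.1, 0.2, 0.7],    # token 2
        [0.1, 0.1, 0.8],    # token 2 (repeat, merged)
        [0.9, 0.05, 0.05],  # blank
    ]
    print(ctc_greedy_decode(scores))
```

Prefix beam search follows the same collapse rule but keeps several candidate prefixes per frame instead of a single best path.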

Deployment

  • Paddle Inference

Alignment

  • MFA
  • CTC Alignment

Speech Frontend

  • Audio
    • Auto Gain
  • Feature
    • Kaldi fbank
    • Kaldi mfcc
    • linear
    • delta delta
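
Delta and delta-delta features are time derivatives of the base features (for example fbank or MFCC). The sketch below uses the common regression formula with a window of 2 frames and edge-frame replication. The function name `delta`, the window size, and the padding strategy are assumptions for illustration and may differ from what the toolkit actually does.

```python
from typing import List


def delta(features: List[List[float]], window: int = 2) -> List[List[float]]:
    """Regression-based delta features.

    d_t = sum_{n=1..N} n * (c_{t+n} - c_{t-n}) / (2 * sum_{n=1..N} n^2)

    Frames beyond either edge are replaced by the nearest edge frame.
    """
    num_frames = len(features)
    denom = 2 * sum(n * n for n in range(1, window + 1))
    out: List[List[float]] = []
    for t in range(num_frames):
        row = []
        for d in range(len(features[t])):
            acc = 0.0
            for n in range(1, window + 1):
                nxt = features[min(t + n, num_frames - 1)][d]
                prv = features[max(t - n, 0)][d]
                acc += n * (nxt - prv)
            row.append(acc / denom)
        out.append(row)
    return out


if __name__ == "__main__":
    ramp = [[0.0], [1.0], [2.0], [3.0], [4.0]]  # one feature rising linearly
    first_order = delta(ramp)          # delta
    second_order = delta(first_order)  # delta-delta: delta applied twice
    print(first_order[2], second_order[2])
```

Applying `delta` twice gives the delta-delta features; the three sets are typically concatenated per frame before being fed to the model.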

Speech Augmentation

  • Audio
    • Volume Perturbation
    • Speed Perturbation
    • Shifting Perturbation
    • Online Bayesian normalization
    • Noise Perturbation
    • Impulse Response
  • Spectrum
    • SpecAugment
    • Adaptive SpecAugment
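
To make the spectrum-level augmentation concrete, here is a minimal sketch of the masking part of SpecAugment: one frequency mask and one time mask applied to a spectrogram stored as a list of frames. Time warping is omitted, and the function name `spec_augment`, its parameters, and the zero fill value are assumptions for this example rather than the toolkit's API.

```python
import random
from typing import List, Optional


def spec_augment(spec: List[List[float]],
                 max_freq_mask: int = 2,
                 max_time_mask: int = 2,
                 rng: Optional[random.Random] = None) -> List[List[float]]:
    """Return a copy of `spec` (frames x bins) with one frequency mask
    and one time mask set to 0.0. Mask widths are drawn uniformly from
    [0, max_*_mask], capped by the spectrogram size."""
    rng = rng or random.Random()
    num_frames = len(spec)
    num_bins = len(spec[0])
    out = [list(row) for row in spec]  # copy so the input is left untouched

    # Frequency mask: zero a band of consecutive bins in every frame.
    f = rng.randint(0, min(max_freq_mask, num_bins))
    f0 = rng.randint(0, num_bins - f)
    for row in out:
        for b in range(f0, f0 + f):
            row[b] = 0.0

    # Time mask: zero a run of consecutive frames.
    t = rng.randint(0, min(max_time_mask, num_frames))
    t0 = rng.randint(0, num_frames - t)
    for i in range(t0, t0 + t):
        out[i] = [0.0] * num_bins

    return out


if __name__ == "__main__":
    spectrogram = [[1.0] * 4 for _ in range(6)]  # 6 frames, 4 bins
    for frame in spec_augment(spectrogram, rng=random.Random(0)):
        print(frame)
```

Passing a seeded `random.Random` makes the masks reproducible, which is convenient when debugging a data pipeline.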

Tokenizer

  • Chinese/English Character
  • English Word
  • Sentence Piece
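
The difference between the character and word tokenizers listed above can be shown with a small sketch. The function names and the `<space>` placeholder token are hypothetical choices for this example; a SentencePiece tokenizer needs a trained subword model and is not shown here.

```python
from typing import List


def char_tokenize(text: str, space_token: str = "<space>") -> List[str]:
    """Character-level tokens; works for both Chinese and English text.
    Spaces are mapped to an explicit placeholder (an assumed convention)."""
    return [space_token if ch == " " else ch for ch in text]


def word_tokenize(text: str) -> List[str]:
    """Word-level tokens for English: split on runs of whitespace."""
    return text.split()


if __name__ == "__main__":
    print(char_tokenize("hi 你好"))
    print(word_tokenize("hello  world"))
```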

Word Segmentation

Grapheme To Phoneme

  • syllable
  • phoneme