You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
17 lines
685 B
17 lines
685 B
# Speech Application based on PaddleSpeech
|
|
|
|
([简体中文](./README_cn.md)|English)
|
|
|
|
The directory containes many speech applications in multi scenarios.
|
|
|
|
* audio searching - mass audio similarity retrieval
|
|
* audio tagging - multi-label tagging of an audio file
|
|
* automatic_video_subtitiles - generate subtitles from a video
|
|
* metaverse - 2D AR with TTS
|
|
* punctuation_restoration - restore punctuation from raw text
|
|
* speech recogintion - recognize text of an audio file
|
|
* speech translation - end to end speech translation
|
|
* story talker - book reader based on OCR and TTS
|
|
* style_fs2 - multi style control for FastSpeech2 model
|
|
* text_to_speech - convert text into speech
|