You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
lym0302
88adcaa6dc
|
3 years ago | |
---|---|---|
.. | ||
audio_searching | 3 years ago | |
audio_tagging | 3 years ago | |
automatic_video_subtitiles | 3 years ago | |
metaverse | 3 years ago | |
punctuation_restoration | 3 years ago | |
speaker_verification | 3 years ago | |
speech_recognition | 3 years ago | |
speech_server | 3 years ago | |
speech_translation | 3 years ago | |
story_talker | 3 years ago | |
streaming_asr_server | 3 years ago | |
streaming_tts_server | 3 years ago | |
style_fs2 | 3 years ago | |
text_to_speech | 3 years ago | |
README.md | 3 years ago | |
README_cn.md | 3 years ago |
README.md
Speech Application based on PaddleSpeech
(简体中文|English)
The directory containes many speech applications in multi scenarios.
- audio searching - mass audio similarity retrieval
- audio tagging - multi-label tagging of an audio file
- automatic_video_subtitiles - generate subtitles from a video
- metaverse - 2D AR with TTS
- punctuation_restoration - restore punctuation from raw text
- speech recogintion - recognize text of an audio file
- speech server - Server for Speech Task, e.g. ASR,TTS,CLS
- streaming asr server - receive audio stream from websocket, and recognize to transcript.
- speech translation - end to end speech translation
- story talker - book reader based on OCR and TTS
- style_fs2 - multi style control for FastSpeech2 model
- text_to_speech - convert text into speech