History

zxcd 4be005858b 【DOC】fix demos bug (#3830 ) * fix demos * fix test		1 year ago
..
TTSAndroid	…
TTSArmLinux	…
TTSCppFrontend	…
asr_deployment	…
audio_content_search	【ASR】fix acs demo (#3826 )	1 year ago
audio_searching	【DOC】fix demos bug (#3830 )	1 year ago
audio_tagging	…
automatic_video_subtitiles	…
custom_streaming_asr	…
keyword_spotting	…
metaverse	…
punctuation_restoration	…
speaker_verification	…
speech_recognition	…
speech_server	…
speech_ssl	…
speech_translation	…
speech_web	Update ge2e_clone.py (#3517 )	2 years ago
story_talker	…
streaming_asr_server	【DOC】fix demos bug (#3830 )	1 year ago
streaming_tts_server	…
streaming_tts_serving_fastdeploy	…
style_fs2	【DOC】fix demos bug (#3830 )	1 year ago
text_to_speech	…
whisper	…
README.md	Update README.md (#3532 )	2 years ago
README_cn.md	…

Speech Application based on PaddleSpeech

This directory contains many speech applications in multiple scenarios.

audio searching - mass audio similarity retrieval
audio tagging - multi-label tagging of an audio file
automatic_video_subtitles - generate subtitles from a video
metaverse - 2D AR with TTS
punctuation_restoration - restore punctuation from raw text
speech recognition - recognize text of an audio file
speech server - Server for Speech Task, e.g. ASR,TTS,CLS
streaming asr server - receive audio stream from websocket, and recognize to transcript.
streaming tts server - receive text from http or websocket, and streaming audio data stream.
speech translation - end to end speech translation
story talker - book reader based on OCR and TTS
style_fs2 - multi style control for FastSpeech2 model
text_to_speech - convert text into speech
self supervised pretraining - speech feature extraction and speech recognition based on wav2vec2
Whisper - speech recognize and translate based on Whisper model