History

Hui Zhang 7eb3ab0dfa Merge pull request #1806 from lym0302/r1.0 [server] update streaming demos readme		3 years ago
..
audio_searching	format code	3 years ago
audio_tagging	update readme, test=doc_fix	3 years ago
automatic_video_subtitiles	update readme, test=doc_fix	3 years ago
metaverse	fix demos, test=tts	3 years ago
punctuation_restoration	update readme, test=doc_fix	3 years ago
speaker_verification	Update README.md	3 years ago
speech_recognition	add asr websocket server note, test=doc	3 years ago
speech_server	Update README_cn.md	3 years ago
speech_translation	update readme, test=doc_fix	3 years ago
story_talker	fix demos, test=tts	3 years ago
streaming_asr_server	update streaming asr readme, test=doc	3 years ago
streaming_tts_server	update readme, test=doc	3 years ago
style_fs2	update readme, test=doc_fix (#1156 )	3 years ago
text_to_speech	cli batch and shell pipe, test=doc	3 years ago
README.md	update the streaming asr english note, test=doc	3 years ago
README_cn.md	update the streaming asr server readme, test=doc	3 years ago

Speech Application based on PaddleSpeech

The directory containes many speech applications in multi scenarios.

audio searching - mass audio similarity retrieval
audio tagging - multi-label tagging of an audio file
automatic_video_subtitiles - generate subtitles from a video
metaverse - 2D AR with TTS
punctuation_restoration - restore punctuation from raw text
speech recogintion - recognize text of an audio file
speech server - Server for Speech Task, e.g. ASR,TTS,CLS
streaming asr server - receive audio stream from websocket, and recognize to transcript.
speech translation - end to end speech translation
story talker - book reader based on OCR and TTS
style_fs2 - multi style control for FastSpeech2 model
text_to_speech - convert text into speech