History

zxcd 3afe871a87 remvoe duplicate line (#4158 )		2 days ago
..
TTSAndroid	Fix typos in multiple files (#4032 )	12 months ago
TTSArmLinux	【doc】fix download link case abnormal traffic (#4020 )	1 year ago
TTSCppFrontend	Fix typos (#4021 )	1 year ago
asr_deployment	[demo] u2++ asr deployment demo (#2639 )	3 years ago
audio_content_search	remvoe duplicate line (#4158 )	2 days ago
audio_searching	Fix typos in multiple files (#4032 )	12 months ago
audio_tagging	【doc】fix download link case abnormal traffic (#4020 )	1 year ago
automatic_video_subtitiles	【doc】fix download link case abnormal traffic (#4020 )	1 year ago
custom_streaming_asr	【doc】fix download link case abnormal traffic (#4020 )	1 year ago
keyword_spotting	【doc】fix download link case abnormal traffic (#4020 )	1 year ago
metaverse	【doc】fix download link case abnormal traffic (#4020 )	1 year ago
punctuation_restoration	Fix (#3981 )	1 year ago
speaker_verification	【doc】fix download link case abnormal traffic (#4020 )	1 year ago
speech_recognition	【doc】fix download link case abnormal traffic (#4020 )	1 year ago
speech_server	Fix typos in multiple files (#4032 )	12 months ago
speech_ssl	【doc】fix download link case abnormal traffic (#4020 )	1 year ago
speech_translation	【doc】fix download link case abnormal traffic (#4020 )	1 year ago
speech_web	Bump lodash from 4.17.21 to 4.17.23 in /demos/speech_web/web_client (#4150 )	2 weeks ago
story_talker	【doc】fix download link case abnormal traffic (#4020 )	1 year ago
streaming_asr_server	Fix typo: weboscket -> WebSocket (#4155 )	2 days ago
streaming_tts_server	Fix typos in multiple files (#4032 )	12 months ago
streaming_tts_serving_fastdeploy	Fix typos (#4021 )	1 year ago
style_fs2	【doc】fix download link case abnormal traffic (#4020 )	1 year ago
text_to_speech	Fix typos in multiple files (#4032 )	12 months ago
whisper	add whisper doc. (#4115 )	7 months ago
README.md	Update README.md (#3532 )	2 years ago
README_cn.md	add all whisper model size support, test=asr (#2677 )	3 years ago

README.md

Speech Application based on PaddleSpeech

(简体中文|English)

This directory contains many speech applications in multiple scenarios.

audio searching - mass audio similarity retrieval
audio tagging - multi-label tagging of an audio file
automatic_video_subtitles - generate subtitles from a video
metaverse - 2D AR with TTS
punctuation_restoration - restore punctuation from raw text
speech recognition - recognize text of an audio file
speech server - Server for Speech Task, e.g. ASR,TTS,CLS
streaming asr server - receive audio stream from websocket, and recognize to transcript.
streaming tts server - receive text from http or websocket, and streaming audio data stream.
speech translation - end to end speech translation
story talker - book reader based on OCR and TTS
style_fs2 - multi style control for FastSpeech2 model
text_to_speech - convert text into speech
self supervised pretraining - speech feature extraction and speech recognition based on wav2vec2
Whisper - speech recognize and translate based on Whisper model