PaddleSpeech/demos/README.md

# Speech Application based on PaddleSpeech

([简体中文](./README_cn.md)|English)

This directory contains many speech applications in multiple scenarios.

* audio searching - mass audio similarity retrieval
* audio tagging - multi-label tagging of an audio file
* automatic_video_subtitles - generate subtitles from a video
* metaverse - 2D AR with TTS  
* punctuation_restoration - restore punctuation from raw text
* speech recognition - recognize text of an audio file 
* speech server - Server for Speech Task, e.g. ASR,TTS,CLS
* streaming asr server - receive audio stream from websocket, and recognize to transcript.
* streaming tts server - receive text from http or websocket, and streaming audio data stream.
* speech translation - end to end speech translation  
* story talker - book reader based on OCR and TTS  
* style_fs2 - multi style control for FastSpeech2 model  
* text_to_speech - convert text into speech 
* self supervised pretraining - speech feature extraction and speech recognition based on wav2vec2
* Whisper - speech recognize and translate based on Whisper model
Update README.md 3 years ago			`# Speech Application based on PaddleSpeech`

Update README. test=doc_fix 3 years ago			`([简体中文](./README_cn.md)\|English)`

Improve readability 2 years ago			`This directory contains many speech applications in multiple scenarios.`
Update README.md 3 years ago
[vec][search] update client image url, test=doc fix #1608 3 years ago			`* audio searching - mass audio similarity retrieval`
Update README. test=doc_fix 3 years ago			`* audio tagging - multi-label tagging of an audio file`
Improve readability 2 years ago			`* automatic_video_subtitles - generate subtitles from a video`
Update README. test=doc_fix 3 years ago			`* metaverse - 2D AR with TTS`
			`* punctuation_restoration - restore punctuation from raw text`
Improve readability 2 years ago			`* speech recognition - recognize text of an audio file`
Update README.md 3 years ago			`* speech server - Server for Speech Task, e.g. ASR,TTS,CLS`
update the streaming asr english note, test=doc 3 years ago			`* streaming asr server - receive audio stream from websocket, and recognize to transcript.`
update demos readme, test=doc 2 years ago			`* streaming tts server - receive text from http or websocket, and streaming audio data stream.`
Update README.md 3 years ago			`* speech translation - end to end speech translation`
			`* story talker - book reader based on OCR and TTS`
			`* style_fs2 - multi style control for FastSpeech2 model`
Update README. test=doc_fix 3 years ago			`* text_to_speech - convert text into speech`
add all whisper model size support, test=asr (#2677) * add all whisper model size support * add choices in parser. 2 years ago			`* self supervised pretraining - speech feature extraction and speech recognition based on wav2vec2`
Update README.md (#3532) Fixed a typo 8 months ago			`* Whisper - speech recognize and translate based on Whisper model`