History

dependabot[bot] 5a6159810b Bump numpy from 1.21.0 to 1.22.0 in /demos/audio_searching Bumps [numpy](https://github.com/numpy/numpy) from 1.21.0 to 1.22.0. - [Release notes](https://github.com/numpy/numpy/releases) - [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst) - [Commits](https://github.com/numpy/numpy/compare/v1.21.0...v1.22.0) --- updated-dependencies: - dependency-name: numpy dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>		2 years ago
..
audio_content_search	remove old vector model info, test=doc	2 years ago
audio_searching	Bump numpy from 1.21.0 to 1.22.0 in /demos/audio_searching	2 years ago
audio_tagging	Update usage and doc of cli executor.	2 years ago
automatic_video_subtitiles	Update usage and doc of cli executor.	2 years ago
custom_streaming_asr	fix	2 years ago
metaverse	fix demos, test=tts	3 years ago
punctuation_restoration	Update usage and doc of cli executor.	2 years ago
speaker_verification	Update usage and doc of cli executor.	2 years ago
speech_recognition	Update usage and doc of cli executor.	2 years ago
speech_server	update engine, test=doc	2 years ago
speech_translation	Update usage and doc of cli executor.	2 years ago
speech_web	del dead link	2 years ago
story_talker	fix demos, test=tts	3 years ago
streaming_asr_server	fix rtf bug	2 years ago
streaming_tts_server	fix hifigan pad value	2 years ago
style_fs2	update readme, test=doc_fix (#1156 )	3 years ago
text_to_speech	Update usage and doc of cli executor.	2 years ago
README.md	Improve readability	2 years ago
README_cn.md	update the streaming asr server readme, test=doc	3 years ago

README.md

Speech Application based on PaddleSpeech

(简体中文|English)

This directory contains many speech applications in multiple scenarios.

audio searching - mass audio similarity retrieval
audio tagging - multi-label tagging of an audio file
automatic_video_subtitles - generate subtitles from a video
metaverse - 2D AR with TTS
punctuation_restoration - restore punctuation from raw text
speech recognition - recognize text of an audio file
speech server - Server for Speech Task, e.g. ASR,TTS,CLS
streaming asr server - receive audio stream from websocket, and recognize to transcript.
speech translation - end to end speech translation
story talker - book reader based on OCR and TTS
style_fs2 - multi style control for FastSpeech2 model
text_to_speech - convert text into speech