You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
PaddleSpeech/demos
zxcd 1e3e186c18
add whisper doc. (#4115)
3 days ago
..
TTSAndroid Fix typos in multiple files (#4032) 5 months ago
TTSArmLinux 【doc】fix download link case abnormal traffic (#4020) 6 months ago
TTSCppFrontend Fix typos (#4021) 6 months ago
asr_deployment [demo] u2++ asr deployment demo (#2639) 3 years ago
audio_content_search 【doc】fix download link case abnormal traffic (#4020) 6 months ago
audio_searching Fix typos in multiple files (#4032) 5 months ago
audio_tagging 【doc】fix download link case abnormal traffic (#4020) 6 months ago
automatic_video_subtitiles 【doc】fix download link case abnormal traffic (#4020) 6 months ago
custom_streaming_asr 【doc】fix download link case abnormal traffic (#4020) 6 months ago
keyword_spotting 【doc】fix download link case abnormal traffic (#4020) 6 months ago
metaverse 【doc】fix download link case abnormal traffic (#4020) 6 months ago
punctuation_restoration Fix (#3981) 7 months ago
speaker_verification 【doc】fix download link case abnormal traffic (#4020) 6 months ago
speech_recognition 【doc】fix download link case abnormal traffic (#4020) 6 months ago
speech_server Fix typos in multiple files (#4032) 5 months ago
speech_ssl 【doc】fix download link case abnormal traffic (#4020) 6 months ago
speech_translation 【doc】fix download link case abnormal traffic (#4020) 6 months ago
speech_web Bump @babel/runtime in /demos/speech_web/web_client (#4031) 5 months ago
story_talker 【doc】fix download link case abnormal traffic (#4020) 6 months ago
streaming_asr_server 【ASR】add chunk conformer model for tal_cs dataset. (#4110) 3 weeks ago
streaming_tts_server Fix typos in multiple files (#4032) 5 months ago
streaming_tts_serving_fastdeploy Fix typos (#4021) 6 months ago
style_fs2 【doc】fix download link case abnormal traffic (#4020) 6 months ago
text_to_speech Fix typos in multiple files (#4032) 5 months ago
whisper add whisper doc. (#4115) 3 days ago
README.md Update README.md (#3532) 2 years ago
README_cn.md add all whisper model size support, test=asr (#2677) 3 years ago

README.md

Speech Application based on PaddleSpeech

(简体中文|English)

This directory contains many speech applications in multiple scenarios.

  • audio searching - mass audio similarity retrieval
  • audio tagging - multi-label tagging of an audio file
  • automatic_video_subtitles - generate subtitles from a video
  • metaverse - 2D AR with TTS
  • punctuation_restoration - restore punctuation from raw text
  • speech recognition - recognize text of an audio file
  • speech server - Server for Speech Task, e.g. ASR,TTS,CLS
  • streaming asr server - receive audio stream from websocket, and recognize to transcript.
  • streaming tts server - receive text from http or websocket, and streaming audio data stream.
  • speech translation - end to end speech translation
  • story talker - book reader based on OCR and TTS
  • style_fs2 - multi style control for FastSpeech2 model
  • text_to_speech - convert text into speech
  • self supervised pretraining - speech feature extraction and speech recognition based on wav2vec2
  • Whisper - speech recognize and translate based on Whisper model