Speech Application based on PaddleSpeech

(简体中文|English)

This directory contains speech applications for a variety of scenarios.

  • audio searching - large-scale audio similarity retrieval
  • audio tagging - multi-label tagging of an audio file
  • automatic_video_subtitiles - generate subtitles from a video
  • metaverse - 2D AR with TTS
  • punctuation_restoration - restore punctuation from raw text
  • speech recognition - transcribe an audio file to text
  • speech translation - end-to-end speech translation
  • story talker - book reader based on OCR and TTS
  • style_fs2 - multi-style control for the FastSpeech2 model
  • text_to_speech - convert text into speech