You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
PaddleSpeech/demos
iftaken e68f1ce6f5
add speech web demo
2 years ago
..
audio_content_search
audio_searching
audio_tagging
automatic_video_subtitiles
custom_streaming_asr
metaverse
punctuation_restoration
speaker_verification
speech_recognition
speech_server
speech_translation
speech_web_demo add speech web demo 2 years ago
story_talker
streaming_asr_server
streaming_tts_server
style_fs2
text_to_speech
README.md
README_cn.md

README.md

Speech Application based on PaddleSpeech

(简体中文|English)

This directory contains many speech applications in multiple scenarios.

  • audio searching - mass audio similarity retrieval
  • audio tagging - multi-label tagging of an audio file
  • automatic_video_subtitles - generate subtitles from a video
  • metaverse - 2D AR with TTS
  • punctuation_restoration - restore punctuation from raw text
  • speech recognition - recognize text of an audio file
  • speech server - Server for Speech Task, e.g. ASR,TTS,CLS
  • streaming asr server - receive audio stream from websocket, and recognize to transcript.
  • speech translation - end to end speech translation
  • story talker - book reader based on OCR and TTS
  • style_fs2 - multi style control for FastSpeech2 model
  • text_to_speech - convert text into speech