Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System and End-to-End Speech Simultaneous Translation.

kws streaming-asr speech-alignment tts conformer speech-translation voice-recognition sound-classification transformer asr speech-synthesis voice-cloning punctuation-restoration streaming-tts speech-recognition vocoder

Go to file

Hui Zhang 538bf271eb chinese char/word ngram lm (#613 ) * add ngram lm egs * add zhon repo * install kenlm, zhon * format * add chinese_text_normalization repo * add ngram lm egs		5 years ago
.github	add stale config (#604 )	5 years ago
.notebook	E2E/Streaming Transformer/Conformer ASR (#578 )	5 years ago
.pre-commit-hooks	add pre commit hooks	6 years ago
.travis	E2E/Streaming Transformer/Conformer ASR (#578 )	5 years ago
deepspeech	E2E/Streaming Transformer/Conformer ASR (#578 )	5 years ago
doc	fix image link (#612 )	5 years ago
examples	chinese char/word ngram lm (#613 )	5 years ago
tests	E2E/Streaming Transformer/Conformer ASR (#578 )	5 years ago
third_party	chinese char/word ngram lm (#613 )	5 years ago
tools	chinese char/word ngram lm (#613 )	5 years ago
utils	chinese char/word ngram lm (#613 )	5 years ago
.clang-format	E2E/Streaming Transformer/Conformer ASR (#578 )	5 years ago
.flake8	E2E/Streaming Transformer/Conformer ASR (#578 )	5 years ago
.gitconfig	E2E/Streaming Transformer/Conformer ASR (#578 )	5 years ago
.gitignore	chinese char/word ngram lm (#613 )	5 years ago
.mergify.yml	speech text process docs (#607 )	5 years ago
.pre-commit-config.yaml	E2E/Streaming Transformer/Conformer ASR (#578 )	5 years ago
.style.yapf	Add ci and code format checking.	9 years ago
.travis.yml	E2E/Streaming Transformer/Conformer ASR (#578 )	5 years ago
.vimrc	E2E/Streaming Transformer/Conformer ASR (#578 )	5 years ago
LICENSE	Create License	9 years ago
README.md	chinese char/word ngram lm (#613 )	5 years ago
README_cn.md	chinese char/word ngram lm (#613 )	5 years ago
env.sh	E2E/Streaming Transformer/Conformer ASR (#578 )	5 years ago
requirements.txt	update doc (#603 )	5 years ago
setup.sh	chinese char/word ngram lm (#613 )	5 years ago

README.md

中文版

PaddlePaddle ASR toolkit

PaddleASR is an open-source implementation of end-to-end Automatic Speech Recognition (ASR) engine, with PaddlePaddle platform. Our vision is to empower both industrial application and academic research on speech recognition, via an easy-to-use, efficient, samller and scalable implementation, including training, inference & testing module, and deployment.

Models

Setup

python>=3.7
paddlepaddle>=2.1.0

Please see install.

Getting Started

Please see Getting Started and tiny egs.

More Information

Questions and Help

You are welcome to submit questions in Github Discussions and bug reports in Github Issues. You are also welcome to contribute to this project.

License

DeepSpeech is provided under the Apache-2.0 License.

Acknowledgement

We depends on many open source repos. See References for more information.