Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System and End-to-End Speech Simultaneous Translation.

Chinese version

PaddlePaddle ASR toolkit

PaddleASR is an open-source implementation of an end-to-end Automatic Speech Recognition (ASR) engine, built on the PaddlePaddle platform. Our vision is to empower both industrial applications and academic research on speech recognition through an easy-to-use, efficient, smaller, and scalable implementation that covers training, inference and testing modules, and deployment.

Models

Setup

  • python>=3.7
  • paddlepaddle>=2.1.0

Please see install.
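
The two version requirements above can be checked before going through the full install. The following is a minimal sketch, not part of this repository: the helper names `parse_version`, `meets_minimum`, and `check_environment` are hypothetical, and it assumes the installed `paddle` package exposes its version as `paddle.__version__`.

```python
import re
import sys

# Minimum versions taken from the Setup list above.
MIN_PYTHON = (3, 7)
MIN_PADDLE = (2, 1, 0)


def parse_version(text):
    """Return the leading numeric components of a version string as a tuple."""
    match = re.match(r"\d+(?:\.\d+)*", text)
    if match is None:
        raise ValueError("unrecognised version string: {!r}".format(text))
    return tuple(int(part) for part in match.group(0).split("."))


def meets_minimum(installed, minimum):
    """Compare two version tuples, padding the shorter one with zeros."""
    width = max(len(installed), len(minimum))
    padded_installed = tuple(installed) + (0,) * (width - len(installed))
    padded_minimum = tuple(minimum) + (0,) * (width - len(minimum))
    return padded_installed >= padded_minimum


def check_environment():
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []
    if not meets_minimum(tuple(sys.version_info[:2]), MIN_PYTHON):
        problems.append("python>=3.7 is required")
    try:
        import paddle  # provided by the paddlepaddle package
    except ImportError:
        problems.append("paddlepaddle is not installed")
    else:
        if not meets_minimum(parse_version(paddle.__version__), MIN_PADDLE):
            problems.append("paddlepaddle>=2.1.0 is required")
    return problems


if __name__ == "__main__":
    for problem in check_environment():
        print(problem)
```

Running the script prints one line per unmet requirement and nothing when both are satisfied. Only the numeric prefix of the version string is compared, so a pre-release suffix is ignored.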

Getting Started

Please see Getting Started and tiny egs.

More Information

Questions and Help

You are welcome to submit questions in GitHub Discussions and bug reports in GitHub Issues. You are also welcome to contribute to this project.

License

DeepSpeech is provided under the Apache-2.0 License.

Acknowledgement

We depend on many open-source repositories. See References for more information.