Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System and End-to-End Speech Simultaneous Translation.

PaddlePaddle Speech toolkit

DeepSpeech is an open-source implementation of an end-to-end Automatic Speech Recognition (ASR) engine built on the PaddlePaddle platform. Our vision is to empower both industrial applications and academic research on speech recognition through an easy-to-use, efficient, smaller, and scalable implementation that covers training, inference and testing modules, and deployment.

Features

See feature list for more information.

Setup

All tested under:

  • Ubuntu 16.04
  • python>=3.7
  • paddlepaddle==2.1.2

Please see install.
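
The version requirements listed above can be checked before installation with a short script. The following is a minimal sketch and not part of the toolkit: the helper names `installed_paddle_version` and `check_environment` are hypothetical, the script assumes the `paddle` module exposes its version as `paddle.__version__`, and it does not check the operating system.

```python
import sys

# Versions taken from the "tested under" list in this README.
REQUIRED_PYTHON = (3, 7)
REQUIRED_PADDLE = "2.1.2"


def installed_paddle_version():
    """Return the installed PaddlePaddle version string, or None if it is absent."""
    try:
        import paddle  # provided by the paddlepaddle package
    except ImportError:
        return None
    return getattr(paddle, "__version__", None)


def check_environment(python_version, paddle_version):
    """Compare the given versions with the tested ones and return a list of problems.

    python_version: a tuple such as sys.version_info
    paddle_version: a version string, or None if PaddlePaddle is not installed
    """
    problems = []
    if tuple(python_version[:2]) < REQUIRED_PYTHON:
        problems.append(
            "Python %d.%d found, but >= %d.%d is required"
            % (python_version[0], python_version[1],
               REQUIRED_PYTHON[0], REQUIRED_PYTHON[1])
        )
    if paddle_version is None:
        problems.append("paddlepaddle is not installed")
    elif paddle_version != REQUIRED_PADDLE:
        problems.append(
            "paddlepaddle %s found, but the tested version is %s"
            % (paddle_version, REQUIRED_PADDLE)
        )
    return problems


if __name__ == "__main__":
    issues = check_environment(sys.version_info, installed_paddle_version())
    if issues:
        for issue in issues:
            print("WARNING: " + issue)
    else:
        print("Environment matches the tested configuration.")
```

An empty list means the interpreter and PaddlePaddle versions match the tested configuration. Other versions may still work, since the list only records what was tested.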

Getting Started

Please see Getting Started and tiny egs.

More Information

Questions and Help

You are welcome to submit questions in GitHub Discussions and bug reports in GitHub Issues. Contributions to this project are also welcome.

License

DeepSpeech is provided under the Apache-2.0 License.

Acknowledgement

We depend on many open source repositories. See References for more information.