Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System and End-to-End Speech Simultaneous Translation.
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
Go to file
Hui Zhang 09d5209ff9
Merge pull request #857 from Jackwaterveg/test
3 years ago
.github
.pre-commit-hooks
.travis
deepspeech fix bug: bug of space id in score.cpp, add detokenize 3 years ago
docs
examples fix the run.sh in g2p/zh 3 years ago
speechnn
tests nprocs 0 for cpu, other for gpu 3 years ago
third_party
tools fix sctk install 3 years ago
utils replace space when build vocab 3 years ago
.bashrc
.clang-format
.flake8
.gitconfig
.gitignore fix sctk install 3 years ago
.mergify.yml
.pre-commit-config.yaml
.style.yapf
.travis.yml
.vimrc
LICENSE
README.md
env.sh
requirements.txt rm dupliate packages 3 years ago
setup.sh

README.md

PaddlePaddle Speech to Any toolkit

License python version support os

DeepSpeech is an open-source implementation of end-to-end Automatic Speech Recognition engine, with PaddlePaddle platform. Our vision is to empower both industrial application and academic research on speech recognition, via an easy-to-use, efficient, samller and scalable implementation, including training, inference & testing module, and deployment.

Features

See feature list for more information.

Setup

All tested under:

  • Ubuntu 16.04
  • python>=3.7
  • paddlepaddle>=2.2.0rc

Please see install.

Getting Started

Please see Getting Started and tiny egs.

More Information

Questions and Help

You are welcome to submit questions in Github Discussions and bug reports in Github Issues. You are also welcome to contribute to this project.

License

DeepSpeech is provided under the Apache-2.0 License.

Acknowledgement

We depends on many open source repos. See References for more information.