Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System and End-to-End Speech Simultaneous Translation.





PaddlePaddle Speech to Any toolkit

DeepSpeech is an open-source implementation of an end-to-end Automatic Speech Recognition (ASR) engine built on the PaddlePaddle platform. Our vision is to empower both industrial applications and academic research on speech recognition through an easy-to-use, efficient, smaller, and scalable implementation that covers training, inference and testing modules, and deployment.

Features

See feature list for more information.

Setup

All tested under:

Ubuntu 16.04
python>=3.7
paddlepaddle==2.1.2

Please see install.
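
As a quick sanity check before following the install guide, the tested configuration above can be compared against the local environment. The sketch below is not part of the project; the helper names (`python_ok`, `paddle_ok`, `installed_paddle_version`) are hypothetical, and it assumes only that an installed PaddlePaddle exposes `paddle.__version__`.

```python
import sys

# Values copied from the tested configuration listed in this README.
MIN_PYTHON = (3, 7)
TESTED_PADDLE = "2.1.2"


def python_ok(version_info=sys.version_info):
    """Return True if the interpreter is at least Python 3.7."""
    return tuple(version_info[:2]) >= MIN_PYTHON


def paddle_ok(installed_version):
    """Return True if the given version string equals the tested paddlepaddle version."""
    return installed_version == TESTED_PADDLE


def installed_paddle_version():
    """Return paddle.__version__, or None if PaddlePaddle is not installed."""
    try:
        import paddle  # imported lazily so the check runs without PaddlePaddle
    except ImportError:
        return None
    return paddle.__version__


if __name__ == "__main__":
    version = installed_paddle_version()
    print("python ok:", python_ok())
    print("paddle version:", version)
    print("paddle matches tested version:", paddle_ok(version))
```

A mismatch does not necessarily mean the toolkit will fail; it only means the environment differs from the one the authors tested.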

Getting Started

Please see Getting Started and the tiny example (tiny egs).

More Information

Questions and Help

You are welcome to submit questions in GitHub Discussions and bug reports in GitHub Issues. Contributions to this project are also welcome.

License

DeepSpeech is provided under the Apache-2.0 License.

Acknowledgement

We depend on many open-source repositories. See References for more information.