Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System and End-to-End Speech Simultaneous Translation.

sound-classification transformer asr speech-synthesis voice-cloning punctuation-restoration streaming-tts speech-recognition vocoder kws streaming-asr speech-alignment tts conformer speech-translation voice-recognition

Go to file

Hui Zhang a107b75bac transform; librispeech/s2 data process ok		4 years ago
.github	add stale config (#604 )	5 years ago
.pre-commit-hooks	add pre commit hooks	5 years ago
.travis	E2E/Streaming Transformer/Conformer ASR (#578 )	5 years ago
deepspeech	transform; librispeech/s2 data process ok	4 years ago
docs	fix img link; rsl format;	4 years ago
examples	transform; librispeech/s2 data process ok	4 years ago
hub	add requirements for hub	4 years ago
parakeet	setup.py deps from requirements.txt	4 years ago
speechnn	fix for kaldi	4 years ago
tests	fix the bug of benchmark after merge the parakeet, add the condition of using kaldi in aishll s1	4 years ago
third_party	Kaldi (#839 )	4 years ago
tools	transform; librispeech/s2 data process ok	4 years ago
utils	transform; librispeech/s2 data process ok	4 years ago
.bashrc	fix bugs	4 years ago
.clang-format	E2E/Streaming Transformer/Conformer ASR (#578 )	5 years ago
.flake8	refactor feature, dict and argument for new config format	4 years ago
.gitconfig	E2E/Streaming Transformer/Conformer ASR (#578 )	5 years ago
.gitignore	more utils to support kaldi/espnet data preocess	4 years ago
.mergify.yml	add delpoy mergify label	5 years ago
.pre-commit-config.yaml	E2E/Streaming Transformer/Conformer ASR (#578 )	5 years ago
.readthedocs.yml	merge parakeet repo into deepspeech	4 years ago
.style.yapf	Add ci and code format checking.	8 years ago
.travis.yml	E2E/Streaming Transformer/Conformer ASR (#578 )	5 years ago
.vimrc	E2E/Streaming Transformer/Conformer ASR (#578 )	5 years ago
LICENSE	Create License	8 years ago
README.md	update readme	4 years ago
env.sh	fix env.sh PATH postion	4 years ago
requirements.txt	transform; librispeech/s2 data process ok	4 years ago
setup.cfg	setup.py install cpp deps	4 years ago
setup.py	transform; librispeech/s2 data process ok	4 years ago
setup.sh	transform; librispeech/s2 data process ok	4 years ago

README.md

PaddlePaddle Speech toolkit

DeepSpeech is an open-source implementation of end-to-end Automatic Speech Recognition engine, with PaddlePaddle platform. Our vision is to empower both industrial application and academic research on speech recognition, via an easy-to-use, efficient, samller and scalable implementation, including training, inference & testing module, and deployment.

Features

See feature list for more information.

Setup

All tested under:

Ubuntu 16.04
python>=3.7
paddlepaddle==2.1.2

Please see install.

Getting Started

Please see Getting Started and tiny egs.

More Information

Questions and Help

You are welcome to submit questions in Github Discussions and bug reports in Github Issues. You are also welcome to contribute to this project.

License

DeepSpeech is provided under the Apache-2.0 License.

Acknowledgement

We depends on many open source repos. See References for more information.