Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System and End-to-End Speech Simultaneous Translation.
yangyaming 3f63e069e0
Fix typos and follow comments.
8 years ago
data Update DS2 README.md and fix bug in librispeech.py 8 years ago
README.md Update DS2 README.md and fix bug in librispeech.py 8 years ago
audio_data_utils.py 1. Fix incorrect decoder result printing. 8 years ago
decoder.py Refactor decoder interfaces and add ./data directory. 8 years ago
error_rate.py Fix typos and follow comments. 8 years ago
infer.py Refactor decoder interfaces and add ./data directory. 8 years ago
model.py Refactor decoder interfaces and add ./data directory. 8 years ago
requirements.txt 1. Fix incorrect decoder result printing. 8 years ago
train.py Refactor decoder interfaces and add ./data directory. 8 years ago

README.md

Deep Speech 2 on PaddlePaddle

Installation

Replace $PADDLE_INSTALL_DIR with the path to your own PaddlePaddle installation directory.

pip install -r requirements.txt
export LD_LIBRARY_PATH=$PADDLE_INSTALL_DIR/Paddle/third_party/install/warpctc/lib:$LD_LIBRARY_PATH
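To confirm that the export took effect in the current shell, a small check along these lines may help. It is only a sketch: the helper name `find_warpctc_dirs` and the substring match on "warpctc" are assumptions made for illustration and are not part of this repository.

```python
import os


def find_warpctc_dirs(ld_library_path):
    """Return the entries of an LD_LIBRARY_PATH-style string that mention warpctc.

    Hypothetical helper: it only inspects the path string and does not
    verify that the shared library itself exists on disk.
    """
    entries = [p for p in ld_library_path.split(":") if p]
    return [p for p in entries if "warpctc" in p]


if __name__ == "__main__":
    dirs = find_warpctc_dirs(os.environ.get("LD_LIBRARY_PATH", ""))
    if dirs:
        print("warpctc entries found:", dirs)
    else:
        print("No warpctc entry in LD_LIBRARY_PATH; re-run the export above.")
```

Run it in the same shell session as the export, since the variable is not inherited by shells that were already open.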

On some machines, libsndfile1 must also be installed. Details to be added.
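One way to tell whether a machine already has the library is to ask the system's library lookup for it. The sketch below uses `ctypes.util.find_library` from the Python standard library; the helper name `has_shared_library` is hypothetical, and the lookup name "sndfile" is an assumption based on the usual `libsndfile.so` file name.

```python
from ctypes.util import find_library


def has_shared_library(name):
    """Return True if the system library lookup can locate lib<name>.

    Hypothetical helper: a False result means the library was not found
    on the default search path, not necessarily that it is absent.
    """
    return find_library(name) is not None


if __name__ == "__main__":
    if has_shared_library("sndfile"):
        print("libsndfile appears to be installed.")
    else:
        print("libsndfile not found; install libsndfile1 before continuing.")
```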

Usage

Preparing Data

cd data
python librispeech.py
cd ..

More help for arguments (run from the data directory):

python librispeech.py --help

Training

For GPU Training:

CUDA_VISIBLE_DEVICES=0,1,2,3 python train.py --trainer_count 4

For CPU Training:

python train.py --trainer_count 8 --use_gpu False

More help for arguments:

python train.py --help
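In the GPU example above, --trainer_count 4 matches the four device IDs listed in CUDA_VISIBLE_DEVICES. Assuming the two should always agree (an inference from that example, not something this README states), the count can be derived from the environment variable instead of typed by hand. The helper name `count_visible_devices` is hypothetical.

```python
import os


def count_visible_devices(cuda_visible_devices):
    """Count the device IDs in a CUDA_VISIBLE_DEVICES-style string such as "0,1,2,3"."""
    return len([d for d in cuda_visible_devices.split(",") if d.strip()])


if __name__ == "__main__":
    n = count_visible_devices(os.environ.get("CUDA_VISIBLE_DEVICES", ""))
    if n > 0:
        # Print the training command with a matching trainer count.
        print("python train.py --trainer_count %d" % n)
    else:
        print("CUDA_VISIBLE_DEVICES is unset or empty; choose --trainer_count manually.")
```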

Inference

python infer.py

More help for arguments:

python infer.py --help