PaddleSpeech/cloud/pcloud_train.sh

#! /usr/bin/env bash

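# Positional arguments.
TRAIN_MANIFEST=$1   # input manifest for the training set
DEV_MANIFEST=$2     # input manifest for the dev set
MODEL_PATH=$3       # passed to train.py as --output_model_dir
NUM_GPU=$4          # passed as --trainer_count and --num_proc_data
BATCH_SIZE=$5       # passed as --batch_size
IS_LOCAL=$6         # passed as --is_local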

# Write the training manifest to a node-local path.
python ./cloud/split_data.py \
--in_manifest_path="${TRAIN_MANIFEST}" \
--out_manifest_path='/local.manifest.train'

# Write the dev manifest to a node-local path.
python ./cloud/split_data.py \
--in_manifest_path="${DEV_MANIFEST}" \
--out_manifest_path='/local.manifest.dev'

# -p avoids an error if ./logs already exists.
mkdir -p ./logs

python -u train.py \
--batch_size="${BATCH_SIZE}" \
--trainer_count="${NUM_GPU}" \
--num_passes=200 \
--num_proc_data="${NUM_GPU}" \
--num_conv_layers=2 \
--num_rnn_layers=3 \
--rnn_layer_size=2048 \
--num_iter_print=100 \
--learning_rate=5e-4 \
--max_duration=27.0 \
--min_duration=0.0 \
--use_sortagrad=True \
--use_gru=False \
--use_gpu=True \
--is_local="${IS_LOCAL}" \
--share_rnn_weights=True \
--train_manifest='/local.manifest.train' \
--dev_manifest='/local.manifest.dev' \
--mean_std_path='data/librispeech/mean_std.npz' \
--vocab_path='data/librispeech/vocab.txt' \
--output_model_dir="${MODEL_PATH}" \
--augment_conf_path='conf/augmentation.config' \
--specgram_type='linear' \
--shuffle_method='batch_shuffle_clipped' \
2>&1 | tee ./logs/train.log