PaddleSpeech/cloud/README.md

# Train DeepSpeech2 on PaddleCloud

>Note:
>Please make sure [PaddleCloud Client](https://github.com/PaddlePaddle/cloud/blob/develop/doc/usage_cn.md#%E4%B8%8B%E8%BD%BD%E5%B9%B6%E9%85%8D%E7%BD%AEpaddlecloud) has be installed and current directory is `deep_speech_2/cloud/`

## Step 1:  Upload Data

Provided with several input manifests, `pcloud_upload_data.sh` will pack and upload all the containing audio files to PaddleCloud filesystem, and also generate some corresponding manifest files with updated cloud paths.

Please modify the following arguments in `pcloud_upload_data.sh`:

- `IN_MANIFESTS`： Paths (in local filesystem) of manifest files containing the audio files to be uploaded. Multiple paths can be concatenated with a whitespace delimeter.
- `OUT_MANIFESTS`: Paths (in local filesystem) to write the updated output manifest files to. Multiple paths can be concatenated with a whitespace delimeter. The values of `audio_filepath` in the output manifests are updated with cloud filesystem paths.
- `CLOUD_DATA_DIR`:  Directory (in PaddleCloud filesystem) to upload the data to. Don't forget to replace `USERNAME` in the default directory and make sure that you have the permission to write it.
- `NUM_SHARDS`: Number of data shards / parts (in tar files) to be generated when packing and uploading data. Smaller `num_shards` requires larger temoporal local disk space for packing data.

By running:

```
sh pcloud_upload_data.sh
```
all the audio files will be uploaded to PaddleCloud filesystem, and you will get modified manifests files in `OUT_MANIFESTS`.

You have to take this step only once, in the very first time you do the cloud training. Later on, the data is persisitent on the cloud filesystem and reusable for further job submissions.

## Step 2:  Configure Training

Configure cloud training arguments in `pcloud_submit.sh`, with the following arguments:

- `TRAIN_MANIFEST`: Manifest filepath (in local filesystem) for training. Notice that the`audio_filepath` should be in cloud filesystem, like those generated by `pcloud_upload_data.sh`.
- `DEV_MANIFEST`: Manifest filepath (in local filesystem) for validation.
- `CLOUD_MODEL_DIR`: Directory (in PaddleCloud filesystem) to save the model parameters (checkpoints). Don't forget to replace `USERNAME` in the default directory and make sure that you have the permission to write it.
- `BATCH_SIZE`: Training batch size for a single node.
- `NUM_GPU`: Number of GPUs allocated for a single node.
- `NUM_NODE`: Number of nodes (machines) allocated for this job.
- `IS_LOCAL`: Set to False to enable parameter server, if using multiple nodes.

Configure other training hyper-parameters in `pcloud_train.sh` as you wish, just as what you can do in local training.

By running:

```
sh pcloud_submit.sh
```
you submit a training job to PaddleCloud. And you will see the job name when the submission is done.


## Step 3  Get Job Logs

Run this to list all the jobs you have submitted, as well as their running status:

```
paddlecloud get jobs
```

Run this, the corresponding job's logs will be printed.
```
paddlecloud logs -n 10000 $REPLACED_WITH_YOUR_ACTUAL_JOB_NAME
```

## More Help

For more information about the usage of PaddleCloud, please refer to [PaddleCloud Usage](https://github.com/PaddlePaddle/cloud/blob/develop/doc/usage_cn.md#提交任务).
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
+								# Train DeepSpeech2 on PaddleCloud
-												Refine submitting scripts for deepspeech2 on paddle cloud.

											
										
										
											7 years ago
-												Fix some syntax errors.

											
										
										
											7 years ago
+								>Note:
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
+								>Please make sure [PaddleCloud Client](https://github.com/PaddlePaddle/cloud/blob/develop/doc/usage_cn.md#%E4%B8%8B%E8%BD%BD%E5%B9%B6%E9%85%8D%E7%BD%AEpaddlecloud) has be installed and current directory is `deep_speech_2/cloud/`
-												Refine submitting scripts for deepspeech2 on paddle cloud.

											
										
										
											7 years ago
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
+								## Step 1:  Upload Data
-												Refine submitting scripts for deepspeech2 on paddle cloud.

											
										
										
											7 years ago
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
+								Provided with several input manifests, `pcloud_upload_data.sh` will pack and upload all the containing audio files to PaddleCloud filesystem, and also generate some corresponding manifest files with updated cloud paths.
-												Refine submitting scripts for deepspeech2 on paddle cloud.

											
										
										
											7 years ago
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
+								Please modify the following arguments in `pcloud_upload_data.sh`:
-												Update DS2 cloud training according to review comments.

											
										
										
											7 years ago
+								- `IN_MANIFESTS`： Paths (in local filesystem) of manifest files containing the audio files to be uploaded. Multiple paths can be concatenated with a whitespace delimeter.
 								- `OUT_MANIFESTS`: Paths (in local filesystem) to write the updated output manifest files to. Multiple paths can be concatenated with a whitespace delimeter. The values of `audio_filepath` in the output manifests are updated with cloud filesystem paths.
 								- `CLOUD_DATA_DIR`:  Directory (in PaddleCloud filesystem) to upload the data to. Don't forget to replace `USERNAME` in the default directory and make sure that you have the permission to write it.
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
+								- `NUM_SHARDS`: Number of data shards / parts (in tar files) to be generated when packing and uploading data. Smaller `num_shards` requires larger temoporal local disk space for packing data.
-												Implement uploading data to PaddleCloud
1. Refine data_utils/data.py, reuse process_utterance function.
2. Modified README.
3. Implement uploading data in cloud/upload_data.py
4. Merge branch 'develop' of https://github.com/PaddlePaddle/models into ds2_pcloud

											
										
										
											7 years ago
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
+								By running:
-												Implement uploading data to PaddleCloud
1. Refine data_utils/data.py, reuse process_utterance function.
2. Modified README.
3. Implement uploading data in cloud/upload_data.py
4. Merge branch 'develop' of https://github.com/PaddlePaddle/models into ds2_pcloud

											
										
										
											7 years ago
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
+								```
 								sh pcloud_upload_data.sh
 								```
 								all the audio files will be uploaded to PaddleCloud filesystem, and you will get modified manifests files in `OUT_MANIFESTS`.
-												Implement uploading data to PaddleCloud
1. Refine data_utils/data.py, reuse process_utterance function.
2. Modified README.
3. Implement uploading data in cloud/upload_data.py
4. Merge branch 'develop' of https://github.com/PaddlePaddle/models into ds2_pcloud

											
										
										
											7 years ago
-												Update DS2 cloud training according to review comments.

											
										
										
											7 years ago
+								You have to take this step only once, in the very first time you do the cloud training. Later on, the data is persisitent on the cloud filesystem and reusable for further job submissions.
-												Implement uploading data to PaddleCloud
1. Refine data_utils/data.py, reuse process_utterance function.
2. Modified README.
3. Implement uploading data in cloud/upload_data.py
4. Merge branch 'develop' of https://github.com/PaddlePaddle/models into ds2_pcloud

											
										
										
											7 years ago
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
+								## Step 2:  Configure Training
-												Implement uploading data to PaddleCloud
1. Refine data_utils/data.py, reuse process_utterance function.
2. Modified README.
3. Implement uploading data in cloud/upload_data.py
4. Merge branch 'develop' of https://github.com/PaddlePaddle/models into ds2_pcloud

											
										
										
											7 years ago
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
+								Configure cloud training arguments in `pcloud_submit.sh`, with the following arguments:
-												Implement uploading data to PaddleCloud
1. Refine data_utils/data.py, reuse process_utterance function.
2. Modified README.
3. Implement uploading data in cloud/upload_data.py
4. Merge branch 'develop' of https://github.com/PaddlePaddle/models into ds2_pcloud

											
										
										
											7 years ago
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
+								- `TRAIN_MANIFEST`: Manifest filepath (in local filesystem) for training. Notice that the`audio_filepath` should be in cloud filesystem, like those generated by `pcloud_upload_data.sh`.
 								- `DEV_MANIFEST`: Manifest filepath (in local filesystem) for validation.
-												Update DS2 cloud training according to review comments.

											
										
										
											7 years ago
+								- `CLOUD_MODEL_DIR`: Directory (in PaddleCloud filesystem) to save the model parameters (checkpoints). Don't forget to replace `USERNAME` in the default directory and make sure that you have the permission to write it.
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
+								- `BATCH_SIZE`: Training batch size for a single node.
 								- `NUM_GPU`: Number of GPUs allocated for a single node.
 								- `NUM_NODE`: Number of nodes (machines) allocated for this job.
 								- `IS_LOCAL`: Set to False to enable parameter server, if using multiple nodes.
 								Configure other training hyper-parameters in `pcloud_train.sh` as you wish, just as what you can do in local training.
 								By running:
-												Implement uploading data to PaddleCloud
1. Refine data_utils/data.py, reuse process_utterance function.
2. Modified README.
3. Implement uploading data in cloud/upload_data.py
4. Merge branch 'develop' of https://github.com/PaddlePaddle/models into ds2_pcloud

											
										
										
											7 years ago
-												Refine submitting scripts for deepspeech2 on paddle cloud.

											
										
										
											7 years ago
+								```
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
+								sh pcloud_submit.sh
-												Refine submitting scripts for deepspeech2 on paddle cloud.

											
										
										
											7 years ago
+								```
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
+								you submit a training job to PaddleCloud. And you will see the job name when the submission is done.
-												Refine submitting scripts for deepspeech2 on paddle cloud.

											
										
										
											7 years ago
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
+								## Step 3  Get Job Logs
 								Run this to list all the jobs you have submitted, as well as their running status:
-												Refine submitting scripts for deepspeech2 on paddle cloud.

											
										
										
											7 years ago
 								```
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
+								paddlecloud get jobs
-												Refine submitting scripts for deepspeech2 on paddle cloud.

											
										
										
											7 years ago
+								```
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
 								Run this, the corresponding job's logs will be printed.
-												Implement uploading data to PaddleCloud
1. Refine data_utils/data.py, reuse process_utterance function.
2. Modified README.
3. Implement uploading data in cloud/upload_data.py
4. Merge branch 'develop' of https://github.com/PaddlePaddle/models into ds2_pcloud

											
										
										
											7 years ago
+								```
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
+								paddlecloud logs -n 10000 $REPLACED_WITH_YOUR_ACTUAL_JOB_NAME
-												Implement uploading data to PaddleCloud
1. Refine data_utils/data.py, reuse process_utterance function.
2. Modified README.
3. Implement uploading data in cloud/upload_data.py
4. Merge branch 'develop' of https://github.com/PaddlePaddle/models into ds2_pcloud

											
										
										
											7 years ago
+								```
-												Update README for DS2 cloud training.

											
										
										
											7 years ago
 								## More Help
 								For more information about the usage of PaddleCloud, please refer to [PaddleCloud Usage](https://github.com/PaddlePaddle/cloud/blob/develop/doc/usage_cn.md#提交任务).