# Installation
There are 3 ways to install `PaddleSpeech`, graded by difficulty as **Easy**, **Medium** and **Hard**. Choose the one that matches the functions you need.

| Way | Function                                                     |
| :---- | :----------------------------------------------------------- |
| Easy     | (1) Use the command-line functions of PaddleSpeech. <br> (2) Try PaddleSpeech on AI Studio. |
| Medium     | Supports the major functions, such as using the ready-made examples and training your own model with PaddleSpeech. |
| Hard     | Supports the full functionality of PaddleSpeech, including training an n-gram language model. This setup also suits developers who want to work on PaddleSpeech itself. |


## Prerequisites
- Python >= 3.7
- PaddlePaddle latest version (please refer to the [Installation Guide](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/index_en.html))
- Tip: On Linux and Mac, run the scripts with `bash`, not `sh`.
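
The prerequisites above can be checked before installing anything. The following is a minimal sketch; the function name `python_is_supported` is illustrative and not part of PaddleSpeech, and it assumes the interpreter is called `python3` unless another name is passed in.

```bash
# Succeeds when the given interpreter (default: python3) is Python 3.7 or newer.
python_is_supported() {
    "${1:-python3}" -c 'import sys; sys.exit(0 if sys.version_info >= (3, 7) else 1)'
}

if python_is_supported; then
    echo "Python version is supported"
else
    echo "Python >= 3.7 was not found"
fi

# BASH_VERSION is only set by bash, so an empty value means another shell is in use.
if [ -z "${BASH_VERSION:-}" ]; then
    echo "warning: not running under bash"
fi
```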

## Easy: Get the Basic Function (Support Linux, Mac and Windows)
- If you are new to `PaddleSpeech` and want to try it without using your own machine, we recommend [AI Studio](https://aistudio.baidu.com/aistudio/index). It provides a step-by-step tutorial for `PaddleSpeech`, and you can use the basic functions of `PaddleSpeech` on a free machine.
### Install Conda
Conda is an environment management system. You can install it from [miniconda](https://docs.conda.io/en/latest/miniconda.html).
Then install the conda dependencies for `paddlespeech`:

```bash
conda install -y -c conda-forge sox libsndfile swig bzip2
```
### Install C++ environment

#### Windows
Some required PyPI packages need a C++ build environment, so install Visual Studio first.

#### Mac
```bash
brew install gcc
```
#### Linux
```bash
#  centos
sudo yum install gcc gcc-c++
```
```bash
# ubuntu
sudo apt install build-essential
```
```bash
# Others (channel assumed to be conda-forge, as for the other conda packages)
conda install -y -c conda-forge gcc_linux-64=8.4.0 gxx_linux-64=8.4.0
```

### Install PaddleSpeech 
You can use the following command:
```bash
pip install paddlepaddle paddlespeech
```

- You can now use the command-line functions of PaddleSpeech. For more information, see the [cli](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/paddlespeech/cli).
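
To confirm that the `pip install` step worked, one option is to ask Python whether both packages can be located. This is a sketch only; the helper name `has_module` is hypothetical, and it assumes the packages were installed for the interpreter named `python3`.

```bash
# Succeeds when the named module can be found by python3 (the module is not imported).
has_module() {
    python3 -c "import importlib.util, sys; sys.exit(0 if importlib.util.find_spec('$1') else 1)" 2>/dev/null
}

for mod in paddle paddlespeech; do
    if has_module "$mod"; then
        echo "$mod: installed"
    else
        echo "$mod: missing"
    fi
done
```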


## Medium: Get the Major Function (Support Linux)
To get the major functions of `paddlespeech`, complete the following 3 steps.

### Install Conda
Conda is an environment management system. You can go to [miniconda](https://docs.conda.io/en/latest/miniconda.html), select a version (py>=3.7) and install it yourself, or you can use the following commands:
```bash
# download the miniconda
wget https://repo.continuum.io/miniconda/Miniconda3-latest-Linux-x86_64.sh
# install the miniconda
bash Miniconda3-latest-Linux-x86_64.sh -b
# conda init
$HOME/miniconda3/bin/conda init
# activate the conda
bash
```
Then create a conda virtual environment with the following command:
```bash
conda create -y -p tools/venv python=3.7
```
Activate the conda virtual environment:
```bash
conda activate tools/venv
```
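
After activation, conda exports the environment's absolute path in `CONDA_PREFIX`, so a script can verify that the right environment is active before it installs packages. The sketch below assumes the `tools/venv` prefix used above; the function name `in_conda_env` is illustrative.

```bash
# Succeeds when the active conda environment's prefix ends with the given path.
in_conda_env() {
    case "${CONDA_PREFIX:-}" in
        */"$1"|"$1") return 0 ;;
        *) return 1 ;;
    esac
}

if in_conda_env tools/venv; then
    echo "tools/venv is active"
else
    echo "tools/venv is not active; run: conda activate tools/venv"
fi
```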
Install the conda dependencies for `paddlespeech`:
```bash
conda install -y -c conda-forge sox libsndfile swig bzip2
```
Do not forget to install `gcc` and `g++` on your system.
On Linux, you can use one of the commands below to install them.

```bash
#  centos
sudo yum install gcc gcc-c++
```
```bash
# ubuntu
sudo apt install build-essential
```
```bash
# Others (channel assumed to be conda-forge, as for the other conda packages)
conda install -y -c conda-forge gcc_linux-64=8.4.0 gxx_linux-64=8.4.0
```
(Tip: Do not use the last command if you plan to install the **Hard** way.)
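
Before moving on, it can help to confirm that the compilers and the conda-provided tools are on `PATH`. The following is a rough sketch; the helper name `missing_cmds` and the list of commands checked are assumptions chosen for illustration.

```bash
# Print every command from the argument list that is not found on PATH.
missing_cmds() {
    for cmd in "$@"; do
        command -v "$cmd" >/dev/null 2>&1 || echo "$cmd"
    done
}

missing=$(missing_cmds gcc g++ sox swig)
if [ -z "$missing" ]; then
    echo "all build tools found"
else
    # Unquoted on purpose so the names are joined on one line.
    echo "missing:" $missing
fi
```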
### Install PaddlePaddle
For example, for CUDA 10.2 and cuDNN 7.5, install paddle 2.2.0:
```bash
python3 -m pip install paddlepaddle-gpu==2.2.0
```
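
PaddlePaddle ships a self-test, `paddle.utils.run_check()`, that can be used to confirm the installation. The wrapper below is a sketch; the function name `verify_paddle` is illustrative, and the interpreter is assumed to be `python3` unless another one is passed in.

```bash
# Print the installed paddle version and run PaddlePaddle's built-in self-test.
verify_paddle() {
    "${1:-python3}" -c 'import paddle; print(paddle.__version__); paddle.utils.run_check()'
}

if verify_paddle; then
    echo "PaddlePaddle is working"
else
    echo "PaddlePaddle check failed; see the error above"
fi
```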
### Install PaddleSpeech 
If you want to use the ready-made examples in `paddlespeech`, clone this repository and install `paddlespeech` with the following commands:
```bash
git clone https://github.com/PaddlePaddle/PaddleSpeech.git
cd PaddleSpeech
pip install .
```
## Hard: Get the Full Function on Your Machine
### Prerequisites
- Choice 1: work in an `Ubuntu` Docker container.
- Choice 2: work on `Ubuntu` with `root` privileges.

To avoid the trouble of environment setup, [running in Docker container](#running-in-docker-container) is highly recommended. Otherwise, if you work on `Ubuntu` with `root` privilege, you can skip the next step.

### Choice 1: Running in Docker Container (Recommended)
Docker is an open-source tool to build, ship, and run distributed applications in an isolated environment. A Docker image for this project has been provided in [hub.docker.com](https://hub.docker.com) with all the dependencies installed. This Docker image requires an NVIDIA GPU, so make sure one is available and that [nvidia-docker](https://github.com/NVIDIA/nvidia-docker) is installed.

Take several steps to launch the Docker image:
- Download the Docker image

For example, pull the paddle 2.2.0 image:
```bash
nvidia-docker pull registry.baidubce.com/paddlepaddle/paddle:2.2.0-gpu-cuda10.2-cudnn7
```
- Clone this repository
```bash
git clone https://github.com/PaddlePaddle/PaddleSpeech.git
```
- Run the Docker image

```bash
sudo nvidia-docker run --net=host --ipc=host --rm -it -v $(pwd)/PaddleSpeech:/PaddleSpeech registry.baidubce.com/paddlepaddle/paddle:2.2.0-gpu-cuda10.2-cudnn7 /bin/bash
```
Now you can run training, inference and hyper-parameter tuning in the Docker container.
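
The flags in the `docker run` command above can be read one by one. The sketch below only prints the argument list for review instead of launching a container; the function name `docker_run_args` and its two parameters are illustrative and not part of PaddleSpeech.

```bash
# Print the `docker run` arguments, one per line, for a given checkout and image.
#   --net=host  share the host network stack with the container
#   --ipc=host  share the host IPC namespace (shared memory)
#   --rm        remove the container when the shell exits
#   -it         keep stdin open and allocate a terminal
#   -v SRC:DST  mount the cloned repository at /PaddleSpeech inside the container
docker_run_args() {
    printf '%s\n' run --net=host --ipc=host --rm -it \
        -v "$1:/PaddleSpeech" "$2" /bin/bash
}

IMAGE="registry.baidubce.com/paddlepaddle/paddle:2.2.0-gpu-cuda10.2-cudnn7"
docker_run_args "$(pwd)/PaddleSpeech" "$IMAGE"
```

Printing the arguments first makes it easy to check the mount path before the container starts.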
### Choice 2: Running in Ubuntu with Root Privilege
- Install `build-essential` by apt
```bash
sudo apt install build-essential
```
- Clone this repository
```bash
git clone https://github.com/PaddlePaddle/PaddleSpeech.git
```
- Install paddle 2.2.0:
```bash
python3 -m pip install paddlepaddle-gpu==2.2.0
```
### Install the Conda
```bash
# download and install the miniconda
pushd tools
bash extras/install_miniconda.sh
popd
# start a new bash shell so that the conda setup takes effect
bash
# create a conda virtual environment
conda create -y -p tools/venv python=3.7
# activate the conda virtual environment
conda activate tools/venv
# install the conda packages
conda install -y -c conda-forge sox libsndfile swig bzip2 libflac bc
```
### Install PaddlePaddle
For example, for CUDA 10.2 and cuDNN 7.5, install paddle 2.2.0:

```bash
python3 -m pip install paddlepaddle-gpu==2.2.0
```
### Get the Function for Developing PaddleSpeech
```bash
pip install -e .[develop]
```
### Install the Kaldi (Optional)
```bash
pushd tools
bash extras/install_openblas.sh
bash extras/install_kaldi.sh
popd
```


## Setup for Other Platform 
- Make sure the libraries and tools listed in [dependencies](./dependencies.md) are installed. For more information, see `setup.py` and `tools/Makefile`.
- The version of `swig` should be >= 3.0.
- We will simplify the installation process in the future.
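
The `swig` version requirement can be checked from the shell. This is a minimal sketch that assumes `sort -V` is available (GNU coreutils) and that `swig -version` prints a line of the form `SWIG Version X.Y.Z`; the helper names `version_ge` and `swig_version` are illustrative.

```bash
# version_ge A B: succeeds when dotted version A is >= B (relies on `sort -V`).
version_ge() {
    [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n 1)" = "$2" ]
}

# Extract the version number from the "SWIG Version X.Y.Z" line.
swig_version() {
    swig -version 2>/dev/null | awk '/SWIG Version/ {print $3}'
}

if command -v swig >/dev/null 2>&1; then
    ver=$(swig_version)
    if version_ge "$ver" 3.0; then
        echo "swig $ver is new enough"
    else
        echo "swig $ver is older than 3.0"
    fi
else
    echo "swig not found"
fi
```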
-												Support paddle 2.x (#538)

* 2.x model

* model test pass

* fix data

* fix soundfile with flac support

* one thread dataloader test pass

* export feasture size
add trainer and utils
add setup model and dataloader
update travis using Bionic dist

* add venv; test under venv

* fix unittest; train and valid

* add train and config

* add config and train script

* fix ctc cuda memcopy error

* fix imports

* fix train valid log

* fix dataset batch shuffle shift start from 1
fix rank_zero_only decreator error
close tensorboard when train over
add decoding config and code

* test process can run

* test with decoding

* test and infer with decoding

* fix infer

* fix ctc loss
lr schedule
sortagrad
logger

* aishell egs

* refactor train
add aishell egs

* fix dataset batch shuffle and add batch sampler log
print model parameter

* fix model and ctc

* sequence_mask make all inputs zeros, which cause grad be zero, this is a bug of LessThanOp
add grad clip by global norm
add model train test notebook

* ctc loss
remove run prefix
using ord value as text id

* using unk when training
compute_loss need text ids
ord id using in test mode, which compute wer/cer

* fix tester

* add lr_deacy
refactor code

* fix tools

* fix ci
add tune
fix gru model bugs
add dataset and model test

* fix decoding

* refactor repo
fix decoding

* fix musan and rir dataset

* refactor io, loss, conv, rnn, gradclip, model, utils

* fix ci and import

* refactor model
add export jit model

* add deploy bin and test it

* rm uselss egs

* add layer tools

* refactor socket server
new model from pretrain

* remve useless

* fix instability loss and grad nan or inf for librispeech training

* fix sampler

* fix libri train.sh

* fix doc

* add license on cpp

* fix doc

* fix libri script

* fix install

* clip 5 wer 7.39, clip 400 wer 7.54, 1.8 clip 400 baseline 7.49
											
										
										
											4 years ago
+								# Installation
-												[install.md]Update install.md (#1123)

* Update install.md

* Update install.md

* Update install.md

* Update install.md
											
										
										
											3 years ago
+								There are 3 ways to use `PaddleSpeech`. According to the degree of difficulty, the 3 ways can be divided into **Easy**, **Medium** and **Hard**. You can choose one of the 3 ways to install `PaddleSpeech`.
-												revise the install.md, setup.py and makefile, rm the setup.sh

											
										
										
											3 years ago
-												[install.md]Update install.md (#1123)

* Update install.md

* Update install.md

* Update install.md

* Update install.md
											
										
										
											3 years ago
+								| Way | Function                                                     |
 								| :---- | :----------------------------------------------------------- |
 								| Easy     | (1) Use command line functions of PaddleSpeech. <br> (2) Experience PaddleSpeech on Aistudio. |
 								| Medium     | Support major function，such as using the` ready-made `examples and using PaddleSpeech to train your own model.                                           |
 								| Hard     | Support full function of Paddlespeech，including training n-gram language model. And you are more able be a developer! |
-												revise the install.md, setup.py and makefile, rm the setup.sh

											
										
										
											3 years ago
-												[install.md]Update install.md (#1123)

* Update install.md

* Update install.md

* Update install.md

* Update install.md
											
										
										
											3 years ago
 								## Prerequisites
-												Support paddle 2.x (#538)

* 2.x model

* model test pass

* fix data

* fix soundfile with flac support

* one thread dataloader test pass

* export feasture size
add trainer and utils
add setup model and dataloader
update travis using Bionic dist

* add venv; test under venv

* fix unittest; train and valid

* add train and config

* add config and train script

* fix ctc cuda memcopy error

* fix imports

* fix train valid log

* fix dataset batch shuffle shift start from 1
fix rank_zero_only decreator error
close tensorboard when train over
add decoding config and code

* test process can run

* test with decoding

* test and infer with decoding

* fix infer

* fix ctc loss
lr schedule
sortagrad
logger

* aishell egs

* refactor train
add aishell egs

* fix dataset batch shuffle and add batch sampler log
print model parameter

* fix model and ctc

* sequence_mask make all inputs zeros, which cause grad be zero, this is a bug of LessThanOp
add grad clip by global norm
add model train test notebook

* ctc loss
remove run prefix
using ord value as text id

* using unk when training
compute_loss need text ids
ord id using in test mode, which compute wer/cer

* fix tester

* add lr_deacy
refactor code

* fix tools

* fix ci
add tune
fix gru model bugs
add dataset and model test

* fix decoding

* refactor repo
fix decoding

* fix musan and rir dataset

* refactor io, loss, conv, rnn, gradclip, model, utils

* fix ci and import

* refactor model
add export jit model

* add deploy bin and test it

* rm uselss egs

* add layer tools

* refactor socket server
new model from pretrain

* remve useless

* fix instability loss and grad nan or inf for librispeech training

* fix sampler

* fix libri train.sh

* fix doc

* add license on cpp

* fix doc

* fix libri script

* fix install

* clip 5 wer 7.39, clip 400 wer 7.54, 1.8 clip 400 baseline 7.49
											
										
										
											4 years ago
+								- Python >= 3.7
-												fix doc link

											
										
										
											3 years ago
+								- PaddlePaddle latest version (please refer to the [Installation Guide](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/index_en.html))
-												[install.md]Update install.md (#1123)

* Update install.md

* Update install.md

* Update install.md

* Update install.md
											
										
										
											3 years ago
+								- Hip: For Linux and Mac, do not use command `sh` instead of command `bash`
 								## Easy: Get the Basic Function (Support Linux, Mac and Windows)
 								- If you are newer to `PaddleSpeech` and want to experience it easily without your own machine. We recommend you to use [AI Studio](https://aistudio.baidu.com/aistudio/index) to experience it. There is a step-by-step tutorial for `PaddleSpeech` and you can use the basic function of `PaddleSpeech` with a free machine.
 								### Install Conda
 								Conda is a management system of the environment. You can go to [minicoda](https://docs.conda.io/en/latest/miniconda.html) to install the conda.
 								And then Install  conda dependencies for `paddlespeech` :
-												revise the install.md, setup.py and makefile, rm the setup.sh

											
										
										
											3 years ago
-												[install.md]Update install.md (#1123)

* Update install.md

* Update install.md

* Update install.md

* Update install.md
											
										
										
											3 years ago
+								```bash
 								conda install -y -c conda-forge sox libsndfile swig bzip2
 								```
 								### Install C++ environment
 								#### Windows
 								Since some required pypi packages need C++ environment, you need to install the visual studio firstly.
 								#### Mac
 								```bash
 								brew install gcc
 								```
 								#### Linux
 								```bash
 								#  centos
 								sudo yum install gcc gcc-c++
 								```
 								```bash
 								# ubuntu
 								sudo apt install build-essential
 								```
 								```bash
 								# Others
 								conda install -y -c gcc_linux-64=8.4.0 gxx_linux-64=8.4.0
 								```
 								### Install PaddleSpeech
 								You can use the following command:
 								```bash
 								pip install paddlepaddle paddlespeech
 								```
 								- You can use the command line function of Paddlespeech. For more information, you can see the [cli](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/paddlespeech/cli).
 								## Medium: Get the Major Function (Support Linux)
 								If you want to get the major function of  `paddlespeech`. There are 3 steps you need to do.
-												Support paddle 2.x (#538)

* 2.x model

* model test pass

* fix data

* fix soundfile with flac support

* one thread dataloader test pass

* export feasture size
add trainer and utils
add setup model and dataloader
update travis using Bionic dist

* add venv; test under venv

* fix unittest; train and valid

* add train and config

* add config and train script

* fix ctc cuda memcopy error

* fix imports

* fix train valid log

* fix dataset batch shuffle shift start from 1
fix rank_zero_only decreator error
close tensorboard when train over
add decoding config and code

* test process can run

* test with decoding

* test and infer with decoding

* fix infer

* fix ctc loss
lr schedule
sortagrad
logger

* aishell egs

* refactor train
add aishell egs

* fix dataset batch shuffle and add batch sampler log
print model parameter

* fix model and ctc

* sequence_mask make all inputs zeros, which cause grad be zero, this is a bug of LessThanOp
add grad clip by global norm
add model train test notebook

* ctc loss
remove run prefix
using ord value as text id

* using unk when training
compute_loss need text ids
ord id using in test mode, which compute wer/cer

* fix tester

* add lr_deacy
refactor code

* fix tools

* fix ci
add tune
fix gru model bugs
add dataset and model test

* fix decoding

* refactor repo
fix decoding

* fix musan and rir dataset

* refactor io, loss, conv, rnn, gradclip, model, utils

* fix ci and import

* refactor model
add export jit model

* add deploy bin and test it

* rm uselss egs

* add layer tools

* refactor socket server
new model from pretrain

* remve useless

* fix instability loss and grad nan or inf for librispeech training

* fix sampler

* fix libri train.sh

* fix doc

* add license on cpp

* fix doc

* fix libri script

* fix install

* clip 5 wer 7.39, clip 400 wer 7.54, 1.8 clip 400 baseline 7.49
											
										
										
											4 years ago
-												[DOCS] Correct the grammar and spelling mistakes of install.md (#1115)


											
										
										
											3 years ago
+								### Install Conda
 								Conda is a management system of the environment. You can go to [minicoda](https://docs.conda.io/en/latest/miniconda.html) to select a version (py>=3.7) and install it by yourself or you can use the following command:
-												optimize the setup.py and setup.sh

											
										
										
											3 years ago
+								```bash
-												revise the install.md, setup.py and makefile, rm the setup.sh

											
										
										
											3 years ago
+								# download the miniconda
 								wget https://repo.continuum.io/miniconda/Miniconda3-latest-Linux-x86_64.sh
 								# install the miniconda
 								bash Miniconda3-latest-Linux-x86_64.sh -b
 								# conda init
 								$HOME/miniconda3/bin/conda init
 								# activate the conda
 								bash
-												optimize the setup.py and setup.sh

											
										
										
											3 years ago
+								```
-												Update install.md
											
										
										
											3 years ago
+								Then you can create an conda virtual environment using the following command:
-												revise the install.md, setup.py and makefile, rm the setup.sh

											
										
										
											3 years ago
+								```bash
-												fix install

											
										
										
											3 years ago
+								conda create -y -p tools/venv python=3.7
-												optimize the setup.py and setup.sh

											
										
										
											3 years ago
+								```
-												revise the install.md, setup.py and makefile, rm the setup.sh

											
										
										
											3 years ago
+								Activate the conda virtual environment:
 								```bash
-												fix install

											
										
										
											3 years ago
+								conda activate tools/venv
-												Support paddle 2.x (#538)

* 2.x model

* model test pass

* fix data

* fix soundfile with flac support

* one thread dataloader test pass

* export feasture size
add trainer and utils
add setup model and dataloader
update travis using Bionic dist

* add venv; test under venv

* fix unittest; train and valid

* add train and config

* add config and train script

* fix ctc cuda memcopy error

* fix imports

* fix train valid log

* fix dataset batch shuffle shift start from 1
fix rank_zero_only decreator error
close tensorboard when train over
add decoding config and code

* test process can run

* test with decoding

* test and infer with decoding

* fix infer

* fix ctc loss
lr schedule
sortagrad
logger

* aishell egs

* refactor train
add aishell egs

* fix dataset batch shuffle and add batch sampler log
print model parameter

* fix model and ctc

* sequence_mask make all inputs zeros, which cause grad be zero, this is a bug of LessThanOp
add grad clip by global norm
add model train test notebook

* ctc loss
remove run prefix
using ord value as text id

* using unk when training
compute_loss need text ids
ord id using in test mode, which compute wer/cer

* fix tester

* add lr_deacy
refactor code

* fix tools

* fix ci
add tune
fix gru model bugs
add dataset and model test

* fix decoding

* refactor repo
fix decoding

* fix musan and rir dataset

* refactor io, loss, conv, rnn, gradclip, model, utils

* fix ci and import

* refactor model
add export jit model

* add deploy bin and test it

* rm uselss egs

* add layer tools

* refactor socket server
new model from pretrain

* remve useless

* fix instability loss and grad nan or inf for librispeech training

* fix sampler

* fix libri train.sh

* fix doc

* add license on cpp

* fix doc

* fix libri script

* fix install

* clip 5 wer 7.39, clip 400 wer 7.54, 1.8 clip 400 baseline 7.49
											
										
										
											4 years ago
+								```
-												Update install.md
											
										
										
											3 years ago
+								Install  conda dependencies for `paddlespeech` :
-												optimize the setup.py and setup.sh

											
										
										
											3 years ago
+								```bash
-												Update install.md (#1117)


											
										
										
											3 years ago
+								conda install -y -c conda-forge sox libsndfile swig bzip2
 								```
 								Do not forget to install `gcc` and `gxx` on your system.
-												[install.md]Update install.md (#1123)

* Update install.md

* Update install.md

* Update install.md

* Update install.md
											
										
										
											3 years ago
+								If you use linux, you can choose to use the scripts below to install them.
-												Update install.md (#1117)


											
										
										
											3 years ago
-												[install.md]Update install.md (#1123)

* Update install.md

* Update install.md

* Update install.md

* Update install.md
											
										
										
											3 years ago
+								```bash
 								#  centos
 								sudo yum install gcc gcc-c++
-												Update install.md (#1117)


											
										
										
											3 years ago
+								```
-												[install.md]Update install.md (#1123)

* Update install.md

* Update install.md

* Update install.md

* Update install.md
											
										
										
											3 years ago
+								```bash
 								# ubuntu
 								sudo apt install build-essential
 								```
 								```bash
 								# Others
-												Update install.md (#1117)


											
										
										
											3 years ago
+								conda install -y -c gcc_linux-64=8.4.0 gxx_linux-64=8.4.0
-												add conda init, use gcc 8.4.0

											
										
										
											3 years ago
+								```
-												[install.md]Update install.md (#1123)

* Update install.md

* Update install.md

* Update install.md

* Update install.md
											
										
										
											3 years ago
+								(Hip: Do not use the last script if you want to install by **Hard** way):
-												revise

											
										
										
											3 years ago
+								### Install PaddlePaddle
 								For example, for CUDA 10.2, CuDNN7.5 install paddle 2.2.0:
 								```bash
 								python3 -m pip install paddlepaddle-gpu==2.2.0
 								```
-												Update install.md
											
										
										
											3 years ago
+								### Install PaddleSpeech
-												[DOCS] Correct the grammar and spelling mistakes of install.md (#1115)


											
										
										
											3 years ago
+								If you want to use the` ready-made `examples in `paddlespeech`, you need to clone this repository and install  `paddlespeech`  by the following commands:
-												revise the install.md, setup.py and makefile, rm the setup.sh

											
										
										
											3 years ago
+								```bash
 								https://github.com/PaddlePaddle/PaddleSpeech.git
 								cd PaddleSpeech
 								pip install .
 								```
-												[DOCS] Correct the grammar and spelling mistakes of install.md (#1115)


											
										
										
											3 years ago
+								## Hard: Get the Full Function on Your Machine
-												revise the install.md, setup.py and makefile, rm the setup.sh

											
										
										
											3 years ago
+								### Prerequisites
-												Update install.md
											
										
										
											3 years ago
+								- choice 1: working with `Ubuntu` Docker Container.
-												revise the install.md, setup.py and makefile, rm the setup.sh

											
										
										
											3 years ago
+								- choice 2: working on `Ubuntu` with `root` privilege.
-												[DOCS] Correct the grammar and spelling mistakes of install.md (#1115)


											
										
										
											3 years ago
+								To avoid the trouble of environment setup, [running in Docker container](#running-in-docker-container) is highly recommended. Otherwise, if you work on `Ubuntu` with `root` privilege, you can skip the next step.
-												revise the install.md, setup.py and makefile, rm the setup.sh

											
										
										
											3 years ago
 								### Choice 1: Running in Docker Container (Recommand)
-												[DOCS] Correct the grammar and spelling mistakes of install.md (#1115)


											
										
										
											3 years ago
+								Docker is an open-source tool to build, ship, and run distributed applications in an isolated environment. A Docker image for this project has been provided in [hub.docker.com](https://hub.docker.com) with all the dependencies installed. This Docker image requires the support of NVIDIA GPU, so please make sure its availability and the [nvidia-docker](https://github.com/NVIDIA/nvidia-docker) has been installed.
-												Support paddle 2.x (#538)

* 2.x model

* model test pass

* fix data

* fix soundfile with flac support

* one thread dataloader test pass

* export feasture size
add trainer and utils
add setup model and dataloader
update travis using Bionic dist

* add venv; test under venv

* fix unittest; train and valid

* add train and config

* add config and train script

* fix ctc cuda memcopy error

* fix imports

* fix train valid log

* fix dataset batch shuffle shift start from 1
fix rank_zero_only decreator error
close tensorboard when train over
add decoding config and code

* test process can run

* test with decoding

* test and infer with decoding

* fix infer

* fix ctc loss
lr schedule
sortagrad
logger

* aishell egs

* refactor train
add aishell egs

* fix dataset batch shuffle and add batch sampler log
print model parameter

* fix model and ctc

* sequence_mask make all inputs zeros, which cause grad be zero, this is a bug of LessThanOp
add grad clip by global norm
add model train test notebook

* ctc loss
remove run prefix
using ord value as text id

* using unk when training
compute_loss need text ids
ord id using in test mode, which compute wer/cer

* fix tester

* add lr_deacy
refactor code

* fix tools

* fix ci
add tune
fix gru model bugs
add dataset and model test

* fix decoding

* refactor repo
fix decoding

* fix musan and rir dataset

* refactor io, loss, conv, rnn, gradclip, model, utils

* fix ci and import

* refactor model
add export jit model

* add deploy bin and test it

* rm uselss egs

* add layer tools

* refactor socket server
new model from pretrain

* remve useless

* fix instability loss and grad nan or inf for librispeech training

* fix sampler

* fix libri train.sh

* fix doc

* add license on cpp

* fix doc

* fix libri script

* fix install

* clip 5 wer 7.39, clip 400 wer 7.54, 1.8 clip 400 baseline 7.49
											
										
										
											4 years ago
 								Take several steps to launch the Docker image:
 								- Download the Docker image
-												revise the install.md, setup.py and makefile, rm the setup.sh

											
										
										
											3 years ago
+								For example, pull paddle 2.2.0 image:
-												Support paddle 2.x (#538)

* 2.x model

* model test pass

* fix data

* fix soundfile with flac support

* one thread dataloader test pass

* export feasture size
add trainer and utils
add setup model and dataloader
update travis using Bionic dist

* add venv; test under venv

* fix unittest; train and valid

* add train and config

* add config and train script

* fix ctc cuda memcopy error

* fix imports

* fix train valid log

* fix dataset batch shuffle shift start from 1
fix rank_zero_only decreator error
close tensorboard when train over
add decoding config and code

* test process can run

* test with decoding

* test and infer with decoding

* fix infer

* fix ctc loss
lr schedule
sortagrad
logger

* aishell egs

* refactor train
add aishell egs

* fix dataset batch shuffle and add batch sampler log
print model parameter

* fix model and ctc

* sequence_mask make all inputs zeros, which cause grad be zero, this is a bug of LessThanOp
add grad clip by global norm
add model train test notebook

* ctc loss
remove run prefix
using ord value as text id

* using unk when training
compute_loss need text ids
ord id using in test mode, which compute wer/cer

* fix tester

* add lr_deacy
refactor code

* fix tools

* fix ci
add tune
fix gru model bugs
add dataset and model test

* fix decoding

* refactor repo
fix decoding

* fix musan and rir dataset

* refactor io, loss, conv, rnn, gradclip, model, utils

* fix ci and import

* refactor model
add export jit model

* add deploy bin and test it

* rm uselss egs

* add layer tools

* refactor socket server
new model from pretrain

* remve useless

```bash
nvidia-docker pull registry.baidubce.com/paddlepaddle/paddle:2.2.0-gpu-cuda10.2-cudnn7
```
- Clone this repository
```bash
git clone https://github.com/PaddlePaddle/PaddleSpeech.git
```
- Run the Docker image
```bash
sudo nvidia-docker run --net=host --ipc=host --rm -it -v $(pwd)/PaddleSpeech:/PaddleSpeech registry.baidubce.com/paddlepaddle/paddle:2.2.0-gpu-cuda10.2-cudnn7 /bin/bash
```
Now you can run training, inference, and hyper-parameter tuning inside the Docker container.
### Choice 2: Running in Ubuntu with Root Privilege
- Install `build-essential` with apt
```bash
sudo apt install build-essential
```
- Clone this repository
```bash
git clone https://github.com/PaddlePaddle/PaddleSpeech.git
```
- Install paddle 2.2.0:
```bash
python3 -m pip install paddlepaddle-gpu==2.2.0
```
### Install the Conda
```bash
# download and install miniconda
pushd tools
bash extras/install_miniconda.sh
popd
# start a new "bash" shell so that the conda environment takes effect
bash
# create a conda virtual environment
conda create -y -p tools/venv python=3.7
# activate the conda virtual environment
conda activate tools/venv
# install the conda packages
conda install -y -c conda-forge sox libsndfile swig bzip2 libflac bc
```
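As a quick sanity check after the conda step, a small sketch like the following could confirm that the command-line tools are reachable on `PATH`. The helper name `check_tools` is hypothetical and not part of PaddleSpeech, and it is an assumption that the conda packages `sox`, `swig` and `bc` provide executables with the same names.

```bash
# Hypothetical helper (not part of PaddleSpeech): report which of the given
# commands cannot be found on PATH. Returns non-zero if any is missing.
check_tools() {
    local missing=0
    local tool
    for tool in "$@"; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "missing: $tool" >&2
            missing=1
        fi
    done
    return "$missing"
}

# Assumption: sox, swig and bc are the executables shipped by the conda packages.
check_tools sox swig bc || echo "some tools are missing; is the conda environment activated?" >&2
```

If the check reports missing tools, re-running `conda activate tools/venv` in the current shell is the first thing to try.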
### Install PaddlePaddle
For example, for CUDA 10.2 and cuDNN 7.5, install paddle 2.2.0:
```bash
python3 -m pip install paddlepaddle-gpu==2.2.0
```
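To confirm that the installation worked, a minimal sketch along these lines may help. The helper name `py_has_module` is hypothetical, and the sketch assumes that `python3` is the same interpreter used for the `pip install` step; `paddle.utils.run_check()` is PaddlePaddle's own installation check.

```bash
# Hypothetical helper (not part of PaddleSpeech): succeed if python3 can locate the module.
py_has_module() {
    python3 -c 'import importlib.util, sys; sys.exit(0 if importlib.util.find_spec(sys.argv[1]) else 1)' "$1" 2>/dev/null
}

if py_has_module paddle; then
    # print the installed version, then run Paddle's built-in installation check
    python3 -c 'import paddle; print(paddle.__version__); paddle.utils.run_check()'
else
    echo "paddle is not importable by python3 in this environment" >&2
fi
```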
### Get the Function for Developing PaddleSpeech
```bash
pip install -e .[develop]
```
### Install the Kaldi (Optional)
```bash
pushd tools
bash extras/install_openblas.sh
bash extras/install_kaldi.sh
popd
```
## Setup for Other Platform
- Make sure the libraries and tools listed in [dependencies](./dependencies.md) are installed. For more information, see `setup.py` and `tools/Makefile`.
- The version of `swig` should be >= 3.0.
- We will simplify the installation process in the future.
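
The `swig` version requirement can be checked with a short sketch like the one below. The helper name `version_ge` is hypothetical, the comparison relies on `sort -V` (available in GNU coreutils, so this is an assumption on other platforms), and it is assumed that `swig -version` prints a line of the form `SWIG Version X.Y.Z`.

```bash
# Hypothetical helper: succeed if version $1 is greater than or equal to version $2.
# Relies on `sort -V` (version sort) placing the smaller version first.
version_ge() {
    [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n 1)" = "$2" ]
}

if command -v swig >/dev/null 2>&1; then
    # Assumption: `swig -version` prints a line like "SWIG Version X.Y.Z".
    swig_version="$(swig -version | awk '/SWIG Version/ {print $3}')"
    if version_ge "$swig_version" 3.0; then
        echo "swig $swig_version is new enough"
    else
        echo "swig $swig_version is older than 3.0" >&2
    fi
else
    echo "swig not found on PATH" >&2
fi
```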