parent
bd01bc155d
commit
a01c163dc3
@@ -1,27 +1,77 @@
|
|||||||
# u2/u2pp Streaming ASR
|
# U2/U2++ Streaming ASR
|
||||||
|
|
||||||
|
A C++ deployment example for the `PaddleSpeech/examples/wenetspeech/asr1` recipe. The model is a static model produced by `export`; to learn how to export the model, see [here](../../../../examples/wenetspeech/asr1/). If you want to use the exported model, `run.sh` will download it; for the model link, see `run.sh`.
|
||||||
|
|
||||||
|
This example demonstrates how to use the u2/u2++ model to recognize `wav` files and compute `CER`. We use AISHELL-1 as test data.
|
||||||
|
|
||||||
## Testing with Aishell Test Data
|
## Testing with Aishell Test Data
|
||||||
|
|
||||||
### Download wav and model
|
### Source `path.sh` first
|
||||||
|
|
||||||
|
```bash
|
||||||
|
source path.sh
|
||||||
|
```
|
||||||
|
|
||||||
|
All binaries are under the `$SPEECHX_BUILD` directory (run `echo $SPEECHX_BUILD` to print it).
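Before running any stage, it can help to confirm that `path.sh` was actually sourced in the current shell. The sketch below is a minimal check that relies only on the `SPEECHX_BUILD` variable mentioned above; `check_speechx_env` is a hypothetical helper name and not part of speechx.

```bash
# Hypothetical helper (not part of speechx): verify that `source path.sh`
# has set SPEECHX_BUILD and that it points to an existing directory.
check_speechx_env() {
    if [ -z "${SPEECHX_BUILD:-}" ]; then
        echo "SPEECHX_BUILD is not set; run 'source path.sh' first" >&2
        return 1
    fi
    if [ ! -d "$SPEECHX_BUILD" ]; then
        echo "SPEECHX_BUILD does not point to a directory: $SPEECHX_BUILD" >&2
        return 1
    fi
    echo "binaries directory: $SPEECHX_BUILD"
}
```

Calling `check_speechx_env` right after `source path.sh` either prints the binaries directory or explains what is missing.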
|
||||||
|
|
||||||
|
### Download dataset and model
|
||||||
|
|
||||||
```
|
```
|
||||||
./run.sh --stop_stage 0
|
./run.sh --stop_stage 0
|
||||||
```
|
```
|
||||||
|
|
||||||
### compute feature
|
### Process `cmvn` and compute feature
|
||||||
|
|
||||||
```
|
```bash
|
||||||
./run.sh --stage 1 --stop_stage 1
|
./run.sh --stage 1 --stop_stage 1
|
||||||
```
|
```
|
||||||
|
|
||||||
### decoding using feature
|
If you only want to convert the `cmvn` file format, you can use this command:
|
||||||
|
|
||||||
|
```bash
|
||||||
|
./local/feat.sh --stage 1 --stop_stage 1
|
||||||
|
```
|
||||||
|
|
||||||
|
### Decoding using `feature` input
|
||||||
|
|
||||||
```
|
```
|
||||||
./run.sh --stage 2 --stop_stage 2
|
./run.sh --stage 2 --stop_stage 2
|
||||||
```
|
```
|
||||||
|
|
||||||
### decoding using wav
|
### Decoding using `wav` input
|
||||||
|
|
||||||
```
|
```
|
||||||
./run.sh --stage 3 --stop_stage 3
|
./run.sh --stage 3 --stop_stage 3
|
||||||
```
|
```
|
||||||
|
|
||||||
|
This stage uses `u2_recognizer_main` to recognize wav files.
|
||||||
|
|
||||||
|
The input is an `scp` file, which looks like this:
|
||||||
|
```text
|
||||||
|
# head data/split1/1/aishell_test.scp
|
||||||
|
BAC009S0764W0121 /workspace/PaddleSpeech/speechx/examples/u2pp_ol/wenetspeech/data/test/S0764/BAC009S0764W0121.wav
|
||||||
|
BAC009S0764W0122 /workspace/PaddleSpeech/speechx/examples/u2pp_ol/wenetspeech/data/test/S0764/BAC009S0764W0122.wav
|
||||||
|
...
|
||||||
|
BAC009S0764W0125 /workspace/PaddleSpeech/speechx/examples/u2pp_ol/wenetspeech/data/test/S0764/BAC009S0764W0125.wav
|
||||||
|
```
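Since every line of the `scp` file pairs a key with a wav path, a quick sanity check before decoding can catch missing files early. The following is a minimal sketch, assuming paths contain no whitespace; `check_scp` is a hypothetical helper and not part of speechx.

```bash
# Hypothetical helper (not part of speechx): check that every line of an scp
# file has exactly two fields ("key path") and that the wav path exists.
# Assumes paths contain no whitespace.
check_scp() {
    local scp=$1
    local key path rest
    local status=0
    # The "|| [ -n "$key" ]" part also handles a last line without a newline.
    while read -r key path rest || [ -n "$key" ]; do
        if [ -z "$key" ]; then
            continue  # skip blank lines
        fi
        if [ -z "$path" ] || [ -n "$rest" ]; then
            echo "malformed line for key: $key" >&2
            status=1
        elif [ ! -f "$path" ]; then
            echo "missing wav: $path" >&2
            status=1
        fi
    done < "$scp"
    return $status
}
```

For example, `check_scp data/split1/1/aishell_test.scp` returns 0 only if every listed wav file is present.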
|
||||||
|
|
||||||
|
If you want to recognize a single wav, you can create an `scp` file like this:
|
||||||
|
```text
|
||||||
|
key path/to/wav/file
|
||||||
|
```
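For more than a handful of files, the `key path` layout shown above can be generated rather than typed by hand. This is a minimal sketch, assuming the file name without its `.wav` suffix is an acceptable key; `make_wav_scp` is a hypothetical helper and not part of speechx.

```bash
# Hypothetical helper (not part of speechx): print one "key path" line per
# wav file found under a directory. The key is assumed to be the file name
# without the .wav suffix.
make_wav_scp() {
    local wav_dir=$1
    local f key
    # Sort the paths so the output order is stable between runs.
    find "$wav_dir" -type f -name '*.wav' | sort | while IFS= read -r f; do
        key=$(basename "$f" .wav)
        printf '%s %s\n' "$key" "$f"
    done
}

# Usage (the directory and output name are placeholders):
#   make_wav_scp /path/to/wav/dir > my_wavs.scp
```

Keys must be unique within one `scp` file, so this sketch only suits directories in which no two wav files share a base name.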
|
||||||
|
|
||||||
|
Then pass it through the `--wav_rspecifier=` parameter of the `u2_recognizer_main` binary. For the meaning of the other flags, see `help`:
|
||||||
|
```bash
|
||||||
|
u2_recognizer_main --help
|
||||||
|
```
|
||||||
|
|
||||||
|
For an example of using the `u2_recognizer_main` binary, see `local/recognizer.sh`.
|
||||||
|
|
||||||
|
### Decoding with `wav` using quant model
|
||||||
|
|
||||||
|
`local/recognizer_quant.sh` is the same as `local/recognizer.sh`, but uses the quantized model.
|
||||||
|
|
||||||
|
|
||||||
|
## Results
|
||||||
|
|
||||||
|
Please see [here](./RESULTS.md).
|
||||||
|