Text Normalization
The test data for text normalization is in data/textnorm_test_cases.txt. Each line contains raw_data and normed_data, separated by |.
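As a minimal sketch of how a file in this format could be read, the snippet below splits each line on the first | into a raw/normalized pair. The function name `parse_test_cases` and the sample lines are hypothetical and not taken from the repository. It also assumes one pair per line and that the raw text itself contains no |.

```python
from typing import Iterable, List, Tuple


def parse_test_cases(lines: Iterable[str], sep: str = "|") -> List[Tuple[str, str]]:
    """Split each non-empty line into a (raw_data, normed_data) pair.

    Assumption: the separator occurs once per line, between the raw text
    and the normalized text.
    """
    pairs = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        raw, normed = line.split(sep, 1)  # split on the first separator only
        pairs.append((raw.strip(), normed.strip()))
    return pairs


# Hypothetical sample lines, invented for illustration only.
sample = ["I have 2 apples|I have two apples", "", " 3 cats | three cats "]
for raw, normed in parse_test_cases(sample):
    print(repr(raw), "->", repr(normed))
```

To read the real file, the same function could be called as `parse_test_cases(open("data/textnorm_test_cases.txt", encoding="utf-8"))`, assuming the file is UTF-8 encoded.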
We use character error rate (CER) as the evaluation criterion.
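For reference, CER is commonly defined as the character-level edit distance between the reference and the hypothesis, divided by the reference length. The sketch below illustrates that definition in plain Python. It is not the scoring code used by run.sh, and the function names `edit_distance` and `cer` are hypothetical.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance: minimum number of character substitutions,
    deletions, and insertions needed to turn ref into hyp."""
    prev = list(range(len(hyp) + 1))  # distances from the empty ref prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # turning ref[:i] into an empty string takes i deletions
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character error rate of hyp measured against a non-empty ref."""
    if not ref:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)


print(cer("abcd", "abxd"))  # one substitution over four characters
```

Averaging `cer` over all test pairs gives one possible per-sentence average. Whether the reported figure is computed that way or pooled over all characters is not stated here.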
Start
Run the commands below to install sclite and get the results of the test.
cd ../../../tools
bash extras/install_sclite.sh
cd -
./run.sh
The avg CER of text normalization is: 0.00730093543235227
,-----------------------------------------------------------------.
|        | # Snt  # Wrd | Corr    Sub    Del    Ins    Err  S.Err |
|--------+--------------+-----------------------------------------|
| Sum/Avg|  125    2254 | 99.4    0.1    0.5    0.2    0.8    4.8 |
`-----------------------------------------------------------------'