
Text Normalization

The test data for text normalization is data/textnorm_test_cases.txt. Each line holds one test case, with | separating raw_data from normed_data.
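As an illustration of the file format, the sketch below reads such a file into (raw, normed) pairs. It is not the repository's loader: the function names `parse_case` and `load_cases` are hypothetical, and it assumes one test case per line, UTF-8 encoding, and that the first | on a line is the separator.

```python
def parse_case(line, sep="|"):
    """Split one test-case line into (raw_text, normed_text).

    Assumption: the first separator on the line divides the raw text
    from the normalized text.
    """
    raw, normed = line.rstrip("\n").split(sep, 1)
    return raw, normed


def load_cases(path, sep="|"):
    """Read every non-empty line of a test-case file as a (raw, normed) pair."""
    with open(path, encoding="utf-8") as f:
        return [parse_case(line, sep) for line in f if line.strip()]
```

For example, `load_cases("data/textnorm_test_cases.txt")` would return a list of pairs that can then be passed to a normalizer and a scorer.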

We use character error rate (CER) as the evaluation criterion.
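For readers unfamiliar with the metric, CER is the character-level edit distance between the reference (normed_data) and the system output, divided by the reference length. The sketch below shows that definition in plain Python. It is only an illustration: the scoring in this example is done with sclite, and the function names here are hypothetical.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two strings, counted in characters."""
    prev = list(range(len(hyp) + 1))  # distances from the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # turning i reference characters into an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(ref, hyp):
    """Character error rate: edit distance divided by the reference length."""
    if not ref:
        raise ValueError("reference must be non-empty")
    return edit_distance(ref, hyp) / len(ref)
```

A perfect output gives `cer(ref, hyp) == 0.0`; one wrong character in a four-character reference gives 0.25.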

Start

Run the commands below to install sclite and get the test results.

cd ../../../tools
bash extras/install_sclite.sh
cd -
./run.sh

The avg CER of text normalization is: 0.00730093543235227

      ,-----------------------------------------------------------------.
      |        | # Snt  # Wrd | Corr    Sub    Del    Ins    Err  S.Err |
      |--------+--------------+-----------------------------------------|
      | Sum/Avg|  125    2254 | 99.4    0.1    0.5    0.2    0.8    4.8 |
      `-----------------------------------------------------------------'
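
The columns of the summary row are related: Err is the sum of Sub, Del, and Ins, and Corr is 100 minus Sub and Del. A small sketch that checks these relations on the rounded percentages is shown below. The helper name `check_sclite_row` and the tolerance are assumptions chosen to absorb rounding to one decimal place.

```python
def check_sclite_row(corr, sub, dele, ins, err, tol=0.2):
    """Check that a row of sclite percentages is internally consistent.

    Err should equal Sub + Del + Ins, and Corr should equal 100 - Sub - Del,
    up to a tolerance that accounts for one-decimal rounding.
    """
    err_ok = abs((sub + dele + ins) - err) <= tol
    corr_ok = abs((100.0 - sub - dele) - corr) <= tol
    return err_ok and corr_ok


# Sum/Avg row from the table above.
print(check_sclite_row(corr=99.4, sub=0.1, dele=0.5, ins=0.2, err=0.8))
```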