You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
PaddleSpeech/examples/other/g2p
david.95 61422e71e3
add tool to compare test badcase and add run examples,test=tts
2 years ago
..
README.md Added pre-install doc for G2P and TN modules and updated the dependency version of pypinyin (#2364) 2 years ago
compare_badcase.py add tool to compare test badcase and add run examples,test=tts 2 years ago
get_g2p_data.py [asr] logfbank with dither (#1179) 3 years ago
path.sh [TTS]fix praatio version, test=tts (#1158) 3 years ago
run.sh Update run.sh 3 years ago
test_g2p.py change nprocs to ngpu, add aishell3/voc1 3 years ago

README.md

G2P

For g2p, we use BZNSYP's phone label as the ground truth and we delete silence tokens in labels and predicted phones.

You should Download BZNSYP from its Official Website and extract it. Assume the path to the dataset is ~/datasets/BZNSYP.

We use WER as an evaluation criterion.

Start

Run the command below to get the results of the test.

cd ../../../tools
bash extras/install_sclite.sh
cd -
./run.sh

The avg WER of g2p is: 0.024075726733983775

     ,--------------------------------------------------------------------.
     |                         ./exp/g2p/text.g2p                         |
     |--------------------------------------------------------------------|
     | SPKR   | # Snt    # Wrd  | Corr    Sub    Del    Ins    Err  S.Err |
     | Sum/Avg|  9996   299181  | 97.6    2.4    0.0    0.0    2.4   49.0 |
     `--------------------------------------------------------------------'