You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
PaddleSpeech/examples/other/g2p/README.md

941 B

G2P

For g2p, we use BZNSYP's phone label as the ground truth and we delete silence tokens in labels and predicted phones.

You should Download BZNSYP from its Official Website and extract it. Assume the path to the dataset is ~/datasets/BZNSYP.

We use WER as an evaluation criterion.

Start

Run the command below to get the results of the test.

./run.sh

The avg WER of g2p is: 0.024169315564825305

     ,--------------------------------------------------------------------.
     |                         ./exp/g2p/text.g2p                         |
     |--------------------------------------------------------------------|
     | SPKR   | # Snt    # Wrd  | Corr    Sub    Del    Ins    Err  S.Err |
     | Sum/Avg|  9996   299181  | 97.6    2.4    0.0    0.0    2.4   49.2 |
     `--------------------------------------------------------------------'