PaddleSpeech/examples/dataset/primewords
Hui Zhang 9e99f99b3c add thchs30, aidatatang; 4 years ago

Primewords

This free Mandarin Chinese speech corpus is released by Shanghai Primewords Information Technology Co., Ltd. It was recorded on smartphones by 296 native Chinese speakers. The transcription accuracy exceeds 98% at a 95% confidence level. The corpus is free for academic use.

The mapping between transcripts and utterances is given in JSON format.
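
As a rough illustration of how such a mapping might be consumed, the sketch below builds a lookup from audio file name to transcript. It assumes the JSON is an array of records with a `file` key and a `text` key; these key names, the helper `load_transcripts`, and the sample entries are assumptions for illustration, so check them against the actual transcript file before use.

```python
import json


def load_transcripts(json_text, file_key="file", text_key="text"):
    """Build an {audio file name: transcript} dict from a JSON array of records.

    The key names are assumptions; pass different ones if the corpus
    file uses another schema.
    """
    records = json.loads(json_text)
    mapping = {}
    for record in records:
        mapping[record[file_key]] = record[text_key]
    return mapping


# Hypothetical two-entry sample, not taken from the corpus.
sample = json.dumps(
    [
        {"file": "utt_0001.wav", "text": "你好"},
        {"file": "utt_0002.wav", "text": "谢谢"},
    ],
    ensure_ascii=False,
)

mapping = load_transcripts(sample)
print(len(mapping), "utterances loaded")
```

For the real corpus, read the transcript file with `open(path, encoding="utf-8")` and pass its contents to the same function.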