PaddleSpeech/dataset/primewords
Hui Zhang cc7096dd27 examples/dataset to dataset 3 years ago
..
README.md examples/dataset to dataset 3 years ago

README.md

Primewords

This free Chinese Mandarin speech corpus set is released by Shanghai Primewords Information Technology Co., Ltd. The corpus is recorded by smart mobile phones from 296 native Chinese speakers. The transcription accuracy is larger than 98%, at the confidence level of 95%. It is free for academic use.

The mapping between the transcript and utterance is given in JSON format.