PaddleSpeechTask

A speech library for a series of related front-end and back-end tasks.

Environment

  • python==3.6.13
  • paddle==2.1.1
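
The pinned versions above can be checked from Python before running anything else. The following is a minimal sketch, not part of this repository: the helper names `python_matches` and `paddle_version` are hypothetical, and it assumes only that the installed package exposes `paddle.__version__`.

```python
import sys

# Versions pinned in the environment list above.
REQUIRED_PYTHON = (3, 6, 13)
REQUIRED_PADDLE = "2.1.1"


def python_matches(version_info=sys.version_info, required=REQUIRED_PYTHON):
    """Return True when the interpreter version equals the pinned one."""
    return tuple(version_info[:3]) == tuple(required)


def paddle_version():
    """Return the installed paddle version string, or None if paddle is missing."""
    try:
        import paddle  # imported lazily so this file loads without paddle
    except ImportError:
        return None
    return paddle.__version__


if __name__ == "__main__":
    print("python ok:", python_matches())
    print("paddle ok:", paddle_version() == REQUIRED_PADDLE)
```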

Chinese/English punctuation restoration task

Dataset: data

  • Chinese data sources: data/chinese
    1. iwlst2012zh
    2. 平凡的世界 (the novel "Ordinary World")

  • English data sources: data/english
    1. iwlst2012en

  • See data/README.md for how to obtain the iwlst2012 data.
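
Punctuation restoration is commonly framed as sequence labeling: the punctuation is stripped from the text, and each word is labeled with the mark that followed it. The sketch below illustrates that framing for whitespace-separated English text. It is an assumption-laden example, not this repository's preprocessing: the label names (`O`, `COMMA`, `PERIOD`, `QUESTION`) and the function `text_to_pairs` are hypothetical, and Chinese text would need character-level or segmented tokens instead of `str.split`.

```python
# Hypothetical label set; the labels this repository actually uses are not stated here.
PUNCT_TO_LABEL = {",": "COMMA", ".": "PERIOD", "?": "QUESTION"}
NO_PUNCT = "O"


def text_to_pairs(text):
    """Turn punctuated, space-separated text into (word, label) pairs.

    Each word is labeled with the punctuation mark that directly follows it,
    or NO_PUNCT when no known mark follows.
    """
    pairs = []
    for token in text.split():
        label = NO_PUNCT
        # Peel a single trailing mark off the word, if there is one.
        if len(token) > 1 and token[-1] in PUNCT_TO_LABEL:
            label = PUNCT_TO_LABEL[token[-1]]
            token = token[:-1]
        pairs.append((token.lower(), label))
    return pairs


if __name__ == "__main__":
    for word, label in text_to_pairs("Hello, how are you?"):
        print(word, label)
```

A model is then trained to predict the label column from the word column alone.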

Models: speechtask/punctuation_restoration/model

1. BLSTM model

2. BertLinear model

3. BertBLSTM model
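
Whichever of the three models is used, a sequence-labeling formulation of the task leaves one predicted label per input token, and those labels still have to be merged back into text. The following is a minimal sketch of that last step under stated assumptions: the label names and the function `restore_punctuation` are hypothetical and are not taken from this repository's code.

```python
# Hypothetical mapping from predicted labels to marks; "O" means no punctuation.
LABEL_TO_PUNCT = {"COMMA": ",", "PERIOD": ".", "QUESTION": "?"}


def restore_punctuation(words, labels):
    """Append the predicted mark to each word and join the words with spaces."""
    if len(words) != len(labels):
        raise ValueError("words and labels must have the same length")
    # Unknown labels (including "O") add nothing to the word.
    pieces = [word + LABEL_TO_PUNCT.get(label, "") for word, label in zip(words, labels)]
    return " ".join(pieces)


if __name__ == "__main__":
    words = ["hello", "how", "are", "you"]
    labels = ["COMMA", "O", "O", "QUESTION"]
    print(restore_punctuation(words, labels))
```

For Chinese output the tokens would be joined without spaces and the mapping would use full-width marks.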