# ChangeLog
## [0.10.5] (2020-11-22)
* Add the pinyin for `还君明珠`.
## [0.10.4] (2020-10-08)
* Correct the pinyin of some phrases.
## [0.10.3] (2020-07-05)
* Add the pinyin for `还珠`.
## [0.10.2] (2019-10-26)
* Correct the pinyin of some phrases.
## [0.10.1] (2019-07-06)
* Fix some pinyin data.
## [0.10.0] (2019-05-10)
* Add `cc_cedict.txt`: pinyin data from [cc-cedict.org](https://cc-cedict.org/). Thanks [@hanabi1224]
* Correct the pinyin of some phrases
## [0.9.2] (2019-04-06)
* Fix tone marks that were placed on the wrong letter in the pinyin of some phrases
## [0.9.1] (2019-03-31)
* Correct the pinyin of a batch of phrases:
  * `鸟事`
  * `虮虱相吊`
  * `别鹤离鸾`
  * `年华垂暮`
  * `本枝百世`
  * `操戈同室`
  * The pinyin of `丢` in some phrases
## [0.9.0] (2019-02-23)
* Add `腌臢: ā zā`
* Add the reading `cháo yáng` for `朝阳`
* Add `土地`, `领地`, and `基地`
## [0.8.5] (2018-12-26)
* Correct the pinyin of `油炸` and `洗发`
## [0.8.4] (2018-09-16)
* Correct the pinyin of `步履蹒跚`
* Correct the pinyin of `长` in some phrases
## [0.8.3] (2018-08-04)
* Correct some readings of `查` and `大` (via [ee1ded4])
## [0.8.2] (2018-07-28)
* Correct the reading of `有一只` (via [330b348])
## [0.8.1] (2018-07-28)
* Correct several readings of `一` (via [6e3b9eb])
* Fix some pinyin entries that contained `xh` (via [ae12df98])
## [0.8.0] (2018-07-08)
* Correct the pinyin of `称雨道晴` (via [67412ab])
* Correct the pinyin of `干` in some phrases (via [38474cb])
* Add the pinyin for `时长` (via [c40b965])
## [0.7.3] (2018-06-10)
* Correct the pinyin of `一语中的` and `一语中人` (via [3b62ed3])
## [0.7.2] (2018-06-10)
* Correct some pinyin data (via [af5d783])
## [0.7.1] (2018-06-04)
* Correct the pinyin of `负债累累` and `经纶济世` (via [#16])
## [0.7.0] (2018-05-27)
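* Add `zdic_cibs.txt` and `zdic_cybs.txt` (via [#13])
  * `zdic_cibs.txt`: pinyin data from the Chinese word dictionary of [汉典网 (zdic.net)](http://www.zdic.net)
  * `zdic_cybs.txt`: pinyin data from the idiom dictionary of [汉典网 (zdic.net)](http://www.zdic.net)
* Add `large_pinyin.txt`, built from `zdic_cibs.txt` and `zdic_cybs.txt` (via [#13])
* Correct some readings (via [#10], [#11], [#15])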
## [0.6.0] (2018-03-11)
* Revert the pinyin data added in [#3](https://github.com/mozillazg/phrase-pinyin-data/pull/3) (it contained quite a few errors)
## [0.5.1] (2017-10-25)
* Fix a batch of phrases with a missing ā or an incorrect dī (via [#7][#7])
## [0.5.0] (2017-07-09)
* Add the pinyin for `还贷` (thanks [@zhuangh](https://github.com/zhuangh))
## [0.4.1] (2017-04-10)
* Correct the pinyin of `朝阳` and `昂昂自若` (via [e6d6d27][e6d6d27], [6e7ea16][6e7ea16])
## [0.4.0] (2017-03-22)
* Add pinyin data for more than 20,000 phrases (via [fc50fcd][fc50fcd]; thanks to [@onsunsl][@onsunsl] for sharing the 43,400 pinyin entries they collected: [#3][#3]).
## [0.3.1] (2017-03-13)
* Correct the pinyin of `斯事体大`
## [0.3.0] (2017-03-12)
* Add `overwrite.txt` for adding or correcting pinyin data
* Correct the pinyin data for `便宜`, `所长`, and `打开天窗说亮话`
* Add `朝阳区`
## [0.2.0] (2017-03-04)
* Add a batch of pinyin entries (via [04de9f7][04de9f7]).
## 0.1.0 (2017-03-04)
* Initial Release
[0.10.4]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.10.3...v0.10.4
[0.10.3]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.10.2...v0.10.3
[0.10.2]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.10.1...v0.10.2
[0.10.1]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.10.0...v0.10.1
[0.10.0]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.9.2...v0.10.0
[0.9.2]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.9.1...v0.9.2
[0.9.1]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.9.0...v0.9.1
[0.9.0]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.8.5...v0.9.0
[0.8.5]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.8.4...v0.8.5
[0.8.4]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.8.3...v0.8.4
[0.8.3]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.8.2...v0.8.3
[0.8.2]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.8.1...v0.8.2
[0.8.1]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.8.0...v0.8.1
[0.8.0]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.7.3...v0.8.0
[0.7.3]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.7.2...v0.7.3
[0.7.2]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.7.1...v0.7.2
[0.7.1]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.7.0...v0.7.1
[0.7.0]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.6.0...v0.7.0
[0.6.0]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.5.0...v0.6.0
[0.5.1]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.5.0...v0.5.1
[0.5.0]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.4.1...v0.5.0
[0.4.1]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.4.0...v0.4.1
[0.4.0]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.3.1...v0.4.0
[0.3.1]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.3.0...v0.3.1
[0.3.0]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.2.0...v0.3.0
[0.2.0]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.1.0...v0.2.0
[04de9f7]: https://github.com/mozillazg/phrase-pinyin-data/commit/04de9f7f520e2f2188cb4c468c30d6fb811a20ba
[fc50fcd]: https://github.com/mozillazg/phrase-pinyin-data/commit/fc50fcd7faa94205096d582fc7a1b31265943a85
[@onsunsl]: https://github.com/onsunsl
[#3]: https://github.com/mozillazg/phrase-pinyin-data/pull/3
[e6d6d27]: https://github.com/mozillazg/phrase-pinyin-data/commit/e6d6d270900fdca32ccbe9a414ea4642e537e522
[6e7ea16]: https://github.com/mozillazg/phrase-pinyin-data/commit/6e7ea167dee0c812514f0bf9701ff5c103a566af
[#7]: https://github.com/mozillazg/phrase-pinyin-data/pull/7
[#10]: https://github.com/mozillazg/phrase-pinyin-data/pull/10
[#11]: https://github.com/mozillazg/phrase-pinyin-data/pull/11
[#13]: https://github.com/mozillazg/phrase-pinyin-data/pull/13
[#15]: https://github.com/mozillazg/phrase-pinyin-data/pull/15
[#16]: https://github.com/mozillazg/phrase-pinyin-data/pull/16
[af5d783]: https://github.com/mozillazg/phrase-pinyin-data/commit/af5d7831b0e84e4a5306e304b3b2da3268e35f17
[3b62ed3]: https://github.com/mozillazg/phrase-pinyin-data/commit/3b62ed303f129868c7ccee4f2d5e44dcea7d30d4
[67412ab]: https://github.com/mozillazg/phrase-pinyin-data/commit/67412abbf8570ac80a41dc012f228c0864823a62
[38474cb]: https://github.com/mozillazg/phrase-pinyin-data/commit/38474cb91dedd27b3d51b39811704f3d045837b1
[c40b965]: https://github.com/mozillazg/phrase-pinyin-data/commit/c40b9653ea2ab066d1c0606e9e07dd4225ff2485
[6e3b9eb]: https://github.com/mozillazg/phrase-pinyin-data/commit/6e3b9eb805ed3e3a5955c179e752ec5e1293216f
[ae12df98]: https://github.com/mozillazg/phrase-pinyin-data/commit/ae12df98438a508249bdf591334b6415bb5ccf8d
[330b348]: https://github.com/mozillazg/phrase-pinyin-data/commit/330b3481ba350de07b580991a5a8b7a83aaefde9
[ee1ded4]: https://github.com/mozillazg/phrase-pinyin-data/commit/ee1ded4938624ac4ce3dc7991ab370e09dbd745c
[@hanabi1224]: https://github.com/hanabi1224
[0.10.5]: https://github.com/mozillazg/phrase-pinyin-data/compare/v0.10.4...v0.10.5