You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
59 lines
2.4 KiB
59 lines
2.4 KiB
3 years ago
|
# [CC-CEDICT](https://cc-cedict.org/wiki/)
|
||
|
|
||
|
What is CC-CEDICT?
|
||
|
CC-CEDICT is a continuation of the CEDICT project.
|
||
|
The objective of the CEDICT project was to create an online, downloadable (as opposed to searchable-only) public-domain Chinese-English dictionary.
|
||
|
CEDICT was started by Paul Andrew Denisowski in October 1997.
|
||
|
For the most part, the project is modeled on Jim Breen's highly successful EDICT (Japanese-English dictionary) project and is intended to be a collaborative effort,
|
||
|
with users providing entries and corrections to the main file.
|
||
|
|
||
|
|
||
|
## Parse CC-CEDICT to Json format
|
||
|
|
||
|
1. Parse to Json
|
||
|
|
||
|
```
|
||
|
run.sh
|
||
|
```
|
||
|
|
||
|
2. Result
|
||
|
|
||
|
```
|
||
|
exp/
|
||
|
|-- cedict
|
||
|
`-- cedict.json
|
||
|
|
||
|
0 directories, 2 files
|
||
|
```
|
||
|
|
||
|
```
|
||
|
4c4bffc84e24467fe1b2ea9ba37ed6b6 exp/cedict
|
||
|
3adf504dacd13886f88cc9fe3b37c75d exp/cedict.json
|
||
|
```
|
||
|
|
||
|
```
|
||
|
==> exp/cedict <==
|
||
|
# CC-CEDICT
|
||
|
# Community maintained free Chinese-English dictionary.
|
||
|
#
|
||
|
# Published by MDBG
|
||
|
#
|
||
|
# License:
|
||
|
# Creative Commons Attribution-ShareAlike 4.0 International License
|
||
|
# https://creativecommons.org/licenses/by-sa/4.0/
|
||
|
#
|
||
|
# Referenced works:
|
||
|
|
||
|
==> exp/cedict.json <==
|
||
|
{"traditional": "2019\u51a0\u72c0\u75c5\u6bd2\u75c5", "simplified": "2019\u51a0\u72b6\u75c5\u6bd2\u75c5", "pinyin": "er4 ling2 yi1 jiu3 guan1 zhuang4 bing4 du2 bing4", "english": "COVID-19, the coronavirus disease identified in 2019"}
|
||
|
{"traditional": "21\u4e09\u9ad4\u7d9c\u5408\u75c7", "simplified": "21\u4e09\u4f53\u7efc\u5408\u75c7", "pinyin": "er4 shi2 yi1 san1 ti3 zong1 he2 zheng4", "english": "trisomy"}
|
||
|
{"traditional": "3C", "simplified": "3C", "pinyin": "san1 C", "english": "abbr. for computers, communications, and consumer electronics"}
|
||
|
{"traditional": "3P", "simplified": "3P", "pinyin": "san1 P", "english": "(slang) threesome"}
|
||
|
{"traditional": "3Q", "simplified": "3Q", "pinyin": "san1 Q", "english": "(Internet slang) thank you (loanword)"}
|
||
|
{"traditional": "421", "simplified": "421", "pinyin": "si4 er4 yi1", "english": "four grandparents, two parents and an only child"}
|
||
|
{"traditional": "502\u81a0", "simplified": "502\u80f6", "pinyin": "wu3 ling2 er4 jiao1", "english": "cyanoacrylate glue"}
|
||
|
{"traditional": "88", "simplified": "88", "pinyin": "ba1 ba1", "english": "(Internet slang) bye-bye (alternative for \u62dc\u62dc[bai2 bai2])"}
|
||
|
{"traditional": "996", "simplified": "996", "pinyin": "jiu3 jiu3 liu4", "english": "9am-9pm, six days a week (work schedule)"}
|
||
|
{"traditional": "A", "simplified": "A", "pinyin": "A", "english": "(slang) (Tw) to steal"}
|
||
|
```
|