摘要
本文探讨Unicode大字符集汉字属性整理与展示平台"字网"的建设,梳理了需要整合的已有汉字属性材料,指出当前汉字属性研究与整理中的问题。针对这些问题,提出多维度属性标注集标注法的应用方法,探讨了汉字属性系统化、秩序化展示的规则网络建构原则,以及用数据挖掘发现隐性联系的非线性联想网络的建构原则。最后讨论了"字网"在辞书编纂、修订中的应用。
This article discusses the Chinese character property arrangement and display platform of Unicode large character set--CharacterWeb. Chinese character property material was collated and problems in the current Chinese character attribute research were pointed out. To solve these problems, we proposed a multi-dimensional property annotation set, and discussed the principles of network construction about the display of Chinese character property systematically and orderly, as well as using the data to find the construction principle of non-linear associative network of recessive association. Finally, we discuss the application of CharacterWeb in lexicographical compilation and amendment.
出处
《语言文字应用》
CSSCI
北大核心
2011年第2期125-134,共10页
Applied Linguistics
基金
教育部重大攻关项目"中华大字符集创建工程"(编号:04JDZ00032)
国家社科基金项目(编号:08BYY046)
国家语委课题(编号:YB115-23
YB115-34
YB115-39)
关键词
Unicode字集
“字网”
汉字属性整理与展示
Unicode character set
CharacterWeb
Chinese character property arrangement and display