Abstract
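Based on a database of term characters extracted from a database of 136,351 terms, this paper analyzes how many characters are used in terms and how they are used. The term characters are compared with the 3,500 characters of the List of Commonly Used Characters in Modern Chinese (《现代汉语常用字表》), and their usage is compared with character frequencies in a corpus of real text; on this basis, the paper identifies characters that are common in terms and characters that are specific to terms. It also reports statistics on the characteristics and usage of the first and last characters of terms in the information technology field. These properties should be of some help for automatic term extraction and for terminology research.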
In this study, we build a database of 2,359 Chinese characters drawn from 136,351 information technology terms in order to determine how many Chinese characters are used in the terms and how they are used, including their frequency and position. We also compare these characters with the 3,500 Common Characters. As a result, we identify which characters are often used in terms and find that some of them are used only in terms.
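The kind of statistics the abstract describes (overall character frequency, first and last character counts, and comparison against a common-character list) can be sketched as follows. This is only a minimal illustration, not the authors' procedure: the function name `char_stats`, the four sample terms, and the tiny stand-in for the common-character list are all hypothetical, and the sketch assumes that only characters in the basic CJK Unified Ideographs block count as term characters.

```python
from collections import Counter


def char_stats(terms, common_chars):
    """Collect character usage statistics over a list of Chinese terms.

    Hypothetical helper for illustration; not taken from the paper.
    """
    total = Counter()  # occurrences of each character across all terms
    first = Counter()  # how often a character is the first Chinese character of a term
    last = Counter()   # how often a character is the last Chinese character of a term
    for term in terms:
        # Assumption: keep only basic CJK ideographs, skipping Latin
        # letters, digits, and punctuation that may appear in a term.
        chars = [c for c in term if "\u4e00" <= c <= "\u9fff"]
        if not chars:
            continue
        total.update(chars)
        first[chars[0]] += 1
        last[chars[-1]] += 1
    # Characters used in terms but absent from the common-character list.
    outside_common = {c for c in total if c not in common_chars}
    return total, first, last, outside_common


# Toy data: four sample terms and a made-up stand-in for the real
# 3,500-character list, which is not reproduced here.
terms = ["数据库", "数据结构", "操作系统", "文件系统"]
common = set("数据操作系统文件")
total, first, last, outside = char_stats(terms, common)
```

With real data, `common` would be loaded from the full 3,500-character list, and `total` could be compared against frequencies from a general corpus to separate characters that are merely common from those specific to terms.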
Source
《术语标准化与信息技术》
2005, No. 1, pp. 41-44 (4 pages)
Terminology Standardization & Information Technology
Keywords
information technology
term
database
Chinese characters
information technology field, term, Chinese characters in terms, Chinese character