摘要
汉字是表义文字 ,具有丰富的语义内容 ,汉字是一个有限的封闭集 ,它的数目是有限的 ,而汉语的词是一个开放系统 ,它是无限的。本文以“字义基元化、词义组合化”为基本思想 ,从字义着手 ,研究二字词词义组合。首先以经过整理的《现代汉语规范字典》、《现代汉语词典》和《同义词词林》为资源 ,从中自动搜索、抽取出二字词词义组合 ,建立汉字字义、词义知识库 ,然后再采用《同义词词林》的语义体系 ,通过语义相关度等的计算确定它们的组合类型 。
As an ideography of abundant semantic contents, the Chinese character is a closed set with limited number while the Chinese word is an open system which is unlimited.Following the idea of' character sense elementalization and word sense combinationalization', this paper researches the combination of word sense with the character sense as the starting point. Firstly, it establishes the database of character sense and word sense by searching automatically the combinations of two character words' word sense from three main dictionaries.Then it defines the combination types through the calculating of semantic relativity.The author hopes this paper can provide references for the research of the combination of two character words' word sense.
出处
《中文信息学报》
CSCD
北大核心
2001年第6期1-6,26,共7页
Journal of Chinese Information Processing
基金
山西省自然科学基金 (2 0 0 0 10 32 )