摘要
中文语句中广泛存在缺省现象,缺省项识别的准确与否关系到缺省消解结果,因此对缺省项的识别很重要。介绍了一种基于规则的中文缺省项识别方法,即采用CTB语料构建基准语料库,以动词驱动为核心提出规则来获得缺省项的结构化信息。实验结果显示,基于规则的中文缺省项识别方法具有可行性。
The phenomenon of ellipsis is widely existed in Chinese and the results of ellipsis resolution are directly im- pacted by correctness of the ellipsis identification. So ellipsis identification is very important. We introduced a learning approach of rule-base ellipsis identification in Chinese. That approach constructs a corpus-base by marking all sentences in CTB manually and then proposes a verb-driven method to extract rules to get syntax structure information. Experi- mental results shows that our method is feasible.
出处
《计算机科学》
CSCD
北大核心
2011年第12期255-257,273,共4页
Computer Science
基金
国家自然科学基金(90920004
60970056
61070123
61003153)
江苏省高校自然科学重大基础研究项目(08KJA520002)资助
关键词
缺省识别
规则
动词
Ellipsis identification,Rule-based,Verbs