摘要
连词能够连接词语、短语、小句、句子乃至句群,连词结构短语是连词所连接对象的一种,不同的连词形成不同长度、不同关系的连词结构短语。该文根据虚词用法知识库中的连词用法,构建了连词结构短语识别规则,实现了基于规则的连词结构短语识别,并将连词用法作为特征采用条件随机场模型实现了基于统计的连词结构短语识别。实验结果表明,统计的识别效果高于规则的识别效果,连词用法能够较好地用于连词结构短语的识别中。
Conjunctions connect words, phrases, clauses, sentences and even sentence groups. The conjunction phrase is the words or phrases connected by conjuctions, bearing different lengths and relations. According to conjunction usage in the functional word usage knowledge base, the paper formulates a rule based method for the recognition of conjunction structure phrases. Meanwhile, the paper adopts the conditional random field to build a statistical model for the conjunction phrase recognition based on the conjunction usage. Results indicate that the statistical method performs better than the rule method, and conjunction usage is beneficial to the conjunction phrase recogni tion.
出处
《中文信息学报》
CSCD
北大核心
2012年第6期72-78,共7页
Journal of Chinese Information Processing
基金
国家自然科学基金资助项目(60970083)
模式识别国家重点实验室开放课题基金资助项目
河南省科技创新人才杰出青年基金资助项目(104100510026)
关键词
连词结构短语
连词用法
条件随机场
conjunction phrase
conjunction usages
conditional random fields