摘要
引入有限状态转录机技术,参考Penn树库发展的思想,通过规则分析的方法综合利用词性标注结果、识别关联词、标点、词表映射及进行组块分析的方法将英语复句进行切分简化处理,最终结果以关联词及其论元的形式表示。
This paper introduces the technology of Finite State Transducer, and references to the thinking of development of Penn Treebank, through the analysis of rules and the results of comprehensive utilization of POS tagging, recognition of discourse connectives, punctuations, vocabulary mapping, and chunk to simplify the complicated sentences. Final results are expressed in the form of proposition.
出处
《现代图书情报技术》
CSSCI
北大核心
2008年第3期40-44,共5页
New Technology of Library and Information Service
基金
国家科技支撑计划基金项目"多语言信息服务环境关键技术研究与应用"(项目编号:2006BAH03B02)的研究成果之一
关键词
规则
论元
英语复句
关联词
有限状态转录机
Rule-based Argument English complicated sentences Discourse connectives Finite state transducer