期刊文献+

基于归约的汉语最长名词短语识别方法 被引量:4

Chinese Maximal Noun Phrase Recognition Based on Reduction
下载PDF
导出
摘要 该文提出了最长名词短语(MNP)的操作性定义,分析了其构造和分布特征,并设计了一种基于baseNP归约的识别方法,利用MNP结构特性及起始有定成分、语义核心等语言学特征,缓解了最长名词短语长距离依赖与模型观察窗口受限的矛盾。开放测试取得了88.68%的正确率和89.21%的召回率;归约方法全面提升了识别性能,特别是将多词结构的调和平均值提高1%,优化幅度达6%以上,并且对长距离复杂结构有着更好的识别效果。 This paper proposes an operational definition of Maximal Noun Phrase(MNP), and then analyzes its structure and distribution features. A MNP recognition based on baseNP reduction is also designed, which exploits the structural characteristics of MNP as well as the linguistic features such as initial definite references and semantic heads. This method eases the conflict between the long distance dependency of MNP and the limits of observation windows in classical models. The experiment indicates a good precision of 88.68% and a recall of 89.21%. The reduction method comprehensively improves system performance, especially it improves Fl-score by 1% and optimal margin by 6 % on multiword MNP, showing its efficiency in complex MNP recognition.
作者 钱小飞 侯敏
出处 《中文信息学报》 CSCD 北大核心 2015年第2期40-48,共9页 Journal of Chinese Information Processing
基金 上海市高校青年教师培养资助计划(shu11053) 国家语言资源监测与研究中心科研项目(YZYS08-04)
关键词 最长名词短语 识别 归约 基本名词短语 maximal noun phrase recognize reduction baseNP
  • 相关文献

参考文献14

  • 1Voutilainen A. NPTool: a detector of English nounphrases [C]//Proceedings of the Workshop on VeryLarge Corpora: Academic and Industrial Perspectives,1993.
  • 2李文捷,周明,潘海华,等.基于语料库的中文最长名词短语的自动提取[C]//陈力为,袁琦,计算语言学进展与应用.北京:清华大学出版社,1995,119-124.
  • 3周强,孙茂松,黄昌宁.汉语最长名词短语的自动识别[J].软件学报,2000,11(2):195-201. 被引量:37
  • 4Guiping Zhang, Wenjing Lang, Qiaoli Zhou, et al. I-dentification of Maximal-Length Noun Phrases Basedon Maximal-Length Preposition Phrases in Chinese[C]//Proceedings of IALP 2010 : 65-68.
  • 5Changhao Yin. Identification of Maximal Noun Phrasein Chinese: Using the Head of Base Phrases [D].POSTECH, Korea,2005.
  • 6Xue-Mei Bai, Jin-Ji Li, Dong-U Kim, et al. Identifica-tion of Maximal-Length Noun Phrases Based on Ex-panded Chunks and Classified Punctuations in Chinese[Cj//Proceedings of the 21st ICCPOL,2006 : 268-276.
  • 7Kuang-hua Chen. Extracting noun phrases from large-scale texts: a hybrid approach and its automatic evalu-ation[C]//Proceedings of the 32nd ACL, 1994.
  • 8代翠,周俏丽,蔡东风,杨洁.统计和规则相结合的汉语最长名词短语自动识别[J].中文信息学报,2008,22(6):110-115. 被引量:16
  • 9鉴萍,宗成庆.基于双向标注融合的汉语最长短语识别方法[J].智能系统学报,2009,4(5):406-413. 被引量:9
  • 10Steven Abney. Syntactic affixation and performancestructures[C] //Proceeding of Views on Phrase Struc-ture, 1990.

二级参考文献58

共引文献64

同被引文献24

引证文献4

二级引证文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部