期刊文献+

基于辅助短语标记的名词短语识别 被引量:2

Recognition of Chinese noun phrase based on auxiliary phrase mark
下载PDF
导出
摘要 名词短语的识别是自然语言处理领域中非常重要的子任务。而名词短语的识别性能与识别效率一直是研究人员关注的焦点,为了达到兼顾二者的目的,提出了一种基于辅助短语标记识别名词短语的方法。首先,在分析了短语不同分类体系的基础上,构建了一种映射公式,并根据该公式对不同分类体系的短语类别之间进行映射。然后,根据映射结果及短语的概率分布进行辅助短语标记的组合。实验结果表明,本文的方法在提高F值的基础上,有效地降低了系统的时间开销。 Noun Phrase Recognition is one of the most critical components in natural language processing field. The noun phrase recognition performance and its efficiency are the focus of researchers' attention. In order to combine the two elements, this paper proposes a method of recognizing noun phrases based on auxil- iary phrase mark. First, this paper presents a mapping between phrases by using the mapping formula based on the detailed analysis of the different classification system of the phrases. Then, according to the mapping results and the probability of the distribution of the auxiliary phrase mark, lots of combinations are estab- lished. Experimental results show that this method effectively reduces the time of noun phrase recognition without reducing the F-value.
出处 《沈阳航空航天大学学报》 2014年第1期52-59,共8页 Journal of Shenyang Aerospace University
基金 国家科技支撑计划项目(项目编号:2012BAH14F00) 辽宁省教育厅科学研究一般项目(项目编号:L2012056)
关键词 辅助短语标记 名词短语 映射公式 auxiliary phrase mark noun phrase mapping formula
  • 相关文献

参考文献12

  • 1梁颖红.基于多Agent的英汉文本语块识别技术研究[D].哈尔滨:哈尔滨工业大学,2006:8-14.
  • 2Angel S Y, Kam Fai Wong, et al. Effectiveness analy- sis of linguistics and corpus based noun phrase partial parsers [ C ]. In Proceedings of Natural Language Pro- cessing Pacific Rim Symposium, 1995:252 - 257.
  • 3Abney S. Partial parsing via finite-state cascades [ J ]. Natural Language Engineering, 1996,2 (4) : 337 - 344.
  • 4Ramshaw,Lance and Mitch Marcus. Text chunking u- sing transformation-based learning [ C ]. Somerset, New Jersey:Association for Computational Linguis- tics, 1995.
  • 5周雅倩,郭以昆,黄萱菁,吴立德.基于最大熵方法的中英文基本名词短语识别[J].计算机研究与发展,2003,40(3):440-446. 被引量:62
  • 6Koeling, Rob. Chunking with maximum entropy mod- else [C]. 2nd Workshop on Learning Language in Log- ic and the 4th Conference on Computational Natural Language Learning, 2000 : 139 - 141.
  • 7李荣.基于隐马尔可夫模型的汉语非嵌套名词短语识别[J].忻州师范学院学报,2004,20(5):122-124. 被引量:1
  • 8Kudo, Taku and Yuji Matumoto. Chunking with sup- port vector machines [ C ].2nd Meeting of the North American Chapter of the Association for Computation- al Linguistics on Language Technologies. Pittsburgh, Pennsylvania: Association for Computational Linguis- tics ,2001 : 1 - 8.
  • 9Sha Fei and Femando Pereira. Shallow parsing with conditional random fields [ C ]. Conference of theNorth American Chapter of the Association for Com- putational Linguistics on Human Language Technolo- gy. Edmonton, Canada: Association for Computational Linguistics, 2003 : 134 - 141.
  • 10周强,俞士汶.汉语短语标注标记集的确定[J].中文信息学报,1996,10(4):1-11. 被引量:35

二级参考文献33

  • 1周明,黄昌宁.面向语料库标注的汉语依存体系的探讨[J].中文信息学报,1994,8(3):35-52. 被引量:40
  • 2[3]K. Lari,S.J.Young. The estimation of stochastic context-free grammars using the Inside-Outside algorithm[J].Compter Speech and Language, 1990,4(1):35 - 36.
  • 3[4]M. Wang,J.Hirschbery. Automatic classification ofintonnationalphraseboundaries[J].Compter Speech and Language,1992,6(2):176 - 196.
  • 4周强,计算机研究与运用,1993年
  • 5李子云,汉语句法规则,1992年
  • 6房玉清,实用汉语语法,1992年
  • 7吴竞存,现代汉语句法结构与分析,1992年
  • 8范晓,汉语的短语,1991年
  • 9团体著者,世界汉语教学,1989年,1期
  • 10朱德熙,语法答问,1985年

共引文献92

同被引文献22

  • 1郭永辉,杨红卫,马芳,王炳锡.基于粗糙集的基本名词短语识别[J].中文信息学报,2006,20(3):14-21. 被引量:2
  • 2邹宏梅,王挺.SVM和基于转换的错误驱动学习相结合的汉语组块识别[J].计算机工程与科学,2007,29(4):91-94. 被引量:4
  • 3游斓.基于转换的基本名词短语识别[C].复旦大学·政学者论文集,2002:236-245.
  • 4梁颖红,赵铁军,翟舒.规则和边界统计相结合的英语基本名词短语识别[C].语言计算与基于内容的文本处理——全国第七届计算语言学联合学术会议论文集,2003.
  • 5Gareev R, Tkachenko M, Solovyev V, et al. Intro- ducing baselines for russian named entity recogni-tionE C . Computational Linguistics and Intelligent Text Processing. Springer Berlin Heidelberg, 2013 : 329 - 342.
  • 6Lafferty J, McCallum A, Pereira F C N. Conditional random fields:probabilistic models for segmenting and labeling sequence data[J]. 2001:139 - 141.
  • 7Xun E, Huang C, Zhou M. A unified statistical model for the identification of English baseNP E CJ. Proceedings of the 38th Annual Meeting on Association for Computational Linguistics. Associ- ation for Computational Linguistics, 2000: 109 - 116.
  • 8. Sang E F. Noun phrase recognition by system com- bination [ C ]. Proceedings of the 1 st North Ameri- can chapter of the Association for Computational Linguistics conference. Association for Computa- tional Linguistics ,2000:50 - 55.
  • 9徐艳华.基于语料库的基本名词短语研究[J].语言文字应用,2008(1):120-125. 被引量:5
  • 10代翠,周俏丽,蔡东风,杨洁.统计和规则相结合的汉语最长名词短语自动识别[J].中文信息学报,2008,22(6):110-115. 被引量:16

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部