期刊文献+

基于条件随机场的英文地理行政实体识别 被引量:5

GPE-entity Recognition Based on Conditional Random Fields
下载PDF
导出
摘要 采用基于条件随机场的方法,对ACE评测的英文语料中的地理行政类型实体(Geographical Political Enti-ties,GPE)及其子类型进行识别。提出一种从ACE语料中选取的特征集,并根据不同的特征组合对GPE识别的贡献与其它特征集进行比较,实验表明该特征集能取得较高的召回率和准确率。 This paper detects Geographical Political Entities (GPE) and it subtypes from the English corpus of Automatic Content Extraction (ACE) evaluation, based on Conditional Random Fields (CRFs). A feature set is extracted from the ACE corpus, and contributions of different feature sets to the detection of GPE entities are evaluated in the experiments. The results show that the feature set extracted in this paper can get higher rate of recall and accuracy.
出处 《现代图书情报技术》 CSSCI 北大核心 2009年第2期51-55,共5页 New Technology of Library and Information Service
基金 “863”计划重点项目“跨媒体搜索关键技术研究及服务产品开发”(项目编号:2006AA010105) 国家自然科学基金项目“基于语义分析和统计的自动主题标引研究”(项目编号:60872133) 北京市属高等学校人才强教计划项目“创新团队-智能搜索引擎和文本挖掘”(项目编号:PXM2007_014224_044677)的研究成果之一
关键词 ACE评测 地理行政实体 实体识别 条件随机场 特征选择 ACE GPE Entity detection CRF Feature selection
  • 相关文献

参考文献10

  • 1Linguistic Data Consortium. ACE (Automatic Content Extraction ) English Annotation Guidelines for Entities Version 6. 1 [ EB/OL]. [ 2008 - 03 - 29 ]. http ://projects. ldc. upenn, edu/ace.
  • 2ZHOU GD, SU J. Named Entity Recognition Using an HMM-based Chunk Tagger[ C ]. In: Proceedings of the 40^th Annual Meeting of the Association for Computation Linguistics, Philadelphia. USA : Association for Computational Linguistics,2002:473 -480.
  • 3Bender O, Ney H. Maximum Entropy Models for Named Entity Recognition [ C ]. In: Proceedings of the Conference on Computational Natural Language Learning, Edmonton, Canada. USA: Association for Computational Linguistics, 2003 : 148 - 151.
  • 4Lafferty J, McCallum A, Pereira F. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Seqquence Data [J]. The Journal of Manchine Learning Research,2001, ICML01 : 282 - 289.
  • 5Hacioglu K, Douglas B, Chen Y. Detection of Entity Mentions Occurling in English and Chinese Text [ C ]. In : Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, Cannada. USA : Association for Computational Linguistics ,2005 (10) : 379 - 386.
  • 6The ACE 2008 Evaluation Plan. Assessment of Detection and Recognition of Entities and Relations Within and Across Documents [ EB/ OL]. [2008 -05 -07 ]. http://www, nist. gov/speeeh/tests/aee/ ace08/doc/.
  • 7Sutton C, McCallum A, Rohanimanesh K. Dynamic Conditional Random Fields: Factorized Probabilistic Models for Labeling and Segmenting Sequence Data [ J ]. The Journal of Machine Learning Research,2007,8 ( 3 ) :693 - 723.
  • 8廖先桃.CRF理论、工具包的使用及在NE上的应用[R/OL].[2008- 04 -02 3. http ://ir. hit. edu. cnfphpwebsite/index, php? module = doeuments&JAS_ DoeumentManager_ op = downloadFile &JAS_File_id = 215.
  • 9张海雷,曹菲菲,陈文亮,任飞亮,王会珍,朱靖波.基于多层次特征集成的中文实体指代识别[J].中文信息学报,2007,21(5):126-130. 被引量:1
  • 10Florian R, Hassan H, Jing H, et al. Factorizing Complex Models : A Case Study in Mention Detection [ J ]. Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics. 2006 (9) :473 -480.

二级参考文献14

  • 1刘非凡,赵军,吕碧波,徐波,于浩,夏迎炬.面向商务信息抽取的产品命名实体识别研究[J].中文信息学报,2006,20(1):7-13. 被引量:47
  • 2http://chasen.org/~taku/software/CRF++/
  • 3The ACE 2007 (ACE07) Evaluation Plan v1.3.http://www.nist.gov/speech/tests/ace07/doc/.
  • 4K.Hacioglu,B.Douglas,Y.Chen.Detection of Entity Mentions Occurring in English and Chinese Text[A].In:Proceedings of HLT/EMNLP-2005[C].Vancouver:2005.379-386.
  • 5R.Florian,H.Hassan,A.Ittycheriah et al.A Statistical Model for Multilingual Entity Detection and Tracking[A].In:Proceeding of HLT-NAACL 2004[C].Boston:2004,1-8.
  • 6G.D.Zhou,J.Su.Named Entity Recognition using an HMM-based Chunk Tagger[A].In:Proceeding of the 40th Annual Meeting of the ACL[C].Philadelphia:2002,473-480.
  • 7吴雪军,朱靖波,王会珍,等.Co-Training的机器学习方法在中文机构名识别中的应用[A].全国第七届计算语言学联合学术会议[C].2003.85-90.
  • 8J.Lafferty,A.McCallum,F.Pereira.Conditional Random Fields:Probabilistic Models for Segmenting and Labeling Sequence Data[A].International Conference on Machine Learning (ICML01)[C].2001.282-289.
  • 9W.L.Chen,Y.J.Zhang,H.Isahara.Chinese Named Entity Recognition with Conditional Random Fields[A].In:Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing[C].Sydney:2006.118-121.
  • 10R.Florian,H.Jing,N.Kambhatla et al.Factorizing Complex Models:A Case Study in Mention Detection[A].In:Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the ACL[C].Sydney:2006.473-480.

同被引文献40

引证文献5

二级引证文献30

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部