
Rule-based Recognition of Vietnamese Named Entities (cited by: 15)
Abstract: Named entity recognition (NER) is an important task in information extraction; it mainly covers the automatic recognition of organization names, location names, and person names. Research on English and Chinese NER began relatively early and relies mainly on rule-based and statistical methods, but few studies address Vietnamese NER, and hardly any have been carried out in China. This paper analyzes the linguistic characteristics of Vietnamese named entities, classifies them and gives them a formal representation, and proposes a rule-based method for recognizing them. Experimental results show that the method achieves relatively high recognition accuracy.
Affiliation: Luoyang University of Foreign Languages (洛阳外国语学院)
Source: Journal of Chinese Information Processing (《中文信息学报》; indexed in CSCD and the Peking University Core Journals list), 2014, No. 5, pp. 198-205, 214 (9 pages in total)
Funding: Project funded by the China-ASEAN Research Center (201205)
Keywords: named entity recognition; Vietnamese; rule

References (13)

  • [1] Tri Tran Q, Thao Pham T X, Hung Ngo Q, et al. Named Entity Recognition in Vietnamese documents[J]. Progress in Informatics, 2007, 4: 5-13.
  • [2] Daniel Jurafsky, James H. Martin; translated by Feng Zhiwei, Sun Le. Speech and Language Processing (Chinese edition: 自然语言处理综论)[M]. Beijing: Publishing House of Electronics Industry, 2005.
  • [3] Yu Hongkui, Zhang Huaping, Liu Qun, Lü Xueqiang, Shi Shuicai. Chinese named entity recognition based on cascaded hidden Markov models[J]. Journal on Communications, 2006, 27(2): 87-94.
  • [4] Zhang Xiaoyan, Wang Ting, Chen Huowang. A Chinese named entity recognition method based on hybrid statistical models[J]. Journal of Chinese Information Processing, 2009, (2).
  • [5] Chen, Hsin-Hsi, Yang Changhua & Ying Lin. Learning Formulation and Transformation Rules for Multilingual Named Entities[C]// Proceedings of ACL-2003.
  • [6] Chieu, Hai Leong & Hwee Tou Ng. Named Entity Recognition with a Maximum Entropy Approach[C]// Proceedings of CoNLL-2003.
  • [7] Dat Ba Nguyen, Son Huu Hoang, Son Bao Pham & Thai Phuong Nguyen. Named Entity Recognition for Vietnamese[J]. ACIIDS 2010, Part II, LNAI 5991, pp. 205-214.
  • [8] Klein, Dan, Joseph Smarr, Huy Nguyen & Christopher D. Manning. Named Entity Recognition with Character-Level Models[C]// Proceedings of CoNLL-2003.
  • [9] Mayfield, James, Paul McNamee & Christine Piatko. Named Entity Recognition using Hundreds of Thousands of Features[C]// Proceedings of CoNLL-2003.
  • [10] Thao Pham T. X., Tri T. Q., Ai Kawazoe, Dien Dinh & Nigel Collier. Construction of Vietnamese Corpora for Named Entity Recognition[C]// Conference RIAO 2007. Pittsburgh, PA, U.S.A., May 30-June 1, 2007.
