期刊文献+

HMM和CRFs在信息抽取应用中的比较研究 被引量:12

Comparative Study on HMM and CRFs Applying in Information Extraction
下载PDF
导出
摘要 在比较HMM和CRFs数学理论的基础上,分别提出基于HMM词角色标注和基于CRFs字角色标注的人名实体抽取模型,并通过开放性测试和实践应用两次验证、比较两者的有效性,从而在实践中证明从理论比较中得出的结论:CRFs较之HMM更适合于解决序列标注或对象分类问题。 This paper brings forward two models for person - name entity extraction based on the comparison of math theory between HMM and CRFs, one using word role label based HMM and the other using character role label based CRFs, then validates and compares the effect of both by open - testing and applying in practice, and thereby proves in practice that CRFs is fitter for sequence labeling and object classifying than HMM.
作者 王昊 邓三鸿
出处 《现代图书情报技术》 CSSCI 北大核心 2007年第12期57-63,共7页 New Technology of Library and Information Service
关键词 HMM CRFS 信息抽取 人名实体抽取 角色标注 特征 HMM CRFs Information extraction Person- name entity extraction Role label Feature
  • 相关文献

参考文献10

  • 1傅爱平.计算语言学和自然语言信息处理研究和应用综述[EB/OL].http://www.cass.net.cn/chinese/s18_yys/yingyong/courses/nlpbase.htm.
  • 2王昊.基于层次模式匹配的命名实体识别模型[J].现代图书情报技术,2007(5):62-68. 被引量:8
  • 3Zhou G D, Su J. Named Entity Recognition Using an HMM - based Chunk Tagger [ C ]. In : Proceedings of the 40th Annual Meeting of the ACL. Philadelphia, PA. , USA, 2002:473 -480.
  • 4Settles B. Biomedical Named Entity Recognition Using Conditional Random Fields and Rich Feature Sets[ C ]. In:Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Application(NLPBA). Geneva,Switzerland, 2004 : 103 - 107.
  • 5詹卫东.词汇分析(二)--从词串到词性标记串[EB/OL].[2007- 10 -01 ]. http://ccl.pku.edu.cn/doubtfire/course/ computational linguistics/contents/Chapter_07_2_pdf_format.pdf.
  • 6钱晶,张杰,张涛.基于最大熵的汉语人名地名识别方法研究[J].小型微型计算机系统,2006,27(9):1761-1765. 被引量:26
  • 7laputa.最大熵模型与自然语言处理[EB/OL].[2007-10-01].http://www.cs.caltech.edu/-weixl/research/read/summary/MaxEnt2.ppt.
  • 8黄昌宁,赵海.由字构词--中文分词新方法[C].中国中文信息学会第六次全国会员代表大会暨成立二十五周年学术会议,2006.
  • 9郭家清,蔡东风,王智超,刘浩公.一种基于条件随机场的人名识别方法[J].通讯和计算机(中英文版),2007,4(2):22-25. 被引量:6
  • 10CRF + + - 0.49 [ CP/OL]. [ 2007 - 10 - 01 ]. http://soureeforge.net.

二级参考文献13

  • 1王睿,张洁,张由仪,于禛,姚天昉.基于混合模型的中文命名实体抽取系统[J].清华大学学报(自然科学版),2005,45(S1):1908-1914. 被引量:10
  • 2李荣陆,王建会,陈晓云,陶晓鹏,胡运发.使用最大熵模型进行中文文本分类[J].计算机研究与发展,2005,42(1):94-101. 被引量:95
  • 3王胜,朱明.基于最大熵马尔可夫模型的地址信息抽取[J].计算机工程与应用,2005,41(21):192-194. 被引量:7
  • 4Chen H H, Ding Y W, Tsa S C, et al. Description of the NITU System Used for MET2. In: Proc. of 7th Message Understanding Conference, 1998
  • 5Black W J, Rinaldi F, Mowatt D. Facile: Description of the NE System Used For MUC - 7. In.. Proc. of 7th Message Understanding Conf, 1998
  • 6Fukumoto J, Shimohata M, Masui F,et al. Electric Industry: Description of the Oki System as Used for MET-2. In: Proc. of 7th Message Understanding Conf, 1998
  • 7Berners- Lee T, Fischetti M,Dertouzos T M. Weaving the Web: The Original Design and Ultimate Destiny of the World Wide Web by its Inventor. Harper, San Francisco. 1999
  • 8Zhou G D, Su J. Named Entity Recognition using an HMM - based Chunk Tagger. In: Proc. of the 40th Annual Meeting of the ACL, Philadelphia, PA 2002, 473 - 480
  • 9Bender O, Och F J, Ney H. Maximum Entropy Models for Named Entity Recognition, Proceedings of the Conference on Computational Natural Language Learning. Edmonton, Canada, 2003, 148- 151
  • 10丁丰,袁保宗.一种基于最大熵原理的汉语实体提取方法[J].铁道学报,2001,23(5):34-37. 被引量:1

共引文献37

同被引文献196

引证文献12

二级引证文献75

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部