A dynamic programming algorithm for identifying definitions of abbreviations
Abstract: Biological text mining is an active research topic in data mining. This paper proposes a new alignment algorithm, based on dynamic programming, for abbreviation-definition recognition in biological text mining. Existing algorithms can only recognize abbreviations whose characters all come from the characters of the corresponding definition; the proposed algorithm removes this restriction and can therefore recognize more abbreviation-definition pairs. Experimental results show that the algorithm achieves higher recall and precision than existing abbreviation-definition recognition algorithms.
Source: Journal of Anhui University (Natural Science Edition), 2008, No. 6, pp. 40-43 (4 pages). Indexed in: CAS, Peking University Core Journals.
Keywords: abbreviation-definition recognition; dynamic programming; biological text mining; precision; recall
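
The abstract does not give the recurrence or the scoring scheme, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It aligns an abbreviation against a candidate definition with dynamic programming and lets an abbreviation character stay unmatched at a penalty, which is one way to handle abbreviations whose characters do not all appear in the definition. The function name `align_score` and the three score values are assumptions made for illustration.

```python
def align_score(abbr: str, definition: str,
                match_start: int = 2,   # assumed bonus: match at a word start
                match_inside: int = 1,  # assumed bonus: match inside a word
                skip_abbr: int = -1) -> int:  # assumed penalty: unmatched abbr char
    """Best in-order alignment score of `abbr` against `definition`.

    Definition characters may be skipped for free; abbreviation
    characters may be skipped at a cost of `skip_abbr`.
    """
    a = abbr.lower()
    d = definition.lower()
    n, m = len(a), len(d)

    # dp[i][j] = best score aligning a[:i] with d[:j]
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dp[i][0] = dp[i - 1][0] + skip_abbr  # nothing left to match against

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            best = max(dp[i][j - 1],              # skip a definition char
                       dp[i - 1][j] + skip_abbr)  # leave abbr char unmatched
            if a[i - 1] == d[j - 1]:
                at_word_start = j == 1 or not d[j - 2].isalnum()
                bonus = match_start if at_word_start else match_inside
                best = max(best, dp[i - 1][j - 1] + bonus)
            dp[i][j] = best
    return dp[n][m]


if __name__ == "__main__":
    print(align_score("DP", "dynamic programming"))
    # The trailing "3" has no counterpart in this candidate definition,
    # yet the pair still receives a positive score.
    print(align_score("HNF3", "hepatocyte nuclear factor"))
```

In practice a candidate would be accepted only when its score clears some threshold relative to the abbreviation length; that threshold, like the scores above, would have to be tuned on annotated data.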
