期刊文献+

A Unified Active Learning Framework for Biomedical Relation Extraction 被引量:1

A Unified Active Learning Framework for Biomedical Relation Extraction
原文传递
导出
摘要 Supervised machine learning methods have been employed with great success in the task of biomedical relation extraction. However, existing methods are not practical enough, since manual construction of large training data is very expensive. Therefore, active learning is urgently needed for designing practical relation extraction methods with little human effort. In this paper, we describe a unified active learning framework. Particularly, our framework systematically addresses some practical issues during active learning process, including a strategy for selecting informative data, a data diversity selection algorithm, an active feature acquisition method, and an informative feature selection algorithm, in order to meet the challenges due to the immense amount of complex and diverse biomedical text. The framework is evaluated on protein- protein interaction (PPI) extraction and is shown to achieve promising results with a significant reduction in editorial effort and labeling time. Supervised machine learning methods have been employed with great success in the task of biomedical relation extraction. However, existing methods are not practical enough, since manual construction of large training data is very expensive. Therefore, active learning is urgently needed for designing practical relation extraction methods with little human effort. In this paper, we describe a unified active learning framework. Particularly, our framework systematically addresses some practical issues during active learning process, including a strategy for selecting informative data, a data diversity selection algorithm, an active feature acquisition method, and an informative feature selection algorithm, in order to meet the challenges due to the immense amount of complex and diverse biomedical text. The framework is evaluated on protein- protein interaction (PPI) extraction and is shown to achieve promising results with a significant reduction in editorial effort and labeling time.
出处 《Journal of Computer Science & Technology》 SCIE EI CSCD 2012年第6期1302-1313,共12页 计算机科学技术学报(英文版)
基金 supported by the National Natural Science Foundation of China under Grant No.60973104 the National Basic Research 973 Program of China under Grant No.2012CB316301
关键词 biomedical relation extraction active learning unified framework biomedical relation extraction, active learning, unified framework
  • 相关文献

参考文献32

  • 1Faro A, Giordano D, Spampinato C. Combining literature text mining with microarray data: Advances for system biol- ogy modeling. Brief Bioinform, 2012, 13(1): 61-82.
  • 2Hunter L, Cohen K. Biomedical language processing: What's beyond PubMed? Mol Cell, 2006, 21(5): 589-594.
  • 3Huang M, Ding S, Wang H, Zhu X. Mining physical protein- protein interactions from the literature. Genome Biology, 2008, 9(Suppl 2): S12.
  • 4Katrenko S, Adriaans P. Learning relations from biomedical corpora using dependency trees. In Lecture Notes in Com- puter Science, Tuyls K, Westra R, Saeys T et al. (eds.), Springer-Verlag, 2007, 4366, pp.61-80.
  • 5Miwa M, Saetre R, Miyao Y, Tsujii J. A rich feature vector for protein-protein interaction extraction from multiple corpora. In Proc. the Conference on Empirical Methods in Natural Language Processing, August 2009, pp.121-130.
  • 6Yang Z, Lin H, Li Y. BioPPISVMExtractor: A protein- protein interaction extractor for biomedical literature using SVM and rich feature sets. Journal of Biomedical Informat- ics, 2010, 43(1): 88-96.
  • 7Li Y, Hu X, Lin H, Yang Z. Learning an enriched representa- tion from unlabelled data for protein-protein interaction ex- traction. BMC Bioinformatics, 2010, 11(Suppl 2): S7.
  • 8Landeghem S, Abed T, Saeys Y, Peer Y. Discriminative and informative features for biomolecular text mining with ensem- ble feature selection. Bioinformatics, 2010, 26(18): 554-560.
  • 9Bui Q, Katrenko S, Sloot P. A hybrid approach to extract protein-protein interactions. Bioinformatics, 2011, 27(2): 259-265.
  • 10van Landeghem S, Saeys Y, Deu Baets B, van De Peer Y. Extracting protein-protein interactions from text using rich feature vectors and feature selection. In Proc. the 3th In- ternational Symposium on Semantic Mining in Biomedicine, September 2008, pp.77-84.

同被引文献6

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部