期刊文献+

文本信息抽取技术研究

Research on text information extraction technology
下载PDF
导出
摘要 提出一种基于案例分析的文本数据抽取方法,通过将知识进行特征化表示,借助"用户特征—案例特征—案例知识"三者之间的映射关系和概念模块间的知识关联,完成复杂信息的知识抽取,同时引入增量式案例知识学习模型,有效地避免了因人工干预导致的知识拓展的不连续性,提高了抽取过程的识别效率. In this paper, we propose an information extraction technology based on case study, which realizes the knowledge extraction of complex information by means of characteristically expressing knowledge and via the help of the mapping relations among "the user characteristic-case knowledge-case characteristic" and the associations of the knowledge module. Meantime, the bootstrapping knowledge learning model is introduced to effectively avoid the discontinuousness of knowledge expansion caused by human interference~ thus, the recognition efficiency is greatly enhanced.
出处 《辽宁师专学报(自然科学版)》 2014年第2期1-3,65,共4页 Journal of Liaoning Normal College(Natural Science Edition)
关键词 文本信息 信息抽取技术 知识抽取 text information information extraction technology knowledge extraction
  • 相关文献

参考文献2

二级参考文献13

  • 1欧健文,董守斌,蔡斌.模板化网页主题信息的提取方法[J].清华大学学报(自然科学版),2005,45(S1):1743-1747. 被引量:70
  • 2张志刚,陈静,李晓明.一种HTML网页净化方法[J].情报学报,2004,23(4):387-393. 被引量:57
  • 3王琦,唐世渭,杨冬青,王腾蛟.基于DOM的网页主题信息自动提取[J].计算机研究与发展,2004,41(10):1786-1792. 被引量:81
  • 4高强,张敬之,耿桦,潘金贵.基于重复模式的Web信息抽取[J].计算机科学,2007,34(4):210-212. 被引量:6
  • 5Freitag D. Machine learning for information extraction in information domains[J]. Machine Learning, 2000,39 (2/3): 169-202.
  • 6Gupta, Kaiser G, Neistadt D, et al. DOM-based content extraction of HTML documents[C]// Proc. of the 12th Int'l World Wide Web Conf. New York:ACM Press, 2003:207-214.
  • 7Gupta S, Kaiser G E, Grimm P, et al. Automating Content Extraction of HTML Documents [J]. World Wide Web Journal.
  • 8Deng C,Yu S P,Wen J R,et al. VIPS:a Vision Based Page Segmentation algorithm[R]. MSR-TR-2003-79. 2003.
  • 9AGRAWAL R, IMIELINSKI T, SWAMI A. Mining association rules between sets of items in large databases[A]. Peter Btmeman and Sushil Jajodia Proceedings of ACM SIGMOD International conference on Management of Data[C]. Washington, D C:[-s. n. ],1993. 207--216.
  • 10HIDBER C. Online association rule mining[A]. Proceedings of ACM SIGMOD International Conference on Management of Data [C]. Washington, D C: [-s. n. ],1999. 145--156.

共引文献22

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部