期刊文献+

基于动态异构的Web信息集成网页分析方法 被引量:7

Analysis method based on dynamic and heterogeneous Web pages for information integration
下载PDF
导出
摘要 将动态异构的Web信息资源进行抽取以统一的方式供用户查询和使用,是当前迫切需要解决的问题。介绍了分析相关Web页面的方法和经验,实现了自动提交HTML表单获得所需页面和对页面的信息抽取。最后,实验证明了此方法的有效性。 It was an open problem crying for being solved to integrate dynamic and heterogeneous websites for users to query in a uniform way. This paper presented a method of analyzing relevant websites, which implemented the automatic submission of HTML forms to get required websites and the information extraction of websites. The experiment performance demonstrates the efficiency and effectiveness of the method.
出处 《计算机应用研究》 CSCD 北大核心 2007年第12期204-206,共3页 Application Research of Computers
基金 国家自然科学基金资助项目(90412010)
关键词 网页分析 信息抽取 模式匹配 Web pages analysis information extraction pattern matching
  • 相关文献

参考文献10

  • 1LAENDER A H F, RIBEIRO-NETO B A. A brief survey of Web data extraction tools[ J]. SIG2MOD Record, 2002,31 (2) :84-93.
  • 2CHANG C H, LUI S C. IEPADz information extraction based on pattern discovery [C]//Proc of the 10th International Conference on World Wide Web. Hong Kong:[ s. n. ], 2001:681-688.
  • 3KUSHMERICK N. Wrapper induction: efficiency and expressiveness [ J]. Artificial Intelligence, 2000,118(1-2) :15-68.
  • 4SEYMORE K, McCALLUM A, ROSENFELD R. Learning hidden Markov model structure for information extraction [ C ]//Proc of AAAI' 99 Workshop on Machine Learning for Information Extraction. Orlando : AAAI Press, 1999:37- 42.
  • 5KUSHMERICK N, WELD D S, DOORENBOS R. Wrapper induction or information extraction [ C]//Proc of the 15th International Joint Conference on Artificial Intelligence. Japan: [ s. n. ] , 1997: 729- 735.
  • 6李保利,陈玉忠,俞士汶.信息抽取研究综述[J].计算机工程与应用,2003,39(10):1-5. 被引量:177
  • 7宋武伟.异构Web数据库集成检索系统的网页分析技术[J].情报杂志,2006,25(3):102-104. 被引量:4
  • 8钱防震 杜小勇.DLPers的资源整合.计算机科学,2003,30(10):263-266.
  • 9郭志鑫.基于本体的文档引文元数据信息抽取[J].微计算机信息,2006,22(06X):304-306. 被引量:18
  • 10李跃进,赵晶,林鸿飞.基于Internet的军事演习信息抽取系统[J].计算机工程与应用,2006,42(14):214-218. 被引量:6

二级参考文献44

  • 1娄雅斌,陶凤梅,马垣.基于“本体”的异构数据源的集成方法研究[J].微计算机信息,2005,21(10X):117-118. 被引量:20
  • 2[16]Hobbs J,Appelt D,Bear J et al.FASTUS:A Cascaded Finite-State Transducer for Extracting Information from Natural-Language Text[C].In:Roche,Schabes eds. Finite State Devices for Natural Language Processing, MIT Press,Cambridge MA, 1996
  • 3[17]Appelt D E.Introduction to Information Extraction[J].AI COMMUNICATIONS, 1999; 12(3)
  • 4[18]Yangarber R.Scenario Customization for Information Extraction[D].Ph D Thesis.New York University,2001-01
  • 5[19]Cowie J, Lehnert W.Information Extraction[J].Communications of the ACM, 1996;39(1)
  • 6[20]Grishman R Adaptive information extraction and sublangu age analysis[C].In:Proceedings of IJCAI-2001 Workshop on Adaptive Text Extraction and Mining,2001
  • 7[1]Applet D E,Israel D J.Introduction to Information Extraction Technology. A Tutorial for IJCAI-99,1999
  • 8[2]Gaizauskas R,Wilks Y.Information Extraction:Beyond Document Retrieval[J].Journal of Documentation, 1997
  • 9[3]Sager N.Natural Language Information Processing. Reading,Massachusetts:Addison Wesley, 1981
  • 10[4]Dejong G.An Overview of the FRUMP System[C].In:LEHNERT W,RINGLE M h eds. Strategies for Natural Language Processing,Lawrence Erlbaum, 1982:149~176

共引文献198

同被引文献44

引证文献7

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部