Agent-Based Structured Information Extraction from Web Pages (基于Agent的Web页面结构化信息抽取)
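Abstract: Considering the data characteristics of current Web sites, this paper takes the position at which each information item appears in a page as the extraction path and, using the PAT tree technique, proposes a multi-agent cooperative model for automatic information extraction. The model automatically analyzes the data features of sample pages, inductively learns the data pattern of the whole site, and generates extraction rules that guide subsequent extraction. Experimental results show that the model extracts structured information from Web pages with high efficiency.
Source: Journal of Computer Research and Development (《计算机研究与发展》), indexed in EI, CSCD, and the Peking University Core Journals list; 2007, Issue z2, pp. 344-349 (6 pages)
Funding: National Natural Science Foundation of China (70371052)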
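
The abstract names PAT trees as the tool for discovering a site's data pattern but gives no algorithmic detail. As a rough illustration of the general idea only, and not the paper's method, the sketch below reduces a page to its tag sequence and finds the longest repeated run of tags, which is the kind of regularity a PAT (suffix) tree exposes. A plain sorted suffix list stands in for the PAT tree here, and the function names `tag_tokens` and `longest_repeat` are hypothetical.

```python
import re

def tag_tokens(html: str) -> list[str]:
    """Reduce a page to a token sequence: tag names, with text runs as 'TEXT'."""
    tokens = []
    pattern = r"<\s*(/?)\s*([a-zA-Z][a-zA-Z0-9]*)[^>]*>|([^<]+)"
    for m in re.finditer(pattern, html):
        if m.group(2):
            # A tag: keep only its name and whether it is a closing tag.
            tokens.append(f"<{m.group(1)}{m.group(2).lower()}>")
        elif m.group(3).strip():
            # Non-blank text between tags.
            tokens.append("TEXT")
    return tokens

def longest_repeat(tokens: list[str]) -> list[str]:
    """Longest token run that occurs at least twice (occurrences may overlap).

    Assumption: a naive sorted suffix list replaces the PAT tree; it is
    quadratic or worse and only suitable for small sample pages.
    """
    suffixes = sorted(range(len(tokens)), key=lambda i: tokens[i:])
    best: list[str] = []
    for a, b in zip(suffixes, suffixes[1:]):
        # Length of the common prefix of two lexicographically adjacent suffixes.
        n = 0
        while (a + n < len(tokens) and b + n < len(tokens)
               and tokens[a + n] == tokens[b + n]):
            n += 1
        if n > len(best):
            best = tokens[a:a + n]
    return best

if __name__ == "__main__":
    sample = "<ul><li><b>A</b>1</li><li><b>B</b>2</li></ul>"
    print(longest_repeat(tag_tokens(sample)))
```

On a page with two list rows of identical markup, the repeated run corresponds to one record's tag template; a real extractor would still need to handle overlapping repeats, optional fields, and noise.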