期刊文献+

面向电子交易的商品供应信息抽取模型 被引量:1

Product Information Extraction Model for Electronic Trading
下载PDF
导出
摘要 随着互联网的普及和电子商务的发展,形成了大量的商品供应信息资源。从企业门户网站和电子市场的海量商品网页中抽取出供应信息资源,是电子交易迫切需要解决的问题。在分析信息抽取过程和商品网页结构的基础上,构建了基于网页DOM树的商品供应信息抽取模型。该模型由网页采集层、HTML文档解析层、信息抽取层和结果处理层组成,并重点对信息抽取层的抽取规则进行了探讨。 With the development of Internet and electronic commerce, there exists tremendous of product supplying information resources. The crucial problem of electronic trading is the ability to extract useful resources from the huge product pages of enterprises portal and electronic marketplaces. Based on the analysis of information extraction process and the structure of product web page, a product information extraction model based on DOM tree is established. This model is composed by page gathering layer, document parsing layer, information extracting layer and result processing layer. And the extraction rules of information extracting layer is highlighted.
作者 傅魁 聂规划
出处 《武汉理工大学学报(信息与管理工程版)》 CAS 2007年第7期96-99,共4页 Journal of Wuhan University of Technology:Information & Management Engineering
基金 国家自然科学基金资助项目(70572079)
关键词 电子交易 信息抽取模型 DOM 电子商务 electronic trading information extraction model DOM electronic commerce
  • 相关文献

参考文献6

二级参考文献43

  • 1[1]Joachim Hammer, Hector Garcia-Molina, Jumghoo Cho, et al.Extracting Semistructured Information from the Web [C].Proceedings of the First Workshop on Management of Semistructured Data, Tucson, Arizona, 1997.18-25.
  • 2[2]Arnaud Sahuguet, Fabien Azavant. Building Light-weight Wrap-pers for Legacy Web Data-sources Using W4F[C]. International Conference on Very Large Databases (VLDB), Edinburgh,Scotland, 1999.738-741.
  • 3[3]S Soderland. Learning Information Extraction Rules for Semi-structured and FreeText [ J ]. Machine Learning, 1999, 1-44.
  • 4[4]N Kushmerick, D Weld, B Doorenbos. Wrapper Induction for Information Extraction [ C ]. In Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence (IJCAI-97), Osaka, Japan, 1997.729-737.
  • 5[5]Ion Muslea, Steve Minton, Craig Knoblock. Stalker: Learning Extraction Rules for Semistructured, Web-based Information Sources [ C ]. AAAI-98 Workshop on "AI & Information Integration", Madison, 1998.74-81.
  • 6[6]Ion Muslea. Extraction Patterns: From Information Extraction to Wrapper Induction[ R]. Technical Report, Information Sciences Institute, University of Southern Californi, 1998.
  • 7[16]Hobbs J,Appelt D,Bear J et al.FASTUS:A Cascaded Finite-State Transducer for Extracting Information from Natural-Language Text[C].In:Roche,Schabes eds. Finite State Devices for Natural Language Processing, MIT Press,Cambridge MA, 1996
  • 8[17]Appelt D E.Introduction to Information Extraction[J].AI COMMUNICATIONS, 1999; 12(3)
  • 9[18]Yangarber R.Scenario Customization for Information Extraction[D].Ph D Thesis.New York University,2001-01
  • 10[19]Cowie J, Lehnert W.Information Extraction[J].Communications of the ACM, 1996;39(1)

共引文献292

同被引文献5

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部