期刊文献+

Web数据挖掘中数据异构问题解决方法的研究 被引量:3

Research on Heterogeneous Data Problem Solving Method in the Process of Web Data Mining
下载PDF
导出
摘要 Web是动态性极强的信息源,访问、分析信息必须研究异构数据的集成问题,并选择合适的技术进行数据分析、集成和处理。怎样对Web海量的数据信息进行深层次的应用已成为数据挖掘技术的研究热点。本文介绍了XML(可扩展标记语言)在Web数据挖掘中的应用,探讨了Web数据挖掘中的数据异构问题。通过XML技术建立数据抽取模型,解决互联网上绝大多数因异构、非结构化所导致的Web数据挖掘问题。 The web was an information resource with dynamic state, to access and analyze the data we must study how to integrate heterogeneous architecture data and choose fit techniques to analyze, manage and integrate the data.How to apply plentiful web data to the field of web data mining has been brought into focus. The article discusses the data heterogeneity problem in Web by introducing the application of XML in the field of web data mining. By using XML technology a data extraction model is established for solving most of the difficulties in Web data mining caused by heterogeneous, unstructured problems on Internet.
出处 《中国科技资源导刊》 2012年第4期85-90,共6页 China Science & Technology Resources Review
基金 国家国际科技合作计划项目“异构信息知识挖掘与可视化关键技术研究”(2010DFA14390).
关键词 数据挖掘 半结构化 XML技术 数据抽取 模型 data mining semi-structured XML technology data extraction mode
  • 相关文献

参考文献10

  • 1Han Jiawei, Kamber Micheline. Data Mining: Concept and Tbchniques[M]. San Francisco: Morgan KaUfmann Publishers. Inc. 2001.
  • 2Shanmugasundaram Jayavel, Tufte Kristin, He Gang, et al. Relational Databases for Querying XML Documents: Limitations and Opportunities[C]//Edinbergh, Scotland: Proceeding of the 25th International Conference on Very Large DataBases(VLDB). 1999:302-314.
  • 3Fan W, Simeon J. Integnty Constraints for XML[J]. Journal of Computer and System Science(JCSS), 2003, 66(1):254-291.
  • 4Lee Dongwon, Chu Wesley W. Constraints-preserving Transformation from XML Document Type Definition to Relational Schema[C]//Salk Luke City, Utah: Pro- ceedings of the 19th international conference on Con- ceptual Modeling(ER), 2000:323-338.
  • 5Lee Dongwon, Mani Murali, Chu Wesley W. Conver- sions Methods between XML Schema and Relations Models[M]//Knowledge Transformation for the Se- mantic Web. Amsterdan: IOS Press, 2003.
  • 6方翔,李伟生.关系模式到XML模式的影射[J].计算机应用研究,2002,19(1):130-132. 被引量:26
  • 7Etzioni O, Mine G, Widener T. The World Wide Web: Quagmire or Gold Mine[J]. Communication of the ACM, 1996, 39(11):65-68.
  • 8DalviDinagGrayJoe.NETXML高级编程[M].英宁,林琪,费广正,译.北京:清华大学出版社,2002.
  • 9RirdanRebeccaM.ADO.NET程序设计[M].李高健,译.北京:清华大学出版社,2002:23-25.
  • 10CoyleFrankRXML、Web服务和数据革命[M].袁勤勇,莫青,译.北京:清华大学出版社,2003.

二级参考文献7

  • 1[1]T Bray,J Paoli,C M Sperberg-Mcqueen.eXtensible Markup Language (XML) 1.0 [EB/OL].http://www.w3.org/TR/RECXML.
  • 2[3]R Bourret,C Bornhovd,A Buchmann.A Generic Load/Extract Utility for Data Transfer between XML Documents and Relational Database[J] .TR-DVS99-1,DVS,Dep.CS,Darmstadt U.of Technology,Germany,1999,( 12 ).
  • 3[4]Kevin Williams,Michael Brundage,Patrick Dengler.XML Structures for Existing Databases Eleven Rules for Moving a Aelational Database to XML [EB/OL].http:∥www- 106.ibm.eom/de veloperworks/library/x-struct.
  • 4[5]Volker Turau.DB2XML:A Tool for Transforming Relational Databases into XML Documents [EB/OL] .http :∥www.informatik.fh-wiesbaden.de/~ turau/DB2XML/index.html.
  • 5[6]Henry S Thompson,David Beech,Murray Maloney,et al.XML Schema Part 0: Primer [EB/OL] .http:∥www.w3.org/TR/2001 / REC-xmlschema-0-20010502.
  • 6[7]Henry S Thompson,David Beech,Murray Maloney,et al.XML Schema Part 1: Structures [EB/OL] .http:∥www.w3.org/TR/2001 / REC-xmlschema-0-20010502.
  • 7[8]Henry S Thompson,David Beech,Murray Maloney.XML Schema Part 2: Datatypes [EB/OL].http:∥www.w3.org/TR/2001/REC-xmlschema-0-20010502.

共引文献25

同被引文献33

引证文献3

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部