期刊文献+

多策略数据挖掘系统的分析与设计 被引量:1

Analysis and Design of Multi-Strategy Data Mining System
下载PDF
导出
摘要 为了满足数据规模的膨胀和分析需求的增长,在对数据挖掘系统的发展史进行回顾的基础上,分析了国内外典型数据挖掘系统的特点,设计了一个多策略的数据挖掘系统。并针对数据挖掘面临的大规模海量数据的处理问题,为系统引入和设计了算法插件思想、缓冲区处理技术、基于XML(Extensib le M arkup Lan-guage)语言的配置文件和相应的并行处理技术。最后讨论了系统今后开发过程中需要注意算法更新及评估的问题。 The development of the database technology and the comprehensive application of the dataoase management system result in the data expanding and the increasing of the analysis requirement. Many kinds of data mining system and business intelligence software are developed continuously. The paper reviews the development history of the data mining system, analyzes the characteristic of the typical data mining system, and designs a multi strategy data mining system, in dealing with the large scale data, introduces and designs the algorithm groupware idea, buffer processing technology, configuration file based on the XML (Extensible Markup Language) and the parallel processing technology. Finally, discuss the future problem during the development of the system.
出处 《吉林大学学报(信息科学版)》 CAS 2006年第6期610-617,共8页 Journal of Jilin University(Information Science Edition)
基金 国家自然科学基金资助项目(60275026)
关键词 数据挖掘 海量数据处理 算法插件 data mining largescale data algorithm groupware
  • 相关文献

参考文献18

  • 1FAYYAD U,SHAPIRO G P,SMYTH P,et al.Advances in Knowledge Discovery and Data Mining[M].Califonia:AAAI/MIT Press,1996.
  • 2BRACHMAN R J,ANAND T.The Process of Knowledge Discovery in Databases:A Human-Centered Approach[C] // Advance in Knowledge Discovery and Data Mining.Cambridge:AAAI/MIT Press,1996:37-58.
  • 3REINARTZ THOMAS.Focusing Solutions for Data Mining[M].Berlin Heidelberg:Springer-Verlag,1999:1-44.
  • 4IMIELINSKI T,MANNILA H.A Database Perspective on Knowledge Discovery[J].Communications of the ACM,1996,39(11):58-64.
  • 5VIRMANI A.Second Generation Data Mining:Concepts and Implementation[D].NJ,USA:Rutger University,1998.
  • 6GROSSMAN ROBERT.Supporting the Data Mining Process with Next Generation Data Mining Systems[EB/OL].(1998-05)[2006-10].http://www.lac.uic.edu/~grossman/papers/esj-98htm.
  • 7GOEBEL M,GRUENWALD L.A Survey of Data Mining and Knowledge Discovery Software Tools[C] //ACM SIGKDD.San Diego,California,USA:SIGKDD Explorations,1999:20-33.
  • 8GREGORY,PIATETSKY SHAPRIO.Knowledge Discovery in Databases:10 Years after[C] //ACM SIGKDD.Boston,USA:SIGKDD Explorations,2000:59-61.
  • 9LU Hong-jun.Seamless Integration of DM with DBMS and Applications[EB/OL].(2001-06)[2006-10].http://www.cs.uct.hk/~luhj/ps/pakdd01.pdf.PAKDD01.
  • 10HAN Jia-wei.Data Mining-Current Status and Research Directions.Seminar Presentation[EB/OL].(2001-02)[2006-10].http://db.cs.sfu.ca/sections/publication/slides/slides.html.

二级参考文献28

  • 1汪芸.CORBA技术及其应用[M].东南大学出版社,1999..
  • 2史忠植.高级人工智能[M].北京:科学出版社,1997.60-100.
  • 3屠立德.操作系统基础[M].北京:清华大学出版社,1998..
  • 4王军.数据库知识发现的研究:博士论文[M].北京:中国科学院软件研究所,1997..
  • 5谭宁.面向对象知识处理系统:硕士论文[M].合肥:中国科学技术大学,1999..
  • 6(英)Harjinder S Gill 王仲谋(译).数据仓库-客户/服务器计算指南[M].北京:清华大学出版社,1997..
  • 7张颖.数据采掘的研究与应用:博士论文[M].北京:中国科学院计算技术研究所,1999..
  • 8Han Jiawei.Data Mining Concepts and Techniques(中译本)[M].Morgan Kaufmann,2001..
  • 9Li B, Shasha D. Free Parallel Data Mining. SIGMOD '98, Seattle, WA,1998-06.
  • 10George F. DMS: A Parallel Data Mining Server. VLDB, 1998:702.

共引文献25

同被引文献8

  • 1管恩政,常晓宇,王喆,周春光.快速频繁序列模式挖掘算法[J].吉林大学学报(理学版),2005,43(6):768-772. 被引量:7
  • 2SUNITA SARAWAGI,SHIBY THOMAS,RAKESH AGRAWAL.Integrating Association Rule Mining with Relational Database Systems:Alternatives and Implications[J].Data Mining and Knowledge Discovery,2004 (2/3):89-125.
  • 3RAKESH AGRAWAL,KYUSEOK SHIM.Developing Tightly-Coupled Data Mining Applications on a Relational Database System[C] //Proc of the 2nd Int'l Conference on Knowledge Discovery in Databases and Data Mining.Portland:[s.n.],1996:287-291.
  • 4AMIR NETZ,SURAJIT CHAUDHURI,JEFF BERNHARDT,et al.Integration of Data Mining and Relational Databases[C]//Proceedings of the 26th International Conference on Very Large Databases.Cairo,Egypt:[s.n.],2000:719-722.
  • 5GUPTA H,MCLAREN I,VELLA A.A Step Beyond Data Mining:Database mining[C] //Proc of Conference and Workshop on New Approaches in Computing.Coventry,UK:[s.n.],1997:246-251.
  • 6HAN J,FU Y,WANG W,et al.DMQL:A Data Mining Query Language for Relational Databases[C] //Proc of the Workshop on Research Issue on Data Mining and Knowledge Discovery.Montreal:[s.n.],1996:196-202.
  • 7HAN J,KOPERSKI K,STEFANOVIC GEOMIINER N.A System Prototype for Spatial Data Mining[C] //Proc 1997 ACMSIGMOD Conf on Management of Data (SIGMOD'97).Tucson:ACM-SIGMOD,1997:36-43.
  • 8高韬,谢昆青,马修军,陈冠华.SDML:基于空间数据库的空间数据挖掘语言[J].北京大学学报(自然科学版),2004,40(3):465-472. 被引量:7

引证文献1

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部