期刊文献+

基于G^4 ICCS系统的数据挖掘并行算法 被引量:3

Data Mining Parallel Algorithm Based on G^4 ICCS
下载PDF
导出
摘要 针对传统决策树SPRINT(Scalable Parallelizable Induction of Decision Trees)算法不能处理海量地学数据挖掘的问题,设计实现了基于G4ICCS(Geology Geography Geochemistry Geophysics Information Cloud ComputingSystem)的决策树并行分类算法PSPRINT。该算法使用哈希表存储连续属性分割点两侧的数据记录,为并行节点的分割提供依据,在MapReduce架构下解决了海量地学数据挖掘问题。实验结果表明,在模拟的云计算环境下,决策树并行算法可以处理海量地学数据分类问题,并获得较好的稳定性和较高的处理速度。 For the traditional decision tree SPRINT (Scalable Parallelizable Induction of Decision Trees ) algorithm cannot solve the problem of mass geoscience data mining, the paper designed and realized PSPRINT algorithm. It is a decision tree parallel classification algorithm based on G4 ICES (Geology Geography Geochemistry Geophysics Information Cloud Computing System). The algorithm uses hash table to save data record on both sides of continuous attributes pointof division, providing basis for the division of parallel node, and solved mass geoscience data mining problem. The experimental results show that the decision tree parallel algorithm can deal with the classification problem of mass geoscience data under the simulated environment of cloud computing. And the algorithm has better stability and processing speed.
出处 《吉林大学学报(信息科学版)》 CAS 2013年第3期324-327,共4页 Journal of Jilin University(Information Science Edition)
基金 吉林省"十二五"矿产资源规划预测基金资助项目(3R212H104422)
关键词 地学G4ICCS系统 数据挖掘 决策树算法 并行 geology geography geochemistry geophysics information cloud computing system(G4ICCS) data mining decision tree algorithm parallel
  • 相关文献

参考文献9

二级参考文献63

共引文献292

同被引文献27

  • 1曾衍伟,龚健雅.空间数据质量控制与评价方法及实现技术[J].武汉大学学报(信息科学版),2004,29(8):686-690. 被引量:67
  • 2杨澍,初禹,杨湘奎,娄本君.层次分析法(AHP)在三江平原地质环境质量评价中的应用[J].地质通报,2005,24(5):485-490. 被引量:25
  • 3郎显宇,陆忠华,迟学斌.一种基于“基因表达谱”的并行聚类算法[J].计算机学报,2007,30(2):311-316. 被引量:11
  • 4路来君,韩冰.地学G4I系统中数据集成技术研究[D] .长春:吉林大学地球科学学院,2011:137-145.
  • 5SAATY T L.Decision-Making with the AHP:Why is the Principal Eigenvector Necessary[J] .European Journal of Operational Research,2003,145(21):85-91.
  • 6ZHANG Jie,TANG Hong,SU Kai.Research on Methods of Effectiveness Evaluation[M] .Beijing:Defense Industry Press,2009.
  • 7HE Bin,ZHAO Hongzhou,YU Saifa.Research on Evaluation Electromagnetic Environment Effects of Tactical Communications Training Based on AHP[C] ∥Proceedings of the Third Electromagnetic Environment Effects and Protection Technology Symposium.[S.l.] :Scientific Research Publishing,2012:245-246.
  • 8Zhao W, Ma H, He Q. Parallel K-Means Clustering Basedon MapRe- duce[ C ]//Proc. of Cloud Computing,2009:674 - 679.
  • 9Apache Mahout:Scalable machine learning and data mining[ EB/OL]. 2013 - 4 - 24. http ://mahout. apache, org/.
  • 10Li B, Zhao H, Lv Z. Parallel ISODATA Clustering of Remote Sensing Images Based on MapReduce [ C ]//Proc. of Cyber-Enabled Distribu- ted Computing and Knowledge Discovery ,2010:380 - 383.

引证文献3

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部