期刊文献+

一种大数据交互式挖掘框架与实现

A Framework and Implementation of Big Data Interactive Mining
原文传递
导出
摘要 在传统的数据挖掘过程中,用户需根据专业知识对数据进行预处理,为模型设定参数后构建模型,通过评估指标判断模型是否可行。该过程的不便性体现在模型以黑盒的方式构建,用户不可见其中间过程,模型产生的结果也不易被理解。在海量数据的环境下,传统数据挖掘过程在预处理时异常数据的定位和模型生成后知识的表达方面都有不便。为了解决传统数据挖掘过程存在的问题,本文提出了一种大数据环境下的交互式数据挖掘框架。该框架使交互贯穿整个数据挖掘的过程,使得用户可以轻松定位异常输入源数据,参与模型训练过程,对模型生成的结果溯源。本文还基于Spark对该框架进行了实现,并在食源性疾病爆发预测场景下验证了其可行性。 In the traditional data mining process, the dataare pre-processedbasedon the professional knowledge first, then the algorithm parameters are set and are verified.Finally the feasibility ofmodel is determined according to the evaluation of output of data procession. However, there is inconvenience in the process. For example, the model is built in black-box way, the intermediate results can't be seen, and the final results are not easy to be interpreted. In order to solve the problem in traditional data mining process, this paper presents an interactive data mining framework in large data environment. The framework supports
作者 王锐君 黎建辉 Wang Ruijun Li Jianhui(Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190 University of Chinese Academy of Sciences, Beijing 100049)
出处 《科研信息化技术与应用》 2016年第6期61-68,共8页 E-science Technology & Application
基金 国家自然科学基金(41371386 91224006) 中国科学院战略重点研究计划(XDA06010307 XDA05050601) 十二五科技支撑计划(2013BAD15B02) 国家卫生和计划生育委员会国家卫生家庭特别研究经费(201302005)
关键词 交互式挖掘 大数据 SPARK interactive mining big data Spark
  • 相关文献

参考文献2

二级参考文献6

  • 1[1]Keim D A. Visual Database Exploration Techniques. Proc. Tutorial Int.Conf. on Knowledge Discovery & Data Mining, Newport Beach, CA,1997
  • 2[2]Ankerst M. Visual Classification:An Interactive Approach to Decision Tree Construction. Proc.5th Int. Conf. On Knowledge Discovery & Data Mining(KDD'99),SanDiego,CA,1999:392-396
  • 3[3]Kohonen T. Self-organizing Maps Springer- verlag. Berlin,1997
  • 4[4]Inselberg A. The Plane with Parallel Coordinates. Special Issue on Computational Geometry, The Visual Computer, 1985, 1:69-97
  • 5[5]Ultsch A, Siemon H P. Kohonen's Self-organizing Feature Maps for Exploratory Data Analysis. In Proc. INNC'90,International Neural Network Conference,Dordrecht,Netherlands, 1990:305-308
  • 6马昱欣,徐佳逸,彭帝超,张婷,金呈哲,屈华民,陈为,彭群生.A Visual Analysis Approach for Community Detection of Multi-Context Mobile Social Networks[J].Journal of Computer Science & Technology,2013,28(5):797-809. 被引量:3

共引文献49

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部