期刊文献+

基于Spark的大数据三枝决策分类方法 被引量:1

Processing Big Data with Three Way Decision Based on Spark
下载PDF
导出
摘要 针对大规模的数据,借助Spark平台的分布式快速处理能力,提出了基于Spark的大数据三枝决策分类方法。该方法基于三枝决策理论,使用Spark对数据进行并行化处理。由经验数据获得数据的决策边界后,通过并行的方式进行正例和反例的判断,从而提高了在大数据集上的决策效率。采用多轮的分步决策方法提高了决策的效率与准确率。通过在UCI公开数据集mushroom和connect-4上的试验结果表明,新方法适用于大数据情况下的决策问题,大大提高了三枝决策分类算法的效率。 Aiming to solve decision-making problems on big data, we combine the ability of dis-tributed data processing of spark with three-way decision theory o This method is based on three-way decision theory. The boundary of the decision regions is firstly calculated by given data, and then each sample is estimated in terms of the belongings by paralleling, which can increase the effi-ciency greatly. Multi-round step by step decision making is used to further increase the efficiency and accuracy. The experiments are conducted on the UCI datasets (mushroom and connect-4). The results show that the proposed method is effective in processing big data.
作者 刘牧雷 徐菲菲 LIU Mulei;XU Feifei(School of Computer Science and Technology,Shanghai University of Electric Power,Shanghai 200090,China)
出处 《上海电力学院学报》 CAS 2018年第5期483-490,共8页 Journal of Shanghai University of Electric Power
基金 国家自然科学基金(61272437 61305094) 上海市教育发展基金会和上海市教育委员会"晨光计划"(13CG58)
关键词 三枝决策 SPARK 大数据 three-way decision Spark big data
  • 相关文献

参考文献3

二级参考文献14

共引文献16

同被引文献6

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部