期刊文献+

一种基于内存计算的电力用户聚类分析方法 被引量:7

A method for cluster analysis of electric power consumers based on in-memory computing
下载PDF
导出
摘要 随着智能电表与采集终端采集的用电数据迅猛增长,传统数据分析方法已经不能满足大数据环境下智能用电行为分析的需要。鉴于K-means算法具有计算效率高、容易并行化等特点,采用弹性分布式数据集与并行内存计算框架对其进行改进与并行化,减少作业的运行与输入输出操作时间,提高聚类分析的处理能力。对用电测量数据进行预处理构建实验数据集,实验结果表明本方法对电力用户聚类分析的准确率高于单机K-means方法,其处理速度和能力明显优于单机和基于Map Reduce并行计算框架的聚类方法,并对数据的增长具有较好的适应性。 With the rapid growth of electricity consumption data collected by smart electric meters and data acquisition terminals,the traditional data analysis method cannot meet the demand of smart power consumption behavior analysis in the big data environment. Since K-means algorithm demonstrates high calculation efficiency,easy parallelization and other characteristics,a method for improving and parallelizing K-means with the resilient distributed data set and parallel in-memory computing framework is presented,the running time of job operation and I / O operations is reduced,and the ability of clustering analysis is improved. The experimental data set is built by preprocessed electricity consumption data. Eexperimental results show that the accuracy rate by this cluster analysis method for electric power users is obviously better than the single machine K-means algorithm. The processing speed and ability of this method are superior to the single machine and the clustering method based on Map Reduce parallel computing framework,and this method has good adaptability for the growth of data.
出处 《智能系统学报》 CSCD 北大核心 2015年第4期569-576,共8页 CAAI Transactions on Intelligent Systems
基金 国家自然科学基金资助项目(61074078) 中央高校基本科研业务费专项资金资助项目(12MS113)
关键词 大数据 智能用电 弹性分布式数据集 内存计算 聚类分析 big data smart electricity consumption resilient distributed data set in-memory computing cluster analysis
  • 相关文献

参考文献19

二级参考文献238

共引文献1218

同被引文献75

引证文献7

二级引证文献58

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部