期刊文献+

矩阵压缩Apriori算法分析 被引量:11

Analysis of matrix compression Apriori algorithm
下载PDF
导出
摘要 Apriori算法在处理较大的数据集时存在着不足:1)会产生数量庞大的候选项集,对算法运算时间和主存空间来说挑战巨大;2)多次扫描事务数据库会产生巨大的I/O负载。针对上述问题,提出了基于聚类和矩阵压缩的Apriori算法——KCCM算法。首先,通过K-means算法对大型数据集进行预处理,将其划分为若干个较小的数据集,并给出了合理性分析和证明;然后,将各个小数据集转化为布尔矩阵的形式,通过矩阵压缩的运算方式进行关联规则挖掘;最后,通过Matlab软件对算法进行了多组实验仿真,分别对Apriori算法和KCCM算法从运算时间、运行结果上进行了分析对比,实验结果表明,相比Apriori算法,KCCM算法的运行效率提高了近46.1%。 As the most important algorithm of association rule, Apriori algorithm is widely used in all walks of life. However,Apriori algorithm exists some problems when dealing with large data sets: 1) Apriori algorithm produces a large number of candidate items, which brings great challenge to operation time and storage space; 2) scanning the data set for many times produces a great I/O load. In view of these problems, an Apriori algorithm based on K-means algorithm and matrix compression was proposed, which named KCCM algorithm. Firstly, K-means algorithm was used to divide a large data set into several small data sets, and the rationality analysis and proof were given. Then, each small data set was converted to a Boolean matrix, and association rules were mined through matrix compression. At last, the operation step was simulated by Matlab software, and Apriori algorithm and KCCM algorithm were analyzed in operation time and running results. The experimental results show that, compared with Apriori algorithm,KCCM algorithm increased the run efficiency by nearly 46. 1%.
出处 《计算机应用》 CSCD 北大核心 2017年第A02期207-209,240,共4页 journal of Computer Applications
基金 国家自然科学基金青年科学基金资助项目(51409065 51309068)
关键词 数据挖掘 关联分析 K-MEANS算法 矩阵压缩 APRIORI算法 data mining association analysis K-means algorithm matrix compression Apriori algorithm
  • 相关文献

参考文献9

二级参考文献146

共引文献412

同被引文献70

引证文献11

二级引证文献51

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部