摘要
在云计算背景下,海量数据之间会相互影响,影响了既定的关联原则,线性思维占据主导性地位,数据挖掘效果较差。简要分析当前传统思维方式下数据挖掘平台存在的问题,并将该思维方式转变为分布式思维,介绍了在分布式思维下建立数据挖掘平台的优势,并分析建立、设计方案。该方案可以有效解决冗余干扰问题,计算出区域内部的相似程度,在分布式思维数据之间产生关联。
In the background of cloud computing,massive data can affect each other,affecting the established correlation principle,and linear thinking dominates,which makes data mining less effective.The advantages of establishing a data mining platform under distributed thinking are introduced,and the establishment and design scheme are analysed.The solution can effectively solve the problem of redundant interference,calculate the degree of similarity within regions,and generate correlations between distributed thinking data.
作者
王哲
赵爽
Wang Zhe;Zhao Shuang(Tiefa Coal Group Big Data Operation Limited Liability Company,Tieling Liaoning 112700)
出处
《现代工业经济和信息化》
2022年第8期110-111,共2页
Modern Industrial Economy and Informationization
关键词
分布式思维
云计算数据挖掘平台
架构设计
distributed thinking
cloud computing data mining platform
architecture design