摘要
针对传统密度网格算法在聚类中自动获取密度阈值不够精确的问题,提出了一种基于密度网格参数自适应的数据流聚类算法A-Stream。通过引入"双密度阈值",并以平均值作为密度阈值,对传统聚类算法进行了改进,解决了算法不能获取精确值的问题。实验结果表明,A-Stream算法不仅保留了传统密度网格算法的高效性,而且较大程度上提高了聚类精度。
For the problem that traditional density grid-based stream clustering algorithm cannot get accurate density value, this paper introduces a new density grid-based stream clustering algorithm with parameter automatization A-Stream. Through the introduction of the double density, the traditional density grid-based clustering algorithm for data stream is improved by taking the average as the grid density, resolving the problem that algorithm cannot get accurate value automatically. The experimental results show that not only the high efficiency of the grid-based algorithm is utilized, but also the clustering accuracy is raised significantly.
出处
《计算机科学与探索》
CSCD
2011年第10期953-958,共6页
Journal of Frontiers of Computer Science and Technology
关键词
聚类
数据流
网格
参数自适应
密度阈值
clustering
data stream
grid
parameter adaptation
density threshold