摘要
针对现实世界中样本对象的不确定性及样本对象间界限划分的模糊性,提出基于模糊C-均值的空间不确定数据聚类算法UFCM。但由于UFCM算法在聚类过程中涉及大量期望距离的复杂积分计算,导致UFCM算法性能不理想,进而给出改进算法I_UFCM,将空间不确定对象聚类问题转化为传统的确定对象聚类问题,采用相似度计算公式减少期望距离的计算量,提高聚类结果的质量。实验结果表明,与UFCM和UK-Means算法相比,I_UFCM算法在空间不确定数据集上具有更好的聚类性能,CUP耗时降低了90%以上。
Aiming at the uncertainty of sample object in real world and the fuzzy boundary between sample objects,this paper proposes a Uncertain Fuzzy C-Means(UFCM)algorithm.Because of a lot of complex integral calculation in expected distance computation,UFCM algorithm is inefficiency.Further,an improved algorithm called I_UFCM is proposed.In this algorithm,the spatial uncertain objects are transformed into the traditional certain objects for clustering.Besides,a new formula for calculation similarity is introduced instead of traditional Euclidean norm to evaluate the distance between objects.The quality of clustering results is improved by reducing the computational amount of excepted distance.Experimental results demonstrate the clustering performance of I_UFCM algorithm is more effective than UFCM and UK-Means algorithm,and its CPU time is reduced by 90%.
出处
《计算机工程》
CAS
CSCD
北大核心
2015年第10期47-52,共6页
Computer Engineering
基金
黑龙江省自然科学基金资助项目(F201014
F201134
F201302)
黑龙江省教育厅科学技术研究基金资助项目(12531120
12541128
12511100)
关键词
模糊C-均值
不确定数据
概率密度函数
期望距离
质心
fuzzy C-means
uncertain data
probability density function
excepted distance
centroid