Semi-supervised competitive agglomeration algorithm based on dot density and its application in image clustering

Cited by: 2

Abstract: The classic competitive agglomeration (CA) algorithm finds the number of clusters automatically. During iteration it identifies and discards spurious cluster centers until the remaining number of clusters suits the sample data, so a badly preset cluster number cannot distort the result and no precise cluster number has to be supplied. However, CA does not use the small amount of known information that commonly exists in sample data, even though this information can benefit the whole clustering process. In addition, CA uses the Euclidean distance as its similarity measure. Although this distance is cheap to compute and widely used, it suits only spherical clusters and tends toward equal partitioning of the data set, which restricts the algorithm's range of application. To address these problems, a semi-supervised term is introduced to strengthen the partitioning capability of the membership matrix, so that the algorithm can exploit the known information in the sample data. The dot density of the sample data, which reflects the importance of a point in clustering, is used to generate a distance correction factor that adjusts the Euclidean distance and counters the tendency toward equal partitioning. The result is a semi-supervised CA algorithm based on dot density. Four images in two groups, one artificial and the other real, were used to examine segmentation, and three other algorithms were used for comparison. The segmentation results and the performance comparison show that the proposed algorithm obtains more accurate cluster centers and better clustering results.
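The abstract does not give the formula for dot density or for the distance correction factor. As a rough illustration only, the sketch below uses one common definition from density-weighted fuzzy C-means variants: the density of a point is the sum of inverse distances to its neighbours within a radius. The function names `dot_density` and `density_weights` and the `radius` parameter are hypothetical and are not taken from the paper. How the paper turns such a density into its correction factor is not stated here, so that step is left out.

```python
import numpy as np


def dot_density(X, radius):
    """Assumed definition: sum of 1/distance over neighbours within `radius`.

    X is an (n_samples, n_features) array. A point with no neighbour
    inside the radius gets density 0.
    """
    # Pairwise Euclidean distances via broadcasting.
    diff = X[:, None, :] - X[None, :, :]
    d = np.sqrt((diff ** 2).sum(axis=2))
    # Exclude zero distances (the point itself, and exact duplicates).
    mask = (d > 0) & (d <= radius)
    inv = np.zeros_like(d)
    inv[mask] = 1.0 / d[mask]
    return inv.sum(axis=1)


def density_weights(X, radius):
    """Normalise densities so they sum to 1 (uniform if all are zero)."""
    z = dot_density(X, radius)
    total = z.sum()
    if total == 0:
        return np.full(len(X), 1.0 / len(X))
    return z / total


if __name__ == "__main__":
    # Three nearby points and one isolated point.
    X = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [10.0, 10.0]])
    print(dot_density(X, radius=2.0))
    print(density_weights(X, radius=2.0))
```

Under this assumed definition, points in crowded regions receive larger weights and isolated points receive weights near zero, which matches the abstract's statement that dot density reflects the importance of a point in clustering.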

Authors: Yu Ping (于平), Wang Shitong (王士同)

Affiliation: School of Digital Media, Jiangnan University

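Source: Journal of Nanjing University (Natural Science), 2014, No. 4, pp. 447-456 (10 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list.

Funding: National Natural Science Foundation of China (61170122); Natural Science Foundation of Jiangsu Province (BK2012552)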

Keywords: competitive agglomeration (CA) algorithm; Euclidean distance; semi-supervised; dot density; distance correction factor

Classification: TP18 [Automation and Computer Technology: Control Theory and Control Engineering]



