Convex clustering,turning clustering into a convex optimization problem,has drawn wide attention.It overcomes the shortcomings of traditional clustering methods such as K-means,Density-Based Spatial Clustring of Appli...Convex clustering,turning clustering into a convex optimization problem,has drawn wide attention.It overcomes the shortcomings of traditional clustering methods such as K-means,Density-Based Spatial Clustring of Applications with Noise(DBSCAN)and hierarchical clustering that can easily fall into the local optimal solution.However,convex clustering is vulnerable to the occurrence of outlier features,as it uses the Frobenius norm to measure the distance between data points and their corresponding cluster centers and evaluate clusters.To accurately identify outlier features,this paper decomposes data into a clustering structure component and a normalized component that captures outlier features.Different from existing convex clustering evaluating features with the exact measurement,the proposed model can overcome the vast difference in the magnitude of different features and the outlier features can be efficiently identified and removed.To solve the proposed model,we design an efficient algorithm and prove the global convergence of the algorithm.Experiments on both synthetic datasets and UCI datasets demonstrate that the proposed method outperforms the compared approaches in convex clustering.展开更多
基金This work was supported by the National Natural Science Foundation of China(No.11771003).
文摘Convex clustering,turning clustering into a convex optimization problem,has drawn wide attention.It overcomes the shortcomings of traditional clustering methods such as K-means,Density-Based Spatial Clustring of Applications with Noise(DBSCAN)and hierarchical clustering that can easily fall into the local optimal solution.However,convex clustering is vulnerable to the occurrence of outlier features,as it uses the Frobenius norm to measure the distance between data points and their corresponding cluster centers and evaluate clusters.To accurately identify outlier features,this paper decomposes data into a clustering structure component and a normalized component that captures outlier features.Different from existing convex clustering evaluating features with the exact measurement,the proposed model can overcome the vast difference in the magnitude of different features and the outlier features can be efficiently identified and removed.To solve the proposed model,we design an efficient algorithm and prove the global convergence of the algorithm.Experiments on both synthetic datasets and UCI datasets demonstrate that the proposed method outperforms the compared approaches in convex clustering.