A novel unsupervised approach to automatically constructing multilevel image clusters from unordered im- ages is proposed in this paper. The whole input image col- lection is represented as an imaging sample space (...A novel unsupervised approach to automatically constructing multilevel image clusters from unordered im- ages is proposed in this paper. The whole input image col- lection is represented as an imaging sample space (ISS) con- sisting of globally indexed image features extracted by a new efficient multi^view image feature matching method. By mak- ing an analogy between image capturing and observation of ISS, each image is represented as a binary sequence, in which each bit indicates the visibility of a corresponding feature. Based on information theory-inspired image popularity and dissimilarity measures, we show that the image content and distance can be quantitatively described, guided by which an input image collection is organized into multilevel clusters automatically. The effectiveness and the efficiency of the pro- posed approach are demonstrated using three real image col- lections and promising results were obtained from both qual- itative and quantitative evaluation.展开更多
This paper presents,a new approach of Medical Image Pixels Clustering(MIPC),aims to trace the dissimilar patterns over the Magnetic Resonance(MR)image through the process of automatically identify the appropriate numb...This paper presents,a new approach of Medical Image Pixels Clustering(MIPC),aims to trace the dissimilar patterns over the Magnetic Resonance(MR)image through the process of automatically identify the appropriate number of distinct clusters based on different improved unsupervised clustering schemes for enrichment,pattern predication and deeper investigation.The proposed MIPC consists of two stages:clustering and validation.In the clustering stage,the MIPC automatically identifies the distinct number of dissimilar clusters over the gray scale MR image based on three different improved unsupervised clustering schemes likely improved Limited Agglomerative Clustering(iLIAC),Dynamic Automatic Agglomerative Clustering(DAAC)and Optimum N-Means(ONM).In the second stage,the performance of MIPC approach is estimated by measuring Intra intimacy and Intra contrast of each individual cluster in the result of MR image based on proposed validation method namely Shreekum Intra Cluster Measure(SICM).Experimental results showthat the MIPC approach is better suited for automatic identification of highly relative dissimilar clusters over the MR cancer images with higher Intra closeness and lower Intra contrast based on improved unsupervised clustering schemes.展开更多
Machine vision measurement(MVM)is an essential approach that measures the area or length of a target efficiently and non-destructively for product quality control.The result of MVM is determined by its configuration,e...Machine vision measurement(MVM)is an essential approach that measures the area or length of a target efficiently and non-destructively for product quality control.The result of MVM is determined by its configuration,especially the lighting scheme design in image acquisition and the algorithmic parameter optimization in image processing.In a traditional workflow,engineers constantly adjust and verify the configuration for an acceptable result,which is time-consuming and significantly depends on expertise.To address these challenges,we propose a target-independent approach,visual interactive image clustering,which facilitates configuration optimization by grouping images into different clusters to suggest lighting schemes with common parameters.Our approach has four steps:data preparation,data sampling,data processing,and visual analysis with our visualization system.During preparation,engineers design several candidate lighting schemes to acquire images and develop an algorithm to process images.Our approach samples engineer-defined parameters for each image and obtains results by executing the algorithm.The core of data processing is the explainable measurement of the relationships among images using the algorithmic parameters.Based on the image relationships,we develop VMExplorer,a visual analytics system that assists engineers in grouping images into clusters and exploring parameters.Finally,engineers can determine an appropriate lighting scheme with robust parameter combinations.To demonstrate the effiectiveness and usability of our approach,we conduct a case study with engineers and obtain feedback from expert interviews.展开更多
It is a fairly challenging issue to make image repositories easy to be searched and browsed. This depends on a technique--image clustering. Kernel-based clustering algorithm has been one of the most promising clusteri...It is a fairly challenging issue to make image repositories easy to be searched and browsed. This depends on a technique--image clustering. Kernel-based clustering algorithm has been one of the most promising clustering methods in the last few years, beeanse it can handle data with high dimensional complex structure. In this paper, a kernel fuzzy learning (KFL) algorithm is proposed, which takes advantages of the distance kernel trick and the gradient-based fuzzy clustering method to execute the image clustering automatically. Experimental results show that KFL is a more efficient method for image clustering in comparison with recent renorted alternative methods.展开更多
In this paper,we introduce a novel Multi-scale and Auto-tuned Semi-supervised Deep Subspace Clustering(MAS-DSC)algorithm,aimed at addressing the challenges of deep subspace clustering in high-dimensional real-world da...In this paper,we introduce a novel Multi-scale and Auto-tuned Semi-supervised Deep Subspace Clustering(MAS-DSC)algorithm,aimed at addressing the challenges of deep subspace clustering in high-dimensional real-world data,particularly in the field of medical imaging.Traditional deep subspace clustering algorithms,which are mostly unsupervised,are limited in their ability to effectively utilize the inherent prior knowledge in medical images.Our MAS-DSC algorithm incorporates a semi-supervised learning framework that uses a small amount of labeled data to guide the clustering process,thereby enhancing the discriminative power of the feature representations.Additionally,the multi-scale feature extraction mechanism is designed to adapt to the complexity of medical imaging data,resulting in more accurate clustering performance.To address the difficulty of hyperparameter selection in deep subspace clustering,this paper employs a Bayesian optimization algorithm for adaptive tuning of hyperparameters related to subspace clustering,prior knowledge constraints,and model loss weights.Extensive experiments on standard clustering datasets,including ORL,Coil20,and Coil100,validate the effectiveness of the MAS-DSC algorithm.The results show that with its multi-scale network structure and Bayesian hyperparameter optimization,MAS-DSC achieves excellent clustering results on these datasets.Furthermore,tests on a brain tumor dataset demonstrate the robustness of the algorithm and its ability to leverage prior knowledge for efficient feature extraction and enhanced clustering performance within a semi-supervised learning framework.展开更多
In this paper, we consider the problem of clustering Web images by mining correlations between images and their corresponding words. Since Web images always come with associated text, the corresponding textual tags of...In this paper, we consider the problem of clustering Web images by mining correlations between images and their corresponding words. Since Web images always come with associated text, the corresponding textual tags of Web images are used as a source to enhance the description of Web images. However, each word has different contribution for the interpretation of image semantics. Therefore, in order to evaluate the importance of each corresponding word of Web images, we propose a novel visibility model to compute the extent to which a word can be perceived visually in images, and then infer the correlation of word to image by the integration of visibility with tf-idf. Furthermore, Latent Dirichlet Allocation (LDA) is used to discover topic information inherent in surrounding text and topic correlations of images could be defined for image clustering. For integrating visibility and latent topic information into an image clustering framework, we first represent textual correlated and latent-topic correlated images by two hypergraph views, and then the proposed Spectral Multiple Hypergraph Clustering (SMHC) algorithm is used to cluster images into categories. The SMHC could be regarded as a new unsupervised learning process with two hypergraphs to classify Web images. Experimental results show that the SMHC algorithm has better clustering performance and the proposed SMHC-based image clustering framework is effective.展开更多
A principal direction Gaussian image (PDGI)-based algorithm is proposed to extract the regular swept surface from point cloud. Firstly, the PDGI of the regular swept surface is constructed from point cloud, then the...A principal direction Gaussian image (PDGI)-based algorithm is proposed to extract the regular swept surface from point cloud. Firstly, the PDGI of the regular swept surface is constructed from point cloud, then the bounding box of the Gaussian sphere is uniformly partitioned into a number of small cubes (3D grids) and the PDGI points on the Gaussian sphere are associated with the corresponding 3D grids. Secondly, cluster analysis technique is used to sort out a group of 3D grids containing more PDGI points among the 3D grids. By the connected-region growing algorithm, the congregation point or the great circle is detected from the 3D grids. Thus the translational direction is determined by the congregation point and the direction of the rotational axis is determined by the great circle. In addition, the positional point of the rotational axis is obtained by the intersection of all the projected normal lines of the rotational surface on the plane being perpendicular to the estimated direction of the rotational axis. Finally, a pattem search method is applied to optimize the translational direction and the rotational axis. Some experiments are used to illustrate the feasibility of the above algorithm.展开更多
Most existing applications of centroidal Voronoi tessellations(CVTs) lack consideration of the length of the cluster boundaries.In this paper we propose a new model and algorithms to produce segmentations which would ...Most existing applications of centroidal Voronoi tessellations(CVTs) lack consideration of the length of the cluster boundaries.In this paper we propose a new model and algorithms to produce segmentations which would minimize the total energy—a sum of the classic CVT energy and the weighted length of cluster boundaries.To distinguish it with the classic CVTs,we call it an Edge-Weighted CVT(EWCVT).The concept of EWCVT is expected to build a mathematical base for all CVT related data classifications with requirement of smoothness of the cluster boundaries.The EWCVT method is easy in implementation,fast in computation,and natural for any number of clusters.展开更多
In order to solve the problem of indoor place recognition for indoor service robot, a novel algorithm, clustering of features and images (CFI), is proposed in this work. Different from traditional indoor place recog...In order to solve the problem of indoor place recognition for indoor service robot, a novel algorithm, clustering of features and images (CFI), is proposed in this work. Different from traditional indoor place recognition methods which are based on kernels or bag of features, with large margin classifier, CFI proposed in this work is based on feature matching, image similarity and clustering of features and images. It establishes independent local feature clusters by feature cloud registration to represent each room, and defines image distance to describe the similarity between images or feature clusters, which determines the label of query images. Besides, it improves recognition speed by image scaling, with state inertia and hidden Markov model constraining the transition of the state to kill unreasonable wrong recognitions and achieves remarkable precision and speed. A series of experiments are conducted to test the algorithm based on standard databases, and it achieves recognition rate up to 97% and speed is over 30 fps, which is much superior to traditional methods. Its impressive precision and speed demonstrate the great discriminative power in the face of complicated environment.展开更多
文摘A novel unsupervised approach to automatically constructing multilevel image clusters from unordered im- ages is proposed in this paper. The whole input image col- lection is represented as an imaging sample space (ISS) con- sisting of globally indexed image features extracted by a new efficient multi^view image feature matching method. By mak- ing an analogy between image capturing and observation of ISS, each image is represented as a binary sequence, in which each bit indicates the visibility of a corresponding feature. Based on information theory-inspired image popularity and dissimilarity measures, we show that the image content and distance can be quantitatively described, guided by which an input image collection is organized into multilevel clusters automatically. The effectiveness and the efficiency of the pro- posed approach are demonstrated using three real image col- lections and promising results were obtained from both qual- itative and quantitative evaluation.
基金This work is supported by Faculty of Science and Technology,University of the Faroe Islands,Faroe Islands,Denmark and REVA University,Bengaluru.
文摘This paper presents,a new approach of Medical Image Pixels Clustering(MIPC),aims to trace the dissimilar patterns over the Magnetic Resonance(MR)image through the process of automatically identify the appropriate number of distinct clusters based on different improved unsupervised clustering schemes for enrichment,pattern predication and deeper investigation.The proposed MIPC consists of two stages:clustering and validation.In the clustering stage,the MIPC automatically identifies the distinct number of dissimilar clusters over the gray scale MR image based on three different improved unsupervised clustering schemes likely improved Limited Agglomerative Clustering(iLIAC),Dynamic Automatic Agglomerative Clustering(DAAC)and Optimum N-Means(ONM).In the second stage,the performance of MIPC approach is estimated by measuring Intra intimacy and Intra contrast of each individual cluster in the result of MR image based on proposed validation method namely Shreekum Intra Cluster Measure(SICM).Experimental results showthat the MIPC approach is better suited for automatic identification of highly relative dissimilar clusters over the MR cancer images with higher Intra closeness and lower Intra contrast based on improved unsupervised clustering schemes.
基金Project supported by the National Key R&D Program of China(No.2020YFB1707700)the Zhejiang Provincial Natural Science Foundation of China(No.LR23F020003)the National Nat-ural Science Foundation of China(Nos.61972356 and 62036009)。
文摘Machine vision measurement(MVM)is an essential approach that measures the area or length of a target efficiently and non-destructively for product quality control.The result of MVM is determined by its configuration,especially the lighting scheme design in image acquisition and the algorithmic parameter optimization in image processing.In a traditional workflow,engineers constantly adjust and verify the configuration for an acceptable result,which is time-consuming and significantly depends on expertise.To address these challenges,we propose a target-independent approach,visual interactive image clustering,which facilitates configuration optimization by grouping images into different clusters to suggest lighting schemes with common parameters.Our approach has four steps:data preparation,data sampling,data processing,and visual analysis with our visualization system.During preparation,engineers design several candidate lighting schemes to acquire images and develop an algorithm to process images.Our approach samples engineer-defined parameters for each image and obtains results by executing the algorithm.The core of data processing is the explainable measurement of the relationships among images using the algorithmic parameters.Based on the image relationships,we develop VMExplorer,a visual analytics system that assists engineers in grouping images into clusters and exploring parameters.Finally,engineers can determine an appropriate lighting scheme with robust parameter combinations.To demonstrate the effiectiveness and usability of our approach,we conduct a case study with engineers and obtain feedback from expert interviews.
基金Supported by the National Natural Science Foundation of China (No. 61101159, 60872123), the China Postdoctoral Science Foundation (No. 20100480049) and the Fundamental Research Funds for the Central Universities (No. 201 IZM0033)
文摘It is a fairly challenging issue to make image repositories easy to be searched and browsed. This depends on a technique--image clustering. Kernel-based clustering algorithm has been one of the most promising clustering methods in the last few years, beeanse it can handle data with high dimensional complex structure. In this paper, a kernel fuzzy learning (KFL) algorithm is proposed, which takes advantages of the distance kernel trick and the gradient-based fuzzy clustering method to execute the image clustering automatically. Experimental results show that KFL is a more efficient method for image clustering in comparison with recent renorted alternative methods.
基金supported in part by the National Natural Science Foundation of China under Grant 62171203in part by the Jiangsu Province“333 Project”High-Level Talent Cultivation Subsidized Project+2 种基金in part by the SuzhouKey Supporting Subjects for Health Informatics under Grant SZFCXK202147in part by the Changshu Science and Technology Program under Grants CS202015 and CS202246in part by Changshu Key Laboratory of Medical Artificial Intelligence and Big Data under Grants CYZ202301 and CS202314.
文摘In this paper,we introduce a novel Multi-scale and Auto-tuned Semi-supervised Deep Subspace Clustering(MAS-DSC)algorithm,aimed at addressing the challenges of deep subspace clustering in high-dimensional real-world data,particularly in the field of medical imaging.Traditional deep subspace clustering algorithms,which are mostly unsupervised,are limited in their ability to effectively utilize the inherent prior knowledge in medical images.Our MAS-DSC algorithm incorporates a semi-supervised learning framework that uses a small amount of labeled data to guide the clustering process,thereby enhancing the discriminative power of the feature representations.Additionally,the multi-scale feature extraction mechanism is designed to adapt to the complexity of medical imaging data,resulting in more accurate clustering performance.To address the difficulty of hyperparameter selection in deep subspace clustering,this paper employs a Bayesian optimization algorithm for adaptive tuning of hyperparameters related to subspace clustering,prior knowledge constraints,and model loss weights.Extensive experiments on standard clustering datasets,including ORL,Coil20,and Coil100,validate the effectiveness of the MAS-DSC algorithm.The results show that with its multi-scale network structure and Bayesian hyperparameter optimization,MAS-DSC achieves excellent clustering results on these datasets.Furthermore,tests on a brain tumor dataset demonstrate the robustness of the algorithm and its ability to leverage prior knowledge for efficient feature extraction and enhanced clustering performance within a semi-supervised learning framework.
基金Supported by the National Natural Science Foundation of China under Grant Nos.90920303,60833006the National Basic Research 973 Program of China under Grant No.2010CB327905the Program for Changjiang Scholars and Innovative Research Team in University of China under Grant Nos.IRT0652,PCSIRT
文摘In this paper, we consider the problem of clustering Web images by mining correlations between images and their corresponding words. Since Web images always come with associated text, the corresponding textual tags of Web images are used as a source to enhance the description of Web images. However, each word has different contribution for the interpretation of image semantics. Therefore, in order to evaluate the importance of each corresponding word of Web images, we propose a novel visibility model to compute the extent to which a word can be perceived visually in images, and then infer the correlation of word to image by the integration of visibility with tf-idf. Furthermore, Latent Dirichlet Allocation (LDA) is used to discover topic information inherent in surrounding text and topic correlations of images could be defined for image clustering. For integrating visibility and latent topic information into an image clustering framework, we first represent textual correlated and latent-topic correlated images by two hypergraph views, and then the proposed Spectral Multiple Hypergraph Clustering (SMHC) algorithm is used to cluster images into categories. The SMHC could be regarded as a new unsupervised learning process with two hypergraphs to classify Web images. Experimental results show that the SMHC algorithm has better clustering performance and the proposed SMHC-based image clustering framework is effective.
基金This project is supported by Key Program of National Natural Science Foundation of China(No.50435020).
文摘A principal direction Gaussian image (PDGI)-based algorithm is proposed to extract the regular swept surface from point cloud. Firstly, the PDGI of the regular swept surface is constructed from point cloud, then the bounding box of the Gaussian sphere is uniformly partitioned into a number of small cubes (3D grids) and the PDGI points on the Gaussian sphere are associated with the corresponding 3D grids. Secondly, cluster analysis technique is used to sort out a group of 3D grids containing more PDGI points among the 3D grids. By the connected-region growing algorithm, the congregation point or the great circle is detected from the 3D grids. Thus the translational direction is determined by the congregation point and the direction of the rotational axis is determined by the great circle. In addition, the positional point of the rotational axis is obtained by the intersection of all the projected normal lines of the rotational surface on the plane being perpendicular to the estimated direction of the rotational axis. Finally, a pattem search method is applied to optimize the translational direction and the rotational axis. Some experiments are used to illustrate the feasibility of the above algorithm.
基金supported in part by the U.S.National Science Foundation under grant number DMS-0913491.
文摘Most existing applications of centroidal Voronoi tessellations(CVTs) lack consideration of the length of the cluster boundaries.In this paper we propose a new model and algorithms to produce segmentations which would minimize the total energy—a sum of the classic CVT energy and the weighted length of cluster boundaries.To distinguish it with the classic CVTs,we call it an Edge-Weighted CVT(EWCVT).The concept of EWCVT is expected to build a mathematical base for all CVT related data classifications with requirement of smoothness of the cluster boundaries.The EWCVT method is easy in implementation,fast in computation,and natural for any number of clusters.
基金supported by National Natural Science Foundation of China(Nos.61305103 and 61473103)Natural Science Foundation Heilongjiang province(No.QC2014C072)+1 种基金Postdoctoral Science Foundation of Heilongjiang(No.LBH-Z14108)SelfPlanned Task of State Key Laboratory of Robotics and System(HIT)(No.SKLRS201609B)
文摘In order to solve the problem of indoor place recognition for indoor service robot, a novel algorithm, clustering of features and images (CFI), is proposed in this work. Different from traditional indoor place recognition methods which are based on kernels or bag of features, with large margin classifier, CFI proposed in this work is based on feature matching, image similarity and clustering of features and images. It establishes independent local feature clusters by feature cloud registration to represent each room, and defines image distance to describe the similarity between images or feature clusters, which determines the label of query images. Besides, it improves recognition speed by image scaling, with state inertia and hidden Markov model constraining the transition of the state to kill unreasonable wrong recognitions and achieves remarkable precision and speed. A series of experiments are conducted to test the algorithm based on standard databases, and it achieves recognition rate up to 97% and speed is over 30 fps, which is much superior to traditional methods. Its impressive precision and speed demonstrate the great discriminative power in the face of complicated environment.