In allusion to the disadvantage of having to obtain the number of clusters of data sets in advance and the sensitivity to selecting initial clustering centers in the k-means algorithm, an improved k-means clustering a...In allusion to the disadvantage of having to obtain the number of clusters of data sets in advance and the sensitivity to selecting initial clustering centers in the k-means algorithm, an improved k-means clustering algorithm is proposed. First, the concept of a silhouette coefficient is introduced, and the optimal clustering number Kopt of a data set with unknown class information is confirmed by calculating the silhouette coefficient of objects in clusters under different K values. Then the distribution of the data set is obtained through hierarchical clustering and the initial clustering-centers are confirmed. Finally, the clustering is completed by the traditional k-means clustering. By the theoretical analysis, it is proved that the improved k-means clustering algorithm has proper computational complexity. The experimental results of IRIS testing data set show that the algorithm can distinguish different clusters reasonably and recognize the outliers efficiently, and the entropy generated by the algorithm is lower.展开更多
In this paper, the borrowing data of readers is analyzed and studied by taking K-Means algorithm as an example and implementing this algorithm in Hadoop calculation platform, and data mining technology is effectively ...In this paper, the borrowing data of readers is analyzed and studied by taking K-Means algorithm as an example and implementing this algorithm in Hadoop calculation platform, and data mining technology is effectively and closely combined with personalized library service through the experimental data.展开更多
In this paper,according to the old people's physical characteristics and their technical requirements for comfort and mastery when operating the robot,a control approach driven by tactile and slip senses is invest...In this paper,according to the old people's physical characteristics and their technical requirements for comfort and mastery when operating the robot,a control approach driven by tactile and slip senses is investigated to control the elderly-assistant & walking-assistant robot. First,on the basis of the proposed driving control system program of tactile and slip,a detection system of tactile and slip senses are designed. Based on the tactile and slip feature representation and extraction,an improved classification and recognition method is proposed which combines K-nearest neighbor (KNN) algorithm and K-means algorithm. And then,a robot control system based on TMS320F2812 is designed in this paper,including its hardware and software design. Then,a moving control method including the fuzzy adaptive control algorithm is presented for the walking-assistant robot to realize some different moving properties. At last,by the experimental verification in the walking-assistant robot,the research results show that the tactile and slip senses detection and recognition method is effective,and the whole control system has good feasibility and adaptability.展开更多
The paper study improved K-means algorithm and establish indicators to classify customers according to RFM model. Experimental results show that, the new algorithm has good convergence and stability, it has better tha...The paper study improved K-means algorithm and establish indicators to classify customers according to RFM model. Experimental results show that, the new algorithm has good convergence and stability, it has better than single use of FKP algorithms for clustering. Finally the paper study the application of clustering in customer segmentation of mobile communication enterprise. It discusses the basic theory, customer segmentation methods and steps, the customer segmentation model based on consumption behavior psychology, and the segmentation model is successfully applied to the process of marketing decision support.展开更多
A new recommendation method was presented based on memetic algorithm-based clustering. The proposed method was tested on four highly sparse real-world datasets. Its recommendation performance is evaluated and compared...A new recommendation method was presented based on memetic algorithm-based clustering. The proposed method was tested on four highly sparse real-world datasets. Its recommendation performance is evaluated and compared with that of the frequency-based, user-based, item-based, k-means clustering-based, and genetic algorithm-based methods in terms of precision, recall, and F1 score. The results show that the proposed method yields better performance under the new user cold-start problem when each of new active users selects only one or two items into the basket. The average F1 scores on all four datasets are improved by 225.0%, 61.6%, 54.6%, 49.3%, 28.8%, and 6.3% over the frequency-based, user-based, item-based, k-means clustering-based, and two genetic algorithm-based methods, respectively.展开更多
Clustering is one of the most widely used data mining techniques that can be used to create homogeneous clusters.K-means is one of the popular clustering algorithms that,despite its inherent simplicity,has also some m...Clustering is one of the most widely used data mining techniques that can be used to create homogeneous clusters.K-means is one of the popular clustering algorithms that,despite its inherent simplicity,has also some major problems.One way to resolve these problems and improve the k-means algorithm is the use of evolutionary algorithms in clustering.In this study,the Imperialist Competitive Algorithm(ICA) is developed and then used in the clustering process.Clustering of IRIS,Wine and CMC datasets using developed ICA and comparing them with the results of clustering by the original ICA,GA and PSO algorithms,demonstrate the improvement of Imperialist competitive algorithm.展开更多
The generation of a perceptual map via three-way multidimensional scaling allows analysts to see the separation of objects in Euclidean space. The MDSvarext method incorporates the objects' confidence regions in this...The generation of a perceptual map via three-way multidimensional scaling allows analysts to see the separation of objects in Euclidean space. The MDSvarext method incorporates the objects' confidence regions in this analysis, allowing for statistical inference in the difference between objects, but the confidence regions that are generated are very large because of the inherent variability among the evaluators. One solution to this problem is cluster generation prior to the application of the MDSvarext method in order to obtain homogeneous subgroups and to achieve greater control of the variance. This work is relevant to studies of perception which usually evaluate the difference between objects or stimuli in the point of view of different people that judge this difference using several dimensions. This study investigated the possibility of using a K-means algorithm to generate subgroups before the MDSvarext method was applied, evaluating the process with two quality indicators, one Ex-Ante and one Ex-Post. The experiments were conducted based on simulation of judgment matrix of different objects in multiple dimensions being evaluated by several judges. In this experiment, the matrix used was a 10 objects, in 10 features, judged by 10 people. The results are promising as possible interpretations of the perceptual map and the indicators generated.展开更多
Intravascular ultrasound (IVUS) is a new technology for the diagnosis of coronary artery disease, and for the support of coronary intervention. IVUS image segmentation often encounters difficulties when plaque and aco...Intravascular ultrasound (IVUS) is a new technology for the diagnosis of coronary artery disease, and for the support of coronary intervention. IVUS image segmentation often encounters difficulties when plaque and acoustic shadow are present A novel approach for hard plaque recognition and media-adventitia border detection of IVUS images is presented in this paper. The IVUS images were first enhanced by a spatial-frequency domain filter that was constructed by the directional filter and histogram equalization. Then, the hard plaque was recognized based on the intensity variation within different regions that were obtained using the k-means algorithm. In the next step, a cost matrix representing the probability of the media-adventitia border was generated by combining image gradient, plaque location and image intensity. A heuristic graph-searching was applied to find the media-adventitia border from the cost matrix.Experiment results showed that the accuracy of hard plaque recognition and media-adventitia border detection was 89.94% and 95.57%, respectively. In conclusion,using hard plaques recognition could improve media-adventitia border detection in IVUS images.展开更多
We propose a novel texture clustering method. A classical type of(approximate) shift invariant discrete wavelet transform(DWT),dual tree DWT,is used to decompose texture images. Multiple signatures are generated from ...We propose a novel texture clustering method. A classical type of(approximate) shift invariant discrete wavelet transform(DWT),dual tree DWT,is used to decompose texture images. Multiple signatures are generated from the obtained high-frequency bands. A locality preserving approach is applied subsequently to project data from high-dimensional space to low-dimensional space. Shift invariant DWT can represent image texture information efficiently in combination with a histogram signature,and the local geometrical structure of the dataset is preserved well during clustering. Experimental results show that the proposed method remarkably outperforms traditional ones.展开更多
基金The National Natural Science Foundation of China(No50674086)Specialized Research Fund for the Doctoral Program of Higher Education (No20060290508)the Youth Scientific Research Foundation of China University of Mining and Technology (No2006A047)
文摘In allusion to the disadvantage of having to obtain the number of clusters of data sets in advance and the sensitivity to selecting initial clustering centers in the k-means algorithm, an improved k-means clustering algorithm is proposed. First, the concept of a silhouette coefficient is introduced, and the optimal clustering number Kopt of a data set with unknown class information is confirmed by calculating the silhouette coefficient of objects in clusters under different K values. Then the distribution of the data set is obtained through hierarchical clustering and the initial clustering-centers are confirmed. Finally, the clustering is completed by the traditional k-means clustering. By the theoretical analysis, it is proved that the improved k-means clustering algorithm has proper computational complexity. The experimental results of IRIS testing data set show that the algorithm can distinguish different clusters reasonably and recognize the outliers efficiently, and the entropy generated by the algorithm is lower.
文摘In this paper, the borrowing data of readers is analyzed and studied by taking K-Means algorithm as an example and implementing this algorithm in Hadoop calculation platform, and data mining technology is effectively and closely combined with personalized library service through the experimental data.
基金State Key Laboratory of Robotics and System(HIT) in China(No.SKLRS-2009-MS-02)
文摘In this paper,according to the old people's physical characteristics and their technical requirements for comfort and mastery when operating the robot,a control approach driven by tactile and slip senses is investigated to control the elderly-assistant & walking-assistant robot. First,on the basis of the proposed driving control system program of tactile and slip,a detection system of tactile and slip senses are designed. Based on the tactile and slip feature representation and extraction,an improved classification and recognition method is proposed which combines K-nearest neighbor (KNN) algorithm and K-means algorithm. And then,a robot control system based on TMS320F2812 is designed in this paper,including its hardware and software design. Then,a moving control method including the fuzzy adaptive control algorithm is presented for the walking-assistant robot to realize some different moving properties. At last,by the experimental verification in the walking-assistant robot,the research results show that the tactile and slip senses detection and recognition method is effective,and the whole control system has good feasibility and adaptability.
文摘The paper study improved K-means algorithm and establish indicators to classify customers according to RFM model. Experimental results show that, the new algorithm has good convergence and stability, it has better than single use of FKP algorithms for clustering. Finally the paper study the application of clustering in customer segmentation of mobile communication enterprise. It discusses the basic theory, customer segmentation methods and steps, the customer segmentation model based on consumption behavior psychology, and the segmentation model is successfully applied to the process of marketing decision support.
基金supporting by grant fund under the Strategic Scholarships for Frontier Research Network for the PhD Program Thai Doctoral degree
文摘A new recommendation method was presented based on memetic algorithm-based clustering. The proposed method was tested on four highly sparse real-world datasets. Its recommendation performance is evaluated and compared with that of the frequency-based, user-based, item-based, k-means clustering-based, and genetic algorithm-based methods in terms of precision, recall, and F1 score. The results show that the proposed method yields better performance under the new user cold-start problem when each of new active users selects only one or two items into the basket. The average F1 scores on all four datasets are improved by 225.0%, 61.6%, 54.6%, 49.3%, 28.8%, and 6.3% over the frequency-based, user-based, item-based, k-means clustering-based, and two genetic algorithm-based methods, respectively.
文摘Clustering is one of the most widely used data mining techniques that can be used to create homogeneous clusters.K-means is one of the popular clustering algorithms that,despite its inherent simplicity,has also some major problems.One way to resolve these problems and improve the k-means algorithm is the use of evolutionary algorithms in clustering.In this study,the Imperialist Competitive Algorithm(ICA) is developed and then used in the clustering process.Clustering of IRIS,Wine and CMC datasets using developed ICA and comparing them with the results of clustering by the original ICA,GA and PSO algorithms,demonstrate the improvement of Imperialist competitive algorithm.
文摘The generation of a perceptual map via three-way multidimensional scaling allows analysts to see the separation of objects in Euclidean space. The MDSvarext method incorporates the objects' confidence regions in this analysis, allowing for statistical inference in the difference between objects, but the confidence regions that are generated are very large because of the inherent variability among the evaluators. One solution to this problem is cluster generation prior to the application of the MDSvarext method in order to obtain homogeneous subgroups and to achieve greater control of the variance. This work is relevant to studies of perception which usually evaluate the difference between objects or stimuli in the point of view of different people that judge this difference using several dimensions. This study investigated the possibility of using a K-means algorithm to generate subgroups before the MDSvarext method was applied, evaluating the process with two quality indicators, one Ex-Ante and one Ex-Post. The experiments were conducted based on simulation of judgment matrix of different objects in multiple dimensions being evaluated by several judges. In this experiment, the matrix used was a 10 objects, in 10 features, judged by 10 people. The results are promising as possible interpretations of the perceptual map and the indicators generated.
文摘Intravascular ultrasound (IVUS) is a new technology for the diagnosis of coronary artery disease, and for the support of coronary intervention. IVUS image segmentation often encounters difficulties when plaque and acoustic shadow are present A novel approach for hard plaque recognition and media-adventitia border detection of IVUS images is presented in this paper. The IVUS images were first enhanced by a spatial-frequency domain filter that was constructed by the directional filter and histogram equalization. Then, the hard plaque was recognized based on the intensity variation within different regions that were obtained using the k-means algorithm. In the next step, a cost matrix representing the probability of the media-adventitia border was generated by combining image gradient, plaque location and image intensity. A heuristic graph-searching was applied to find the media-adventitia border from the cost matrix.Experiment results showed that the accuracy of hard plaque recognition and media-adventitia border detection was 89.94% and 95.57%, respectively. In conclusion,using hard plaques recognition could improve media-adventitia border detection in IVUS images.
基金supported by the Hi-Tech Research and Development Program (863) of China (Nos. 2007AA01Z311 and 2007AA04Z1A5)the National Basic Research Program (973) of China (No. 2009CB32 0804)+1 种基金the National Research Foundation for the Doctoral Program of Higher Education of China (No. 20060335114)the Science and Technology Program of Zhejiang Province, China (No. 2007C21006)
文摘We propose a novel texture clustering method. A classical type of(approximate) shift invariant discrete wavelet transform(DWT),dual tree DWT,is used to decompose texture images. Multiple signatures are generated from the obtained high-frequency bands. A locality preserving approach is applied subsequently to project data from high-dimensional space to low-dimensional space. Shift invariant DWT can represent image texture information efficiently in combination with a histogram signature,and the local geometrical structure of the dataset is preserved well during clustering. Experimental results show that the proposed method remarkably outperforms traditional ones.