The K-means method is one of the most widely used clustering methods and has been implemented in many fields of science and technology. One of the major problems of the k-means algorithm is that it may produce empty c...The K-means method is one of the most widely used clustering methods and has been implemented in many fields of science and technology. One of the major problems of the k-means algorithm is that it may produce empty clusters depending on initial center vectors. Genetic Algorithms (GAs) are adaptive heuristic search algorithm based on the evolutionary principles of natural selection and genetics. This paper presents a hybrid version of the k-means algorithm with GAs that efficiently eliminates this empty cluster problem. Results of simulation experiments using several data sets prove our claim.展开更多
The imbalance of energy consumption in wireless sensor networks(WSNs)easily results in the“hot spot”problem that the sensor nodes in a particular area die due to fast energy consumption.In order to solve the“hot s...The imbalance of energy consumption in wireless sensor networks(WSNs)easily results in the“hot spot”problem that the sensor nodes in a particular area die due to fast energy consumption.In order to solve the“hot spot”problem in WSNs,we propose an unequal clustering routing algorithm based on genetic algorithm(UCR-GA).In the cluster head election phase,the fitness function is constructed based on the residual energy,density and distance between nodes and base station,and the appropriate node is selected as the cluster head.In the data transmission phase,the cluster head selects single-hop or multi-hop communication mode according to the distance to the base station.After we comprehensively consider the residual energy of the cluster head and its communication energy consumption with the base station,an appropriate relay node is selected.The designed protocal is simulated under energy homogeneous and energy heterogeneity conditions,and the results show that the proposed routing protocal can effectively balance energy consumption,prolong the life cycle of network,and is appicable to heterogeneous networks.展开更多
The dimensionality of data is increasing very rapidly,which creates challenges for most of the current mining and learning algorithms,such as large memory requirements and high computational costs.The literature inclu...The dimensionality of data is increasing very rapidly,which creates challenges for most of the current mining and learning algorithms,such as large memory requirements and high computational costs.The literature includes much research on feature selection for supervised learning.However,feature selection for unsupervised learning has only recently been studied.Finding the subset of features in unsupervised learning that enhances the performance is challenging since the clusters are indeterminate.This work proposes a hybrid technique for unsupervised feature selection called GAk-MEANS,which combines the genetic algorithm(GA)approach with the classical k-Means algorithm.In the proposed algorithm,a new fitness func-tion is designed in addition to new smart crossover and mutation operators.The effectiveness of this algorithm is demonstrated on various datasets.Fur-thermore,the performance of GAk-MEANS has been compared with other genetic algorithms,such as the genetic algorithm using the Sammon Error Function and the genetic algorithm using the Sum of Squared Error Function.Additionally,the performance of GAk-MEANS is compared with the state-of-the-art statistical unsupervised feature selection techniques.Experimental results show that GAk-MEANS consistently selects subsets of features that result in better classification accuracy compared to others.In particular,GAk-MEANS is able to significantly reduce the size of the subset of selected features by an average of 86.35%(72%–96.14%),which leads to an increase of the accuracy by an average of 3.78%(1.05%–6.32%)compared to using all features.When compared with the genetic algorithm using the Sammon Error Function,GAk-MEANS is able to reduce the size of the subset of selected features by 41.29%on average,improve the accuracy by 5.37%,and reduce the time by 70.71%.When compared with the genetic algorithm using the Sum of Squared Error Function,GAk-MEANS on average is able to reduce the size of the subset of selected features by 15.91%,and improve the accuracy by 9.81%,but the time is increased by a factor of 3.When compared with the machine-learning based methods,we observed that GAk-MEANS is able to increase the accuracy by 13.67%on average with an 88.76%average increase in time.展开更多
This paper presents a new algorithm for solving unit commitment (UC) problems using a binary-real coded genetic algorithm based on k-means clustering technique. UC is a NP-hard nonlinear mixed-integer optimization pro...This paper presents a new algorithm for solving unit commitment (UC) problems using a binary-real coded genetic algorithm based on k-means clustering technique. UC is a NP-hard nonlinear mixed-integer optimization problem, encountered as one of the toughest problems in power systems, in which some power generating units are to be scheduled in such a way that the forecasted demand is met at minimum production cost over a time horizon. In the proposed algorithm, the algorithm integrates the main features of a binary-real coded genetic algorithm (GA) and k-means clustering technique. The binary coded GA is used to obtain a feasible commitment schedule for each generating unit;while the power amounts generated by committed units are determined by using real coded GA for the feasible commitment obtained in each interval. k-means clustering algorithm divides population into a specific number of subpopulations with dynamic size. In this way, using k-means clustering algorithm allows the use of different GA operators with the whole population and avoids the local problem minima. The effectiveness of the proposed technique is validated on a test power system available in the literature. The proposed algorithm performance is found quite satisfactory in comparison with the previously reported results.展开更多
The performance of the classical clustering algorithm is not always satisfied with the high-dimensional datasets, which make clustering method limited in many application. To solve this problem, clustering method with...The performance of the classical clustering algorithm is not always satisfied with the high-dimensional datasets, which make clustering method limited in many application. To solve this problem, clustering method with Projection Pursuit dimension reduction based on Immune Clonal Selection Algorithm (ICSA-PP) is proposed in this paper. Projection pursuit strategy can maintain consistent Euclidean distances between points in the low-dimensional embeddings where the ICSA is used to search optimizing projection direction. The proposed algorithm can converge quickly with less iteration to reduce dimension of some high-dimensional datasets, and in which space, K-mean clustering algorithm is used to partition the reduced data. The experiment results on UCI data show that the presented method can search quicker to optimize projection direction than Genetic Algorithm (GA) and it has better clustering results compared with traditional linear dimension reduction method for Principle Component Analysis (PCA).展开更多
This study aimed to develop a neural network(NN)-based method to predict the long-term mean radiant temperature(MRT)around buildings by using meteorological parameters as training data.The MRT dramatically impacts bui...This study aimed to develop a neural network(NN)-based method to predict the long-term mean radiant temperature(MRT)around buildings by using meteorological parameters as training data.The MRT dramatically impacts building energy consumption and significantly affects outdoor thermal comfort.In NN-based long-term MRT prediction,two main restrictions must be overcome to achieve precise results:first,the difficulty of preparing numerous training datasets;second,the challenge of developing an accurate NN model.To overcome these restrictions,a combination of principal component analysis(PCA)and K-means clustering was employed to reduce the training data while maintaining high prediction accuracy.Second,three widely used NN models(feedforward NN(FFNN),backpropagation NN(BPNN),and BPNN optimized using a genetic algorithm(GA-BPNN))were compared to identify the NN with the best long-term MRT prediction performance.The performances of the tested NNs were evaluated using the mean absolute percentage error(MAPE),which was≤3%in each case.The findings indicate that the training dataset was reduced effectively by the PCA and K-means.Among the three NNs,the GA-BPNN produced the most accurate results,with its MAPE being below 1%.This study will contribute to the development of fast and feasible outdoor thermal environment prediction.展开更多
文摘The K-means method is one of the most widely used clustering methods and has been implemented in many fields of science and technology. One of the major problems of the k-means algorithm is that it may produce empty clusters depending on initial center vectors. Genetic Algorithms (GAs) are adaptive heuristic search algorithm based on the evolutionary principles of natural selection and genetics. This paper presents a hybrid version of the k-means algorithm with GAs that efficiently eliminates this empty cluster problem. Results of simulation experiments using several data sets prove our claim.
基金National Natural Science Foundation of China(No.61862038)Lanzhou Talent Innovation and Entrepreneurship Technology Plan Project(No.2019-RC-14)Foundation of a Hundred Youth Talents Training Program of Lanzhou Jiaotong University。
文摘The imbalance of energy consumption in wireless sensor networks(WSNs)easily results in the“hot spot”problem that the sensor nodes in a particular area die due to fast energy consumption.In order to solve the“hot spot”problem in WSNs,we propose an unequal clustering routing algorithm based on genetic algorithm(UCR-GA).In the cluster head election phase,the fitness function is constructed based on the residual energy,density and distance between nodes and base station,and the appropriate node is selected as the cluster head.In the data transmission phase,the cluster head selects single-hop or multi-hop communication mode according to the distance to the base station.After we comprehensively consider the residual energy of the cluster head and its communication energy consumption with the base station,an appropriate relay node is selected.The designed protocal is simulated under energy homogeneous and energy heterogeneity conditions,and the results show that the proposed routing protocal can effectively balance energy consumption,prolong the life cycle of network,and is appicable to heterogeneous networks.
文摘The dimensionality of data is increasing very rapidly,which creates challenges for most of the current mining and learning algorithms,such as large memory requirements and high computational costs.The literature includes much research on feature selection for supervised learning.However,feature selection for unsupervised learning has only recently been studied.Finding the subset of features in unsupervised learning that enhances the performance is challenging since the clusters are indeterminate.This work proposes a hybrid technique for unsupervised feature selection called GAk-MEANS,which combines the genetic algorithm(GA)approach with the classical k-Means algorithm.In the proposed algorithm,a new fitness func-tion is designed in addition to new smart crossover and mutation operators.The effectiveness of this algorithm is demonstrated on various datasets.Fur-thermore,the performance of GAk-MEANS has been compared with other genetic algorithms,such as the genetic algorithm using the Sammon Error Function and the genetic algorithm using the Sum of Squared Error Function.Additionally,the performance of GAk-MEANS is compared with the state-of-the-art statistical unsupervised feature selection techniques.Experimental results show that GAk-MEANS consistently selects subsets of features that result in better classification accuracy compared to others.In particular,GAk-MEANS is able to significantly reduce the size of the subset of selected features by an average of 86.35%(72%–96.14%),which leads to an increase of the accuracy by an average of 3.78%(1.05%–6.32%)compared to using all features.When compared with the genetic algorithm using the Sammon Error Function,GAk-MEANS is able to reduce the size of the subset of selected features by 41.29%on average,improve the accuracy by 5.37%,and reduce the time by 70.71%.When compared with the genetic algorithm using the Sum of Squared Error Function,GAk-MEANS on average is able to reduce the size of the subset of selected features by 15.91%,and improve the accuracy by 9.81%,but the time is increased by a factor of 3.When compared with the machine-learning based methods,we observed that GAk-MEANS is able to increase the accuracy by 13.67%on average with an 88.76%average increase in time.
文摘This paper presents a new algorithm for solving unit commitment (UC) problems using a binary-real coded genetic algorithm based on k-means clustering technique. UC is a NP-hard nonlinear mixed-integer optimization problem, encountered as one of the toughest problems in power systems, in which some power generating units are to be scheduled in such a way that the forecasted demand is met at minimum production cost over a time horizon. In the proposed algorithm, the algorithm integrates the main features of a binary-real coded genetic algorithm (GA) and k-means clustering technique. The binary coded GA is used to obtain a feasible commitment schedule for each generating unit;while the power amounts generated by committed units are determined by using real coded GA for the feasible commitment obtained in each interval. k-means clustering algorithm divides population into a specific number of subpopulations with dynamic size. In this way, using k-means clustering algorithm allows the use of different GA operators with the whole population and avoids the local problem minima. The effectiveness of the proposed technique is validated on a test power system available in the literature. The proposed algorithm performance is found quite satisfactory in comparison with the previously reported results.
基金Supported by the National Natural Science Foundation of China (No. 61003198, 60703108, 60703109, 60702062,60803098)the National High Technology Development 863 Program of China (No. 2008AA01Z125, 2009AA12Z210)+1 种基金the China Postdoctoral Science Foundation funded project (No. 20090460093)the Provincial Natural Science Foundation of Shaanxi, China (No. 2009JQ8016)
文摘The performance of the classical clustering algorithm is not always satisfied with the high-dimensional datasets, which make clustering method limited in many application. To solve this problem, clustering method with Projection Pursuit dimension reduction based on Immune Clonal Selection Algorithm (ICSA-PP) is proposed in this paper. Projection pursuit strategy can maintain consistent Euclidean distances between points in the low-dimensional embeddings where the ICSA is used to search optimizing projection direction. The proposed algorithm can converge quickly with less iteration to reduce dimension of some high-dimensional datasets, and in which space, K-mean clustering algorithm is used to partition the reduced data. The experiment results on UCI data show that the presented method can search quicker to optimize projection direction than Genetic Algorithm (GA) and it has better clustering results compared with traditional linear dimension reduction method for Principle Component Analysis (PCA).
基金This study was supported by a Grant-in-Aid for Challenging Research(Exploratory)(No.19K22004)the China Scholarship Council(No.201708430100).
文摘This study aimed to develop a neural network(NN)-based method to predict the long-term mean radiant temperature(MRT)around buildings by using meteorological parameters as training data.The MRT dramatically impacts building energy consumption and significantly affects outdoor thermal comfort.In NN-based long-term MRT prediction,two main restrictions must be overcome to achieve precise results:first,the difficulty of preparing numerous training datasets;second,the challenge of developing an accurate NN model.To overcome these restrictions,a combination of principal component analysis(PCA)and K-means clustering was employed to reduce the training data while maintaining high prediction accuracy.Second,three widely used NN models(feedforward NN(FFNN),backpropagation NN(BPNN),and BPNN optimized using a genetic algorithm(GA-BPNN))were compared to identify the NN with the best long-term MRT prediction performance.The performances of the tested NNs were evaluated using the mean absolute percentage error(MAPE),which was≤3%in each case.The findings indicate that the training dataset was reduced effectively by the PCA and K-means.Among the three NNs,the GA-BPNN produced the most accurate results,with its MAPE being below 1%.This study will contribute to the development of fast and feasible outdoor thermal environment prediction.