During the storehouse surface rolling construction of a core rockfilldam, the spreading thickness of dam face is an important factor that affects the construction quality of the dam storehouse' rolling surface and...During the storehouse surface rolling construction of a core rockfilldam, the spreading thickness of dam face is an important factor that affects the construction quality of the dam storehouse' rolling surface and the overallquality of the entire dam. Currently, the method used to monitor and controlspreading thickness during the dam construction process is artificialsampling check after spreading, which makes it difficult to monitor the entire dam storehouse surface. In this paper, we present an in-depth study based on real-time monitoring and controltheory of storehouse surface rolling construction and obtain the rolling compaction thickness by analyzing the construction track of the rolling machine. Comparatively, the traditionalmethod can only analyze the rolling thickness of the dam storehouse surface after it has been compacted and cannot determine the thickness of the dam storehouse surface in realtime. To solve these problems, our system monitors the construction progress of the leveling machine and employs a real-time spreading thickness monitoring modelbased on the K-nearest neighbor algorithm. Taking the LHK core rockfilldam in Southwest China as an example, we performed real-time monitoring for the spreading thickness and conducted real-time interactive queries regarding the spreading thickness. This approach provides a new method for controlling the spreading thickness of the core rockfilldam storehouse surface.展开更多
Existing interference protection systems lack automatic evaluation methods to provide scientific, objective and accurate assessment results. To address this issue, this paper develops a layout scheme by geometrically ...Existing interference protection systems lack automatic evaluation methods to provide scientific, objective and accurate assessment results. To address this issue, this paper develops a layout scheme by geometrically modeling the actual scene, so that the hand-held full-band spectrum analyzer would be able to collect signal field strength values for indoor complex scenes. An improved prediction algorithm based on the K-nearest neighbor non-parametric kernel regression was proposed to predict the signal field strengths for the whole plane before and after being shield. Then the highest accuracy set of data could be picked out by comparison. The experimental results show that the improved prediction algorithm based on the K-nearest neighbor non-parametric kernel regression can scientifically and objectively predict the indoor complex scenes’ signal strength and evaluate the interference protection with high accuracy.展开更多
In this study,our aim is to address the problem of gene selection by proposing a hybrid bio-inspired evolutionary algorithm that combines Grey Wolf Optimization(GWO)with Harris Hawks Optimization(HHO)for feature selec...In this study,our aim is to address the problem of gene selection by proposing a hybrid bio-inspired evolutionary algorithm that combines Grey Wolf Optimization(GWO)with Harris Hawks Optimization(HHO)for feature selection.Themotivation for utilizingGWOandHHOstems fromtheir bio-inspired nature and their demonstrated success in optimization problems.We aimto leverage the strengths of these algorithms to enhance the effectiveness of feature selection in microarray-based cancer classification.We selected leave-one-out cross-validation(LOOCV)to evaluate the performance of both two widely used classifiers,k-nearest neighbors(KNN)and support vector machine(SVM),on high-dimensional cancer microarray data.The proposed method is extensively tested on six publicly available cancer microarray datasets,and a comprehensive comparison with recently published methods is conducted.Our hybrid algorithm demonstrates its effectiveness in improving classification performance,Surpassing alternative approaches in terms of precision.The outcomes confirm the capability of our method to substantially improve both the precision and efficiency of cancer classification,thereby advancing the development ofmore efficient treatment strategies.The proposed hybridmethod offers a promising solution to the gene selection problem in microarray-based cancer classification.It improves the accuracy and efficiency of cancer diagnosis and treatment,and its superior performance compared to other methods highlights its potential applicability in realworld cancer classification tasks.By harnessing the complementary search mechanisms of GWO and HHO,we leverage their bio-inspired behavior to identify informative genes relevant to cancer diagnosis and treatment.展开更多
Compositional data, such as relative information, is a crucial aspect of machine learning and other related fields. It is typically recorded as closed data or sums to a constant, like 100%. The statistical linear mode...Compositional data, such as relative information, is a crucial aspect of machine learning and other related fields. It is typically recorded as closed data or sums to a constant, like 100%. The statistical linear model is the most used technique for identifying hidden relationships between underlying random variables of interest. However, data quality is a significant challenge in machine learning, especially when missing data is present. The linear regression model is a commonly used statistical modeling technique used in various applications to find relationships between variables of interest. When estimating linear regression parameters which are useful for things like future prediction and partial effects analysis of independent variables, maximum likelihood estimation (MLE) is the method of choice. However, many datasets contain missing observations, which can lead to costly and time-consuming data recovery. To address this issue, the expectation-maximization (EM) algorithm has been suggested as a solution for situations including missing data. The EM algorithm repeatedly finds the best estimates of parameters in statistical models that depend on variables or data that have not been observed. This is called maximum likelihood or maximum a posteriori (MAP). Using the present estimate as input, the expectation (E) step constructs a log-likelihood function. Finding the parameters that maximize the anticipated log-likelihood, as determined in the E step, is the job of the maximization (M) phase. This study looked at how well the EM algorithm worked on a made-up compositional dataset with missing observations. It used both the robust least square version and ordinary least square regression techniques. The efficacy of the EM algorithm was compared with two alternative imputation techniques, k-Nearest Neighbor (k-NN) and mean imputation (), in terms of Aitchison distances and covariance.展开更多
In this paper, a memetic algorithm with competition(MAC) is proposed to solve the capacitated green vehicle routing problem(CGVRP). Firstly, the permutation array called traveling salesman problem(TSP) route is used t...In this paper, a memetic algorithm with competition(MAC) is proposed to solve the capacitated green vehicle routing problem(CGVRP). Firstly, the permutation array called traveling salesman problem(TSP) route is used to encode the solution, and an effective decoding method to construct the CGVRP route is presented accordingly. Secondly, the k-nearest neighbor(k NN) based initialization is presented to take use of the location information of the customers. Thirdly, according to the characteristics of the CGVRP, the search operators in the variable neighborhood search(VNS) framework and the simulated annealing(SA) strategy are executed on the TSP route for all solutions. Moreover, the customer adjustment operator and the alternative fuel station(AFS) adjustment operator on the CGVRP route are executed for the elite solutions after competition. In addition, the crossover operator is employed to share information among different solutions. The effect of parameter setting is investigated using the Taguchi method of design-ofexperiment to suggest suitable values. Via numerical tests, it demonstrates the effectiveness of both the competitive search and the decoding method. Moreover, extensive comparative results show that the proposed algorithm is more effective and efficient than the existing methods in solving the CGVRP.展开更多
Whale optimization algorithm(WOA)is a new population-based meta-heuristic algorithm.WOA uses shrinking encircling mechanism,spiral rise,and random learning strategies to update whale’s positions.WOA has merit in term...Whale optimization algorithm(WOA)is a new population-based meta-heuristic algorithm.WOA uses shrinking encircling mechanism,spiral rise,and random learning strategies to update whale’s positions.WOA has merit in terms of simple calculation and high computational accuracy,but its convergence speed is slow and it is easy to fall into the local optimal solution.In order to overcome the shortcomings,this paper integrates adaptive neighborhood and hybrid mutation strategies into whale optimization algorithms,designs the average distance from itself to other whales as an adaptive neighborhood radius,and chooses to learn from the optimal solution in the neighborhood instead of random learning strategies.The hybrid mutation strategy is used to enhance the ability of algorithm to jump out of the local optimal solution.A new whale optimization algorithm(HMNWOA)is proposed.The proposed algorithm inherits the global search capability of the original algorithm,enhances the exploitation ability,improves the quality of the population,and thus improves the convergence speed of the algorithm.A feature selection algorithm based on binary HMNWOA is proposed.Twelve standard datasets from UCI repository test the validity of the proposed algorithm for feature selection.The experimental results show that HMNWOA is very competitive compared to the other six popular feature selection methods in improving the classification accuracy and reducing the number of features,and ensures that HMNWOA has strong search ability in the search feature space.展开更多
The EM algorithm is a very popular maximum likelihood estimation method, the iterative algorithm for solving the maximum likelihood estimator when the observation data is the incomplete data, but also is very effectiv...The EM algorithm is a very popular maximum likelihood estimation method, the iterative algorithm for solving the maximum likelihood estimator when the observation data is the incomplete data, but also is very effective algorithm to estimate the finite mixture model parameters. However, EM algorithm can not guarantee to find the global optimal solution, and often easy to fall into local optimal solution, so it is sensitive to the determination of initial value to iteration. Traditional EM algorithm select the initial value at random, we propose an improved method of selection of initial value. First, we use the k-nearest-neighbor method to delete outliers. Second, use the k-means to initialize the EM algorithm. Compare this method with the original random initial value method, numerical experiments show that the parameter estimation effect of the initialization of the EM algorithm is significantly better than the effect of the original EM algorithm.展开更多
Slurry electrolysis(SE),as a hydrometallurgical process,has the characteristic of a multitank series connection,which leads to various stirring conditions and a complex solid suspension state.The computational fluid d...Slurry electrolysis(SE),as a hydrometallurgical process,has the characteristic of a multitank series connection,which leads to various stirring conditions and a complex solid suspension state.The computational fluid dynamics(CFD),which requires high computing resources,and a combination with machine learning was proposed to construct a rapid prediction model for the liquid flow and solid concentration fields in a SE tank.Through scientific selection of calculation samples via orthogonal experiments,a comprehensive dataset covering a wide range of conditions was established while effectively reducing the number of simulations and providing reasonable weights for each factor.Then,a prediction model of the SE tank was constructed using the K-nearest neighbor algorithm.The results show that with the increase in levels of orthogonal experiments,the prediction accuracy of the model improved remarkably.The model established with four factors and nine levels can accurately predict the flow and concentration fields,and the regression coefficients of average velocity and solid concentration were 0.926 and 0.937,respectively.Compared with traditional CFD,the response time of field information prediction in this model was reduced from 75 h to 20 s,which solves the problem of serious lag in CFD applied alone to actual production and meets real-time production control requirements.展开更多
Vehicle type recognition(VTR)is an important research topic due to its significance in intelligent transportation systems.However,recognizing vehicle type on the real-world images is challenging due to the illuminatio...Vehicle type recognition(VTR)is an important research topic due to its significance in intelligent transportation systems.However,recognizing vehicle type on the real-world images is challenging due to the illumination change,partial occlusion under real traffic environment.These difficulties limit the performance of current state-of-art methods,which are typically based on single-stage classification without considering feature availability.To address such difficulties,this paper proposes a two-stage vehicle type recognition method combining the most effective Gabor features.The first stage leverages edge features to classify vehicles by size into big or small via a similarity k-nearest neighbor classifier(SKNNC).Further the more specific vehicle type such as bus,truck,sedan or van is recognized by the second stage classification,which leverages the most effective Gabor features extracted by a set of Gabor wavelet kernels on the partitioned key patches via a kernel sparse representation-based classifier(KSRC).A verification and correction step based on minimum residual analysis is proposed to enhance the reliability of the VTR.To improve VTR efficiency,the most effective Gabor features are selected through gray relational analysis that leverages the correlation between Gabor feature image and the original image.Experimental results demonstrate that the proposed method not only improves the accuracy of VTR but also enhances the recognition robustness to illumination change and partial occlusion.展开更多
With the advent and advancements in the wireless technologies,Wi-Fi ngerprinting-based Indoor Positioning System(IPS)has become one of the most promising solutions for localization in indoor environments.Unlike the ou...With the advent and advancements in the wireless technologies,Wi-Fi ngerprinting-based Indoor Positioning System(IPS)has become one of the most promising solutions for localization in indoor environments.Unlike the outdoor environment,the lack of line-of-sight propagation in an indoor environment keeps the interest of the researchers to develop efcient and precise positioning systems that can later be incorporated in numerous applications involving Internet of Things(IoTs)and green computing.In this paper,we have proposed a technique that combines the capabilities of multiple algorithms to overcome the complexities experienced indoors.Initially,in the database development phase,Motley Kennan propagation model is used with Hough transformation to classify,detect,and assign different attenuation factors related to the types of walls.Furthermore,important parameters for system accuracy,such as,placement and geometry of Access Points(APs)in the coverage area are also considered.New algorithm for deployment of an additional AP to an already existing infrastructure is proposed by using Genetic Algorithm(GA)coupled with Enhanced Dilution of Precision(EDOP).Moreover,classication algorithm based on k-Nearest Neighbors(k-NN)is used to nd the position of a stationary or mobile user inside the given coverage area.For k-NN to provide low localization error and reduced space dimensionality,three APs are required to be selected optimally.In this paper,we have suggested an idea to select APs based on Position Vectors(PV)as an input to the localization algorithm.Deducing from our comprehensive investigations,it is revealed that the accuracy of indoor positioning system using the proposed technique unblemished the existing solutions with signicant improvements.展开更多
The potential to save energy in existing consumer electrical appliances is very high. One of the ways to achieve energy saving and improve energy use awareness is to recognize the energy consumption of individual elec...The potential to save energy in existing consumer electrical appliances is very high. One of the ways to achieve energy saving and improve energy use awareness is to recognize the energy consumption of individual electrical appliances. To recognize the energy consumption of consumer electrical appliances, the load disaggregation methodology is utilized. Non-intrusive appliance load monitoring (NIALM) is a load disaggrega-tion methodology that disaggregates the sum of power consumption in a single point into the power consumption of individual electrical appliances. In this study, load disaggregation is performed through voltage and current waveform, known as the V-I trajectory. The classification algorithm performs cropping and image pyramid reduction of the V-I trajectory plot template images before utilizing the principal component analysis (PCA) and the k-nearest neighbor (k-NN) algorithm. The novelty of this paper is to establish a systematic approach of load disaggregation through V-I trajectory-based load signature images by utilizing a multi-stage classification algorithm methodol-ogy. The contribution of this paper is in utilizing the “k- value,” the number of closest data points to the nearest neighbor, in the k-NN algorithm to be effective in classification of electrical appliances. The results of the multi-stage classification algorithm implementation have been discussed and the idea on future work has also been proposed.展开更多
In this article,a new optimization system that uses few features to recognize locomotion with high classification accuracy is proposed.The optimization system consists of three parts.First,the features of the mixed me...In this article,a new optimization system that uses few features to recognize locomotion with high classification accuracy is proposed.The optimization system consists of three parts.First,the features of the mixed mechanical signal data are extracted from each analysis window of 200 ms after each foot contact event.Then,the Binary version of the hybrid Gray Wolf Optimization and Particle Swarm Optimization(BGWOPSO)algorithm is used to select features.And,the selected features are optimized and assigned different weights by the Biogeography-Based Optimization(BBO)algorithm.Finally,an improved K-Nearest Neighbor(KNN)classifier is employed for intention recognition.This classifier has the advantages of high accuracy,few parameters as well as low memory burden.Based on data from eight patients with transfemoral amputations,the optimization system is evaluated.The numerical results indicate that the proposed model can recognize nine daily locomotion modes(i.e.,low-,mid-,and fast-speed level-ground walking,ramp ascent/decent,stair ascent/descent,and sit/stand)by only seven features,with an accuracy of 96.66%±0.68%.As for real-time prediction on a powered knee prosthesis,the shortest prediction time is only 9.8 ms.These promising results reveal the potential of intention recognition based on the proposed system for high-level control of the prosthetic knee.展开更多
The identification of targets varies in different surge tests. A multi-color space threshold segmentation and self-learning k-nearest neighbor algorithm (k-NN) for equipment under test status identification was prop...The identification of targets varies in different surge tests. A multi-color space threshold segmentation and self-learning k-nearest neighbor algorithm (k-NN) for equipment under test status identification was proposed after using feature matching to identify equipment status had to train new patterns every time before testing. First, color space (L*a*b*, hue saturation lightness (HSL), hue saturation value (HSV)) to segment was selected according to the high luminance points ratio and white luminance points ratio of the image. Second, the unknown class sample Sr was classified by the k-NN algorithm with training set T~ according to the feature vector, which was formed from number ofpixels, eccentricity ratio, compact- ness ratio, and Euler's numbers. Last, while the classification confidence coefficient equaled k, made Sr as one sample ofpre-training set Tz'. The training set Tz increased to Tz+1 by Tz' if Tz' was saturated. In nine series of illuminant, indicator light, screen, and disturbances samples (a total of 21600 frames), the algorithm had a 98.65% identification accuracy, also selected five groups of samples to enlarge the training set from To to T5 by itself. Keywords multi-color space, k-nearest neighbor algorithm (k-NN), self-learning, surge test展开更多
This paper proposes a new cost-efficient,adaptive,and self-healing algorithm in real time that detects faults in a short period with high accuracy,even in the situations when it is difficult to detect.Rather than usin...This paper proposes a new cost-efficient,adaptive,and self-healing algorithm in real time that detects faults in a short period with high accuracy,even in the situations when it is difficult to detect.Rather than using traditional machine learning(ML)algorithms or hybrid signal processing techniques,a new framework based on an optimization enabled weighted ensemble method is developed that combines essential ML algorithms.In the proposed method,the system will select and compound appropriate ML algorithms based on Particle Swarm Optimization(PSO)weights.For this purpose,power system failures are simulated by using the PSCA D-Python co-simulation.One of the salient features of this study is that the proposed solution works on real-time raw data without using any pre-computational techniques or pre-stored information.Therefore,the proposed technique will be able to work on different systems,topologies,or data collections.The proposed fault detection technique is validated by using PSCAD-Python co-simulation on a modified and standard IEEE-14 and standard IEEE-39 bus considering network faults which are difficult to detect.展开更多
Text categorization(TC)is one of the widely studied branches of text mining and has many applications in different domains.It tries to automatically assign a text document to one of the predefined categories often by ...Text categorization(TC)is one of the widely studied branches of text mining and has many applications in different domains.It tries to automatically assign a text document to one of the predefined categories often by using machine learning(ML)techniques.Choosing the best classifier in this task is the most important step in which k-Nearest Neighbor(KNN)is widely employed as a classifier as well as several other well-known ones such as Support Vector Machine,Multinomial Naive Bayes,Logistic Regression,and so on.The KNN has been extensively used for TC tasks and is one of the oldest and simplest methods for pattern classification.Its performance crucially relies on the distance metric used to identify nearest neighbors such that the most frequently observed label among these neighbors is used to classify an unseen test instance.Hence,in this paper,a comparative analysis of the KNN classifier is performed on a subset(i.e.,R8)of the Reuters-21578 benchmark dataset for TC.Experimental results are obtained by using different distance metrics as well as recently proposed distance learning metrics under different cases where the feature model and term weighting scheme are different.Our comparative evaluation of the results shows that Bray-Curtis and Linear Discriminant Analysis(LDA)are often superior to the other metrics and work well with raw term frequency weights.展开更多
基金supported by the Innovative Research Groups of National Natural Science Foundation of China(No. 51621092)National Basic Research Program of China ("973" Program, No. 2013CB035904)National Natural Science Foundation of China (No. 51439005)
文摘During the storehouse surface rolling construction of a core rockfilldam, the spreading thickness of dam face is an important factor that affects the construction quality of the dam storehouse' rolling surface and the overallquality of the entire dam. Currently, the method used to monitor and controlspreading thickness during the dam construction process is artificialsampling check after spreading, which makes it difficult to monitor the entire dam storehouse surface. In this paper, we present an in-depth study based on real-time monitoring and controltheory of storehouse surface rolling construction and obtain the rolling compaction thickness by analyzing the construction track of the rolling machine. Comparatively, the traditionalmethod can only analyze the rolling thickness of the dam storehouse surface after it has been compacted and cannot determine the thickness of the dam storehouse surface in realtime. To solve these problems, our system monitors the construction progress of the leveling machine and employs a real-time spreading thickness monitoring modelbased on the K-nearest neighbor algorithm. Taking the LHK core rockfilldam in Southwest China as an example, we performed real-time monitoring for the spreading thickness and conducted real-time interactive queries regarding the spreading thickness. This approach provides a new method for controlling the spreading thickness of the core rockfilldam storehouse surface.
基金the National Natural Science Foundation of China under projects 61772150 and 61862012the Guangxi Key R&D Program under project AB17195025+5 种基金the Guangxi Natural Science Foundation under grants 2018GXNSFDA281054 and 2018GXNSFAA281232the National Cryptography Development Fund of China under project MMJJ20170217the Guangxi Science and Technology Base and Special Talents Program AD18281044the Innovation Project of GUET Graduate Education under project 2017YJCX46the Guangxi Young Teachers’ Basic Ability Improvement Program under Grant 2018KY0194the open program of Guangxi Key Laboratory of Cryptography and Information Security under projects GCIS201621 and GCIS201702.
文摘Existing interference protection systems lack automatic evaluation methods to provide scientific, objective and accurate assessment results. To address this issue, this paper develops a layout scheme by geometrically modeling the actual scene, so that the hand-held full-band spectrum analyzer would be able to collect signal field strength values for indoor complex scenes. An improved prediction algorithm based on the K-nearest neighbor non-parametric kernel regression was proposed to predict the signal field strengths for the whole plane before and after being shield. Then the highest accuracy set of data could be picked out by comparison. The experimental results show that the improved prediction algorithm based on the K-nearest neighbor non-parametric kernel regression can scientifically and objectively predict the indoor complex scenes’ signal strength and evaluate the interference protection with high accuracy.
基金the Deputyship for Research and Innovation,“Ministry of Education”in Saudi Arabia for funding this research(IFKSUOR3-014-3).
文摘In this study,our aim is to address the problem of gene selection by proposing a hybrid bio-inspired evolutionary algorithm that combines Grey Wolf Optimization(GWO)with Harris Hawks Optimization(HHO)for feature selection.Themotivation for utilizingGWOandHHOstems fromtheir bio-inspired nature and their demonstrated success in optimization problems.We aimto leverage the strengths of these algorithms to enhance the effectiveness of feature selection in microarray-based cancer classification.We selected leave-one-out cross-validation(LOOCV)to evaluate the performance of both two widely used classifiers,k-nearest neighbors(KNN)and support vector machine(SVM),on high-dimensional cancer microarray data.The proposed method is extensively tested on six publicly available cancer microarray datasets,and a comprehensive comparison with recently published methods is conducted.Our hybrid algorithm demonstrates its effectiveness in improving classification performance,Surpassing alternative approaches in terms of precision.The outcomes confirm the capability of our method to substantially improve both the precision and efficiency of cancer classification,thereby advancing the development ofmore efficient treatment strategies.The proposed hybridmethod offers a promising solution to the gene selection problem in microarray-based cancer classification.It improves the accuracy and efficiency of cancer diagnosis and treatment,and its superior performance compared to other methods highlights its potential applicability in realworld cancer classification tasks.By harnessing the complementary search mechanisms of GWO and HHO,we leverage their bio-inspired behavior to identify informative genes relevant to cancer diagnosis and treatment.
文摘Compositional data, such as relative information, is a crucial aspect of machine learning and other related fields. It is typically recorded as closed data or sums to a constant, like 100%. The statistical linear model is the most used technique for identifying hidden relationships between underlying random variables of interest. However, data quality is a significant challenge in machine learning, especially when missing data is present. The linear regression model is a commonly used statistical modeling technique used in various applications to find relationships between variables of interest. When estimating linear regression parameters which are useful for things like future prediction and partial effects analysis of independent variables, maximum likelihood estimation (MLE) is the method of choice. However, many datasets contain missing observations, which can lead to costly and time-consuming data recovery. To address this issue, the expectation-maximization (EM) algorithm has been suggested as a solution for situations including missing data. The EM algorithm repeatedly finds the best estimates of parameters in statistical models that depend on variables or data that have not been observed. This is called maximum likelihood or maximum a posteriori (MAP). Using the present estimate as input, the expectation (E) step constructs a log-likelihood function. Finding the parameters that maximize the anticipated log-likelihood, as determined in the E step, is the job of the maximization (M) phase. This study looked at how well the EM algorithm worked on a made-up compositional dataset with missing observations. It used both the robust least square version and ordinary least square regression techniques. The efficacy of the EM algorithm was compared with two alternative imputation techniques, k-Nearest Neighbor (k-NN) and mean imputation (), in terms of Aitchison distances and covariance.
基金supported by the National Science Fund for Distinguished Young Scholars of China(61525304)the National Natural Science Foundation of China(61873328)
文摘In this paper, a memetic algorithm with competition(MAC) is proposed to solve the capacitated green vehicle routing problem(CGVRP). Firstly, the permutation array called traveling salesman problem(TSP) route is used to encode the solution, and an effective decoding method to construct the CGVRP route is presented accordingly. Secondly, the k-nearest neighbor(k NN) based initialization is presented to take use of the location information of the customers. Thirdly, according to the characteristics of the CGVRP, the search operators in the variable neighborhood search(VNS) framework and the simulated annealing(SA) strategy are executed on the TSP route for all solutions. Moreover, the customer adjustment operator and the alternative fuel station(AFS) adjustment operator on the CGVRP route are executed for the elite solutions after competition. In addition, the crossover operator is employed to share information among different solutions. The effect of parameter setting is investigated using the Taguchi method of design-ofexperiment to suggest suitable values. Via numerical tests, it demonstrates the effectiveness of both the competitive search and the decoding method. Moreover, extensive comparative results show that the proposed algorithm is more effective and efficient than the existing methods in solving the CGVRP.
基金This work was supported by the National Natural Science Foundation of China(Grant No.2017YFC0403605 and No.11601419).
文摘Whale optimization algorithm(WOA)is a new population-based meta-heuristic algorithm.WOA uses shrinking encircling mechanism,spiral rise,and random learning strategies to update whale’s positions.WOA has merit in terms of simple calculation and high computational accuracy,but its convergence speed is slow and it is easy to fall into the local optimal solution.In order to overcome the shortcomings,this paper integrates adaptive neighborhood and hybrid mutation strategies into whale optimization algorithms,designs the average distance from itself to other whales as an adaptive neighborhood radius,and chooses to learn from the optimal solution in the neighborhood instead of random learning strategies.The hybrid mutation strategy is used to enhance the ability of algorithm to jump out of the local optimal solution.A new whale optimization algorithm(HMNWOA)is proposed.The proposed algorithm inherits the global search capability of the original algorithm,enhances the exploitation ability,improves the quality of the population,and thus improves the convergence speed of the algorithm.A feature selection algorithm based on binary HMNWOA is proposed.Twelve standard datasets from UCI repository test the validity of the proposed algorithm for feature selection.The experimental results show that HMNWOA is very competitive compared to the other six popular feature selection methods in improving the classification accuracy and reducing the number of features,and ensures that HMNWOA has strong search ability in the search feature space.
文摘The EM algorithm is a very popular maximum likelihood estimation method, the iterative algorithm for solving the maximum likelihood estimator when the observation data is the incomplete data, but also is very effective algorithm to estimate the finite mixture model parameters. However, EM algorithm can not guarantee to find the global optimal solution, and often easy to fall into local optimal solution, so it is sensitive to the determination of initial value to iteration. Traditional EM algorithm select the initial value at random, we propose an improved method of selection of initial value. First, we use the k-nearest-neighbor method to delete outliers. Second, use the k-means to initialize the EM algorithm. Compare this method with the original random initial value method, numerical experiments show that the parameter estimation effect of the initialization of the EM algorithm is significantly better than the effect of the original EM algorithm.
基金financially supported by the National Natural Science Foundation of China(No.51974018the Open Foundation of the State Key Laboratory of Process Automation in Mining and Metallurgy(No.BGRIMM-KZSKL-2022-9).
文摘Slurry electrolysis(SE),as a hydrometallurgical process,has the characteristic of a multitank series connection,which leads to various stirring conditions and a complex solid suspension state.The computational fluid dynamics(CFD),which requires high computing resources,and a combination with machine learning was proposed to construct a rapid prediction model for the liquid flow and solid concentration fields in a SE tank.Through scientific selection of calculation samples via orthogonal experiments,a comprehensive dataset covering a wide range of conditions was established while effectively reducing the number of simulations and providing reasonable weights for each factor.Then,a prediction model of the SE tank was constructed using the K-nearest neighbor algorithm.The results show that with the increase in levels of orthogonal experiments,the prediction accuracy of the model improved remarkably.The model established with four factors and nine levels can accurately predict the flow and concentration fields,and the regression coefficients of average velocity and solid concentration were 0.926 and 0.937,respectively.Compared with traditional CFD,the response time of field information prediction in this model was reduced from 75 h to 20 s,which solves the problem of serious lag in CFD applied alone to actual production and meets real-time production control requirements.
基金supported in part by the National Natural Science Foundation of China(Nos.61304205 and 61502240)the Natural Science Foundation of Jiangsu Province(BK20191401)the Innovation and Entrepreneurship Training Project of College Students(202010300290,202010300211,202010300116E).
文摘Vehicle type recognition(VTR)is an important research topic due to its significance in intelligent transportation systems.However,recognizing vehicle type on the real-world images is challenging due to the illumination change,partial occlusion under real traffic environment.These difficulties limit the performance of current state-of-art methods,which are typically based on single-stage classification without considering feature availability.To address such difficulties,this paper proposes a two-stage vehicle type recognition method combining the most effective Gabor features.The first stage leverages edge features to classify vehicles by size into big or small via a similarity k-nearest neighbor classifier(SKNNC).Further the more specific vehicle type such as bus,truck,sedan or van is recognized by the second stage classification,which leverages the most effective Gabor features extracted by a set of Gabor wavelet kernels on the partitioned key patches via a kernel sparse representation-based classifier(KSRC).A verification and correction step based on minimum residual analysis is proposed to enhance the reliability of the VTR.To improve VTR efficiency,the most effective Gabor features are selected through gray relational analysis that leverages the correlation between Gabor feature image and the original image.Experimental results demonstrate that the proposed method not only improves the accuracy of VTR but also enhances the recognition robustness to illumination change and partial occlusion.
基金The authors extend their appreciation to National University of Sciences and Technology for funding this work through Researchers Supporting Grant,National University of Sciences and Technology,Islamabad,Pakistan.
文摘With the advent and advancements in the wireless technologies,Wi-Fi ngerprinting-based Indoor Positioning System(IPS)has become one of the most promising solutions for localization in indoor environments.Unlike the outdoor environment,the lack of line-of-sight propagation in an indoor environment keeps the interest of the researchers to develop efcient and precise positioning systems that can later be incorporated in numerous applications involving Internet of Things(IoTs)and green computing.In this paper,we have proposed a technique that combines the capabilities of multiple algorithms to overcome the complexities experienced indoors.Initially,in the database development phase,Motley Kennan propagation model is used with Hough transformation to classify,detect,and assign different attenuation factors related to the types of walls.Furthermore,important parameters for system accuracy,such as,placement and geometry of Access Points(APs)in the coverage area are also considered.New algorithm for deployment of an additional AP to an already existing infrastructure is proposed by using Genetic Algorithm(GA)coupled with Enhanced Dilution of Precision(EDOP).Moreover,classication algorithm based on k-Nearest Neighbors(k-NN)is used to nd the position of a stationary or mobile user inside the given coverage area.For k-NN to provide low localization error and reduced space dimensionality,three APs are required to be selected optimally.In this paper,we have suggested an idea to select APs based on Position Vectors(PV)as an input to the localization algorithm.Deducing from our comprehensive investigations,it is revealed that the accuracy of indoor positioning system using the proposed technique unblemished the existing solutions with signicant improvements.
文摘The potential to save energy in existing consumer electrical appliances is very high. One of the ways to achieve energy saving and improve energy use awareness is to recognize the energy consumption of individual electrical appliances. To recognize the energy consumption of consumer electrical appliances, the load disaggregation methodology is utilized. Non-intrusive appliance load monitoring (NIALM) is a load disaggrega-tion methodology that disaggregates the sum of power consumption in a single point into the power consumption of individual electrical appliances. In this study, load disaggregation is performed through voltage and current waveform, known as the V-I trajectory. The classification algorithm performs cropping and image pyramid reduction of the V-I trajectory plot template images before utilizing the principal component analysis (PCA) and the k-nearest neighbor (k-NN) algorithm. The novelty of this paper is to establish a systematic approach of load disaggregation through V-I trajectory-based load signature images by utilizing a multi-stage classification algorithm methodol-ogy. The contribution of this paper is in utilizing the “k- value,” the number of closest data points to the nearest neighbor, in the k-NN algorithm to be effective in classification of electrical appliances. The results of the multi-stage classification algorithm implementation have been discussed and the idea on future work has also been proposed.
基金This research was supported in part by the National Key Research and Development Program of China under Grant 2018YFC2001300in part by the National Natural Science Foundation of China under Grant 91948302,Grant 91848204,and Grant 52021003the Project of Scientific and Technological Development Plan of Jilin Province under Grant 20220508130RC.
文摘In this article,a new optimization system that uses few features to recognize locomotion with high classification accuracy is proposed.The optimization system consists of three parts.First,the features of the mixed mechanical signal data are extracted from each analysis window of 200 ms after each foot contact event.Then,the Binary version of the hybrid Gray Wolf Optimization and Particle Swarm Optimization(BGWOPSO)algorithm is used to select features.And,the selected features are optimized and assigned different weights by the Biogeography-Based Optimization(BBO)algorithm.Finally,an improved K-Nearest Neighbor(KNN)classifier is employed for intention recognition.This classifier has the advantages of high accuracy,few parameters as well as low memory burden.Based on data from eight patients with transfemoral amputations,the optimization system is evaluated.The numerical results indicate that the proposed model can recognize nine daily locomotion modes(i.e.,low-,mid-,and fast-speed level-ground walking,ramp ascent/decent,stair ascent/descent,and sit/stand)by only seven features,with an accuracy of 96.66%±0.68%.As for real-time prediction on a powered knee prosthesis,the shortest prediction time is only 9.8 ms.These promising results reveal the potential of intention recognition based on the proposed system for high-level control of the prosthetic knee.
文摘The identification of targets varies in different surge tests. A multi-color space threshold segmentation and self-learning k-nearest neighbor algorithm (k-NN) for equipment under test status identification was proposed after using feature matching to identify equipment status had to train new patterns every time before testing. First, color space (L*a*b*, hue saturation lightness (HSL), hue saturation value (HSV)) to segment was selected according to the high luminance points ratio and white luminance points ratio of the image. Second, the unknown class sample Sr was classified by the k-NN algorithm with training set T~ according to the feature vector, which was formed from number ofpixels, eccentricity ratio, compact- ness ratio, and Euler's numbers. Last, while the classification confidence coefficient equaled k, made Sr as one sample ofpre-training set Tz'. The training set Tz increased to Tz+1 by Tz' if Tz' was saturated. In nine series of illuminant, indicator light, screen, and disturbances samples (a total of 21600 frames), the algorithm had a 98.65% identification accuracy, also selected five groups of samples to enlarge the training set from To to T5 by itself. Keywords multi-color space, k-nearest neighbor algorithm (k-NN), self-learning, surge test
文摘This paper proposes a new cost-efficient,adaptive,and self-healing algorithm in real time that detects faults in a short period with high accuracy,even in the situations when it is difficult to detect.Rather than using traditional machine learning(ML)algorithms or hybrid signal processing techniques,a new framework based on an optimization enabled weighted ensemble method is developed that combines essential ML algorithms.In the proposed method,the system will select and compound appropriate ML algorithms based on Particle Swarm Optimization(PSO)weights.For this purpose,power system failures are simulated by using the PSCA D-Python co-simulation.One of the salient features of this study is that the proposed solution works on real-time raw data without using any pre-computational techniques or pre-stored information.Therefore,the proposed technique will be able to work on different systems,topologies,or data collections.The proposed fault detection technique is validated by using PSCAD-Python co-simulation on a modified and standard IEEE-14 and standard IEEE-39 bus considering network faults which are difficult to detect.
文摘Text categorization(TC)is one of the widely studied branches of text mining and has many applications in different domains.It tries to automatically assign a text document to one of the predefined categories often by using machine learning(ML)techniques.Choosing the best classifier in this task is the most important step in which k-Nearest Neighbor(KNN)is widely employed as a classifier as well as several other well-known ones such as Support Vector Machine,Multinomial Naive Bayes,Logistic Regression,and so on.The KNN has been extensively used for TC tasks and is one of the oldest and simplest methods for pattern classification.Its performance crucially relies on the distance metric used to identify nearest neighbors such that the most frequently observed label among these neighbors is used to classify an unseen test instance.Hence,in this paper,a comparative analysis of the KNN classifier is performed on a subset(i.e.,R8)of the Reuters-21578 benchmark dataset for TC.Experimental results are obtained by using different distance metrics as well as recently proposed distance learning metrics under different cases where the feature model and term weighting scheme are different.Our comparative evaluation of the results shows that Bray-Curtis and Linear Discriminant Analysis(LDA)are often superior to the other metrics and work well with raw term frequency weights.