With data collection and processing now widespread, privacy-preserving machine learning has become increasingly important for addressing privacy risks to individuals. The support vector machine (SVM) is one of the most fundamental machine learning models, and privacy issues surrounding SVM classifier training have attracted growing attention. In this paper, we investigate differential privacy-compliant federated machine learning with dimensionality reduction, called FedDPDR-DPML, which greatly improves data utility while providing strong privacy guarantees. In distributed learning scenarios, multiple participants usually hold unbalanced or small amounts of data. FedDPDR-DPML therefore enables multiple participants to collaboratively learn a global model based on weighted model averaging and knowledge aggregation; the server then distributes the global model to each participant to improve local data utility. For high-dimensional data, we apply differential privacy in both the principal component analysis (PCA)-based dimensionality reduction phase and the SVM classifier training phase, which improves model accuracy while achieving strict differential privacy protection. In addition, we train differential privacy (DP)-compliant SVM classifiers by adding noise to the objective function itself, which leads to better data utility. Extensive experiments on three high-dimensional datasets demonstrate that FedDPDR-DPML achieves high accuracy while ensuring strong privacy protection.
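The weighted model averaging step named in this abstract can be illustrated with a minimal sketch. It assumes, as in common federated averaging schemes, that each participant's weight is proportional to its local sample count; the abstract does not state the exact weighting used in FedDPDR-DPML, and the function name `weighted_model_average` is hypothetical.

```python
from typing import List, Sequence


def weighted_model_average(
    local_models: Sequence[Sequence[float]],
    sample_counts: Sequence[int],
) -> List[float]:
    """Average local parameter vectors, weighting each by its sample count.

    Assumption: weights are proportional to local data size (FedAvg-style).
    """
    if not local_models or len(local_models) != len(sample_counts):
        raise ValueError("need one sample count per local model")
    total = sum(sample_counts)
    if total <= 0:
        raise ValueError("total sample count must be positive")
    dim = len(local_models[0])
    if any(len(model) != dim for model in local_models):
        raise ValueError("all local models must have the same dimension")

    # Accumulate the weighted sum coordinate by coordinate.
    global_model = [0.0] * dim
    for model, count in zip(local_models, sample_counts):
        for j, value in enumerate(model):
            global_model[j] += value * count / total
    return global_model


# Two participants holding 1 and 3 samples: the second dominates the average.
global_weights = weighted_model_average([[1.0, 2.0], [3.0, 4.0]], [1, 3])
```

Weighting by sample count is one way to keep participants with very little data from pulling the global model as strongly as larger ones, which matches the unbalanced-data setting the abstract describes.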
BACKGROUND: Research has found that the amygdala plays a significant role in the underlying pathology of major depressive disorder (MDD). However, few studies have explored machine learning-assisted diagnostic biomarkers based on amygdala functional connectivity (FC). AIM: To investigate the analysis of neuroimaging biomarkers as a streamlined approach for the diagnosis of MDD in adolescents. METHODS: Forty-four adolescents diagnosed with MDD and 43 healthy controls were enrolled in the study. Using resting-state functional magnetic resonance imaging, FC was compared between the adolescents with MDD and the healthy controls, with the bilateral amygdala serving as the seed point, followed by statistical analysis of the results. The support vector machine (SVM) method was then applied to classify functional connections in various brain regions and to evaluate the neurophysiological characteristics associated with MDD. RESULTS: Compared with the controls, and using the bilateral amygdala as the region of interest, patients with MDD showed significantly lower FC values in the left inferior temporal gyrus, bilateral calcarine, right lingual gyrus, and left superior occipital gyrus, and a higher FC value in Vermis-10. The SVM analysis revealed that the reduction in the FC value in the right lingual gyrus could effectively differentiate patients with MDD from healthy controls, achieving a diagnostic accuracy of 83.91%, sensitivity of 79.55%, specificity of 88.37%, and an area under the curve of 67.65%. CONCLUSION: An abnormal FC value in the right lingual gyrus was effective as a neuroimaging biomarker for distinguishing patients with MDD from healthy controls.
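The reported accuracy, sensitivity and specificity can be checked with a short worked example. The confusion-matrix counts below are an assumption: they are back-calculated from the reported percentages and the group sizes (44 patients, 43 controls) and are not stated in the abstract. The function name `diagnostic_metrics` is hypothetical.

```python
from typing import Dict


def diagnostic_metrics(tp: int, fn: int, tn: int, fp: int) -> Dict[str, float]:
    """Return accuracy, sensitivity and specificity as percentages."""
    sensitivity = tp / (tp + fn)            # true positive rate among patients
    specificity = tn / (tn + fp)            # true negative rate among controls
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return {
        "accuracy": round(100 * accuracy, 2),
        "sensitivity": round(100 * sensitivity, 2),
        "specificity": round(100 * specificity, 2),
    }


# Assumed counts: 35 of 44 patients and 38 of 43 controls classified correctly.
metrics = diagnostic_metrics(tp=35, fn=9, tn=38, fp=5)
```

Under these assumed counts the three values come out as 83.91, 79.55 and 88.37, which agrees with the figures in the abstract. The area under the curve cannot be recovered from a single confusion matrix and is not checked here.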
To address the problems of the traditional method of assessing particle size distribution in bench blasting, a support vector machine (SVM) regression methodology based on statistical learning theory was used to predict the mean particle size (X50) resulting from rock blast fragmentation in various mines. The database consisted of blast design parameters, explosive parameters, modulus of elasticity and in-situ block size. The seven independent input variables used in the SVM model for predicting X50 were the ratio of bench height to drilled burden (H/B), ratio of spacing to burden (S/B), ratio of burden to hole diameter (B/D), ratio of stemming to burden (T/B), powder factor (Pf), modulus of elasticity (E) and in-situ block size (XB). After training and testing on 90 sets of measured data from various mines and rock formations around the world, the trained support vector regression (SVR) model was validated on 12 further blast data sets. The SVR predictions were compared with those of artificial neural network (ANN) and multivariate regression analysis (MVRA) models, the conventional Kuznetsov method, and the measured X50 values. The proposed method shows promising results, and the prediction accuracy of the SVM model is acceptable.
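As an illustration of how the seven inputs listed above might be assembled from raw blast design values, here is a small sketch. The function name `blast_features`, the argument names and the sample numbers are all hypothetical; the only assumption about units is that the four length-type quantities forming a ratio share the same unit.

```python
from typing import List


def blast_features(
    bench_height: float,
    burden: float,
    spacing: float,
    hole_diameter: float,
    stemming: float,
    powder_factor: float,
    elastic_modulus: float,
    in_situ_block_size: float,
) -> List[float]:
    """Build the input vector [H/B, S/B, B/D, T/B, Pf, E, XB]."""
    if burden <= 0 or hole_diameter <= 0:
        raise ValueError("burden and hole diameter must be positive")
    return [
        bench_height / burden,    # H/B
        spacing / burden,         # S/B
        burden / hole_diameter,   # B/D
        stemming / burden,        # T/B
        powder_factor,            # Pf, passed through unchanged
        elastic_modulus,          # E, passed through unchanged
        in_situ_block_size,       # XB, passed through unchanged
    ]


# Illustrative values only, not taken from the study's database.
features = blast_features(10.0, 2.5, 3.0, 0.1, 2.0, 0.5, 45.0, 1.2)
```

Using ratios rather than raw lengths makes the first four inputs dimensionless, which is presumably why blasts from mines of different scale can share one regression model.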
Support vector machines (SVMs) are used for emotion recognition in Chinese speech in this paper. Both binary and multi-class discrimination are discussed. It is shown that the emotional features form a nonlinear problem in the input space, and that SVMs based on nonlinear mapping solve it more effectively than linear methods. A multi-class classifier based on SVMs with a soft decision function is constructed to classify four emotional states. Compared with the principal component analysis (PCA) method and a modified PCA method, SVMs with nonlinear kernel mapping achieve the best results in multi-class discrimination.
Support vector machines (SVMs) are combined with the artificial immune network (aiNet) to form a new hybrid ai-SVM algorithm. The algorithm is used to reduce the number of samples and the training time of SVMs on large datasets. aiNet is a method inspired by artificial immune systems (AIS) that performs automatic data compression, extracts the relevant information and retains the topology of the original sample distribution. The output of aiNet is a set of antibodies that represents the input dataset in a simplified way. The SVM model is then built on the compressed antibody network instead of the original input data. Experimental results show that the ai-SVM algorithm effectively reduces computing time and simplifies the SVM model without decreasing accuracy.
The selection of important factors in machine learning-based susceptibility assessments is crucial for obtaining reliable susceptibility results. In this study, metaheuristic optimization and feature selection techniques were applied to identify the most important input parameters for mapping debris flow susceptibility in the southern mountain area of Chengde City in Hebei Province, China, using machine learning algorithms. In total, 133 historical debris flow records and 16 related factors were selected. The support vector machine (SVM) was first used as the base classifier, and a hybrid model was then introduced in a two-step process. First, the particle swarm optimization (PSO) algorithm was employed to select the SVM model hyperparameters. Second, two feature selection algorithms, namely principal component analysis (PCA) and PSO, were integrated into the PSO-based SVM model, which generated the PCA-PSO-SVM and FS-PSO-SVM models, respectively. Three statistical metrics (accuracy, recall, and specificity) and the area under the receiver operating characteristic curve (AUC) were employed to evaluate and validate the performance of the models. The results indicated that the feature selection-based models performed best, followed by the PSO-based SVM and SVM models. Moreover, the FS-PSO-SVM model outperformed the PCA-PSO-SVM model, showing the highest AUC, accuracy, recall, and specificity values in both the training and testing processes. The selection of optimal features was found to be crucial to improving the reliability of debris flow susceptibility assessment results. The PSO algorithm was also found to be not only an effective tool for hyperparameter optimization but also a useful feature selection algorithm for improving the prediction accuracy of debris flow susceptibility with machine learning algorithms. The high and very high debris flow susceptibility zones cover approximately 38.01% of the study area, where debris flows may occur under intensive human activities and heavy rainfall events.
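The PSO-based hyperparameter selection described above can be sketched in a few lines. This is a generic, minimal particle swarm optimizer and not the authors' implementation: the inertia and acceleration constants are common textbook choices, and the quadratic `toy_cv_error` merely stands in for the cross-validated error of an SVM as a function of two hyperparameters (for example log10 C and log10 gamma). All names are hypothetical.

```python
import random
from typing import Callable, List, Sequence, Tuple


def pso_minimize(
    objective: Callable[[Sequence[float]], float],
    bounds: Sequence[Tuple[float, float]],
    n_particles: int = 20,
    n_iters: int = 100,
    inertia: float = 0.7,
    cognitive: float = 1.5,
    social: float = 1.5,
    seed: int = 0,
) -> Tuple[List[float], float]:
    """Minimize `objective` inside box `bounds` with a basic global-best PSO."""
    rng = random.Random(seed)
    dim = len(bounds)
    positions = [[rng.uniform(lo, hi) for lo, hi in bounds]
                 for _ in range(n_particles)]
    velocities = [[0.0] * dim for _ in range(n_particles)]
    best_pos = [p[:] for p in positions]              # personal bests
    best_val = [objective(p) for p in positions]
    g = min(range(n_particles), key=best_val.__getitem__)
    gbest_pos, gbest_val = best_pos[g][:], best_val[g]  # global best

    for _ in range(n_iters):
        for i in range(n_particles):
            for d, (lo, hi) in enumerate(bounds):
                r1, r2 = rng.random(), rng.random()
                velocities[i][d] = (
                    inertia * velocities[i][d]
                    + cognitive * r1 * (best_pos[i][d] - positions[i][d])
                    + social * r2 * (gbest_pos[d] - positions[i][d])
                )
                # Move the particle and clamp it to the search box.
                new_x = positions[i][d] + velocities[i][d]
                positions[i][d] = min(hi, max(lo, new_x))
            value = objective(positions[i])
            if value < best_val[i]:
                best_val[i], best_pos[i] = value, positions[i][:]
                if value < gbest_val:
                    gbest_val, gbest_pos = value, positions[i][:]
    return gbest_pos, gbest_val


def toy_cv_error(params: Sequence[float]) -> float:
    """Stand-in objective with its minimum at (1, -2); not a real SVM error."""
    log_c, log_gamma = params
    return (log_c - 1.0) ** 2 + (log_gamma + 2.0) ** 2


best_params, best_error = pso_minimize(toy_cv_error, [(-3.0, 3.0), (-5.0, 1.0)])
```

In a real pipeline the objective would train and cross-validate an SVM for each candidate pair, so the number of particles and iterations directly sets the training budget. The same loop could drive feature selection if each position encoded a feature mask, which is one plausible reading of the FS-PSO-SVM variant.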
In microarray-based cancer classification, gene selection is an important issue owing to the large number of variables, the small number of samples, and the non-linearity of the problem. It is difficult to obtain satisfactory results with conventional linear statistical methods. Recursive feature elimination based on the support vector machine (SVM-RFE) is an effective algorithm that integrates gene selection and cancer classification into a consistent framework. In this paper, we propose a new method for selecting the parameters of this algorithm when it is implemented with Gaussian-kernel SVMs: a genetic algorithm searches for a pair of optimal parameters, as a better alternative to the common practice of selecting the apparently best parameters. Fast implementation issues for this method are also discussed for pragmatic reasons. The proposed method was tested on two representative datasets, hereditary breast cancer and acute leukaemia. The experimental results indicate that the proposed method performs well in selecting genes and achieves high classification accuracy with these genes.
Support vector machines (SVMs) are a new artificial intelligence methodology derived from Vapnik's statistical learning theory, with better generalization than artificial neural networks. A C-support vector classifier-based fault diagnostic model (CBFDM), which gives the three most probable fault causes, is constructed in this paper. Five-fold cross-validation is chosen as the model selection method for CBFDM. The simulated data are generated from the PW4000-94 engine influence coefficient matrix at cruise, and the results show that the diagnostic accuracy of CBFDM is over 93% even when the standard deviation of the noise is 3 times larger than normal. This model can also be used for other diagnostic problems.
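Five-fold cross-validation, the model selection method named above, splits the data into five parts and holds each part out once for validation. The sketch below shows only the index bookkeeping; it assumes contiguous, unshuffled folds, which the abstract does not specify, and the function name `k_fold_indices` is hypothetical.

```python
from typing import List, Tuple


def k_fold_indices(n_samples: int, k: int = 5) -> List[Tuple[List[int], List[int]]]:
    """Return k (train_indices, validation_indices) pairs over range(n_samples)."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    # The first (n_samples % k) folds receive one extra sample.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0)
                  for i in range(k)]
    splits = []
    start = 0
    for size in fold_sizes:
        validation = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        splits.append((train, validation))
        start += size
    return splits


# 12 samples in 5 folds gives validation folds of sizes 3, 3, 2, 2, 2.
splits = k_fold_indices(12, k=5)
```

For model selection, each candidate setting (for example a value of the C parameter) would be trained on every `train` list and scored on the matching `validation` list, and the setting with the best average score kept. Shuffling the indices first is usually advisable when the samples are ordered by class.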
Engine spark ignition is an important source of information for diagnosing engine faults. Based on the waveform of the ignition pattern, a mechanic can use experience and handbooks to guess which engine parts may be malfunctioning. However, this manual diagnostic method is imprecise because many spark ignition patterns are very similar. A diagnosis therefore needs many trials to identify the malfunctioning parts, and the mechanic must disassemble and reassemble engine parts for verification. To tackle this problem, an intelligent diagnosis system based on ignition patterns was established. First, the captured patterns were normalized and compressed. Then the wavelet packet transform (WPT) was employed to extract representative features of the ignition patterns. Finally, a classification system was constructed using multi-class support vector machines (SVMs) and the extracted features. The classification system can identify the most likely engine fault and so reduce the number of diagnosis trials. Experimental results show that the SVM produces higher diagnosis accuracy than the traditional multilayer feedforward neural network. This is the first attempt to combine WPT and SVM to analyze ignition patterns and diagnose automotive engines.
Gears alternately mesh and detach during driving, so their working conditions change alternately and they are prone to spalling and wear. However, because of additive Gaussian measurement noise, the signal-to-noise ratio is low and fault features are difficult to extract. This study proposes an approach to gear fault classification using cumulants and support vector machines. Cumulants can eliminate additive Gaussian noise and boost the signal-to-noise ratio. The generalisation of support vector machines as classifiers, which employ the structural risk minimisation principle, is superior to that of conventional neural networks, which employ the traditional empirical risk minimisation principle. With support vector machines as the classifier and the third- and fourth-order cumulants as input, gear faults are successfully recognized. The experimental results show that the fault classification method combining cumulants with support vector machines is very effective.
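The noise-suppression property mentioned above rests on a standard fact: all cumulants of order higher than two vanish for a Gaussian process. A minimal sketch of the sample third- and fourth-order cumulants follows. It assumes the zero-lag versions, whereas the study may use other lags or slices, and the function name `zero_lag_cumulants` is hypothetical.

```python
from typing import Sequence, Tuple


def zero_lag_cumulants(signal: Sequence[float]) -> Tuple[float, float]:
    """Return the sample third- and fourth-order cumulants at zero lag.

    For a mean-removed signal x:
        c3 = E[x^3]
        c4 = E[x^4] - 3 * (E[x^2])^2
    Both are zero in expectation for Gaussian noise.
    """
    n = len(signal)
    if n == 0:
        raise ValueError("signal must not be empty")
    mean = sum(signal) / n
    centered = [x - mean for x in signal]
    m2 = sum(x ** 2 for x in centered) / n
    m3 = sum(x ** 3 for x in centered) / n
    m4 = sum(x ** 4 for x in centered) / n
    return m3, m4 - 3.0 * m2 ** 2


# A symmetric two-level signal: zero skewness, strongly non-Gaussian kurtosis.
c3, c4 = zero_lag_cumulants([-1.0, 1.0, -1.0, 1.0])
```

For this signal the second and fourth moments are both 1, so c3 is 0 and c4 is 1 - 3 = -2. Features of this kind respond to the non-Gaussian structure of a fault signature while additive Gaussian noise contributes nothing on average.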
The endpoint parameters are very important to the electric arc furnace (EAF) steel-making process, but their on-line measurement is difficult. Soft sensor technology is widely used to predict endpoint parameters. Based on an analysis of the EAF smelting process and the advantages of support vector machines, a soft sensor model for predicting the endpoint parameters was built using multiple support vector machines (MSVM). In this model, the input space was divided by subtractive clustering, and a sub-model based on LS-SVM was built in each sub-space. To decrease the correlation among the sub-models and to improve the accuracy and robustness of the model, the sub-models were combined by principal components regression. The accuracy of the soft sensor model is substantially improved. The simulation results demonstrate the practicability and efficiency of the MSVM model for EAF endpoint prediction.
Based on support vector machines, three modeling methods, i.e., white-box modeling, grey-box modeling and black-box modeling of ship manoeuvring motion in 4 degrees of freedom are investigated. With the whole-ship mathematical model for ship manoeuvring motion, in which the hydrodynamic coefficients are obtained from roll planar motion mechanism test, some zigzag tests and turning circle manoeuvres are simulated. In the white-box modeling and grey-box modeling, the training data taken every 5 s from the simulated 20°/20° zigzag test are used, while in the black-box modeling, the training data taken every 5 s from the simulated 15°/15°, 20°/20° zigzag tests and 15°, 25° turning manoeuvres are used; and the trained support vector machines are used to predict the whole 20°/20° zigzag test. Comparisons between the simulated and predicted 20°/20° zigzag tests show good predictive ability of the proposed methods. Besides, all mathematical models obtained by the proposed modeling methods are used to predict the 10°/10° zigzag test and 35° turning circle manoeuvre, and the predicted results are compared with those of simulation tests to demonstrate the good generalization performance of the mathematical models. Finally, the proposed modeling methods are analyzed and compared with each other in aspects of application conditions, prediction accuracy and computation speed. The appropriate modeling method can be chosen according to the intended use of the mathematical models and the available data needed for system identification.
Supplier selection in supply chain management (SCM) has attracted considerable research interest in recent years. Recent literature shows that neural networks achieve better performance than traditional statistical methods. However, neural networks have inherent drawbacks, such as convergence to local optima, poor generalization, and uncontrolled convergence. A relatively new machine learning technique, the support vector machine (SVM), which overcomes these drawbacks, is introduced to provide a model with better explanatory power for selecting ideal supplier partners. In practice, supplier samples are often scarce, and SVMs are well suited to training and testing on small samples. The prediction accuracies of the BPNN and SVM methods are compared for choosing appropriate suppliers. Practical examples illustrate that SVM methods are superior to BPNN.
Since the efficiency of photovoltaic (PV) power generation is closely related to the weather, many PV enterprises install weather instruments to monitor the working state of the PV power system. With the development of soft measurement technology, the instrumental method seems obsolete and involves high cost. This paper proposes a novel method for predicting weather types based on PV power data and partial meteorological data. With this method, the weather types are deduced by data analysis instead of by weather instruments. Better fault detection is obtained by using support vector machines (SVMs) and comparing the predicted and actual weather. The weather prediction model is established by a direct SVM for training multiclass predictors. Although the SVM is suitable for classification, the classification results depend on the type of kernel, the kernel parameters, and the soft margin coefficient, which are difficult to choose. In this paper, these parameters are optimized by the particle swarm optimization (PSO) algorithm so that good prediction results can be achieved. Prediction results show that this method is feasible and effective.
Condition monitoring is a very important aspect in automated manufacturing processes. Any malfunction of a machining process will deteriorate production quality and efficiency. This paper presents an application of support vector machines in grinding process monitoring. The paper starts with an overview of grinding behaviour. Grinding force is analysed through a Short Time Fourier Transform (STFT) to identify features for condition monitoring. The Support Vector Machine (SVM) methodology is introduced as a powerful tool for the classification of different wheel wear situations. After training with available signal data, the SVM is able to identify the state of a grinding process. The requirement and strategy for using SVM for grinding process monitoring is discussed, while the result of the example illustrates how effective SVMs can be in determining wheel redress-life.
The support vector machine (SVM) is a powerful machine learning method developed from statistical learning theory and is currently an active field in artificial intelligence. Because an SVM is determined by only a few support vectors, it is sensitive to noise vectors near the hyperplane. In this paper, a multi-SVM decision model (MSDM) is proposed. MSDM consists of multiple SVMs and makes decisions by synthesizing information from them. MSDM is applied to heart disease diagnosis on a UCI benchmark dataset and somewhat improves the robustness of the decision system.
A prediction control algorithm is presented based on least squares support vector machines (LS-SVM) model for a class of complex systems with strong nonlinearity. The nonlinear off-line model of the controlled plant is built by LS-SVM with radial basis function (RBF) kernel. In the process of system running, the off-line model is linearized at each sampling instant, and the generalized prediction control (GPC) algorithm is employed to implement the prediction control for the controlled plant. The obtained algorithm is applied to a boiler temperature control system with complicated nonlinearity and large time delay. The results of the experiment verify the effectiveness and merit of the algorithm.
This paper presents a new algorithm for support vector machine (SVM) training, which trains a machine based on the cluster centers of errors caused by the current machine. Experiments with various training sets show that the computation time of this new algorithm scales almost linearly with training set size, so it may be applied to much larger training sets than standard quadratic programming (QP) techniques.
Harmonic analysis, the traditional tidal forecasting method, cannot take into account the impact of noncyclical factors, and the tidal prediction model based on the BP neural network is easily limited by the amount of data. Considering the movement of celestial bodies and the fact that historical data affected by nonperiodic weather capture tidal characteristics insufficiently, a tidal prediction method based on the support vector machine (SVM) is designed, and simulation experiments are carried out using tidal data from the Xiamen, Luchaogang and Weifang tide gauges individually. The results show that the model satisfactorily predicts tides influenced by noncyclical factors. They also show that, compared with the harmonic analysis method and the BP neural network method, the proposed prediction method has faster modeling speed, higher prediction precision and stronger generalization ability.
This paper presents a novel method for radar emitter signal recognition. First, the wavelet packet transform (WPT) is introduced to extract features from radar emitter signals. Then, rough set theory is used to select the optimal feature subset with good discriminability from the original feature set, and support vector machines (SVMs) are employed to design classifiers. A large number of experimental results show that the proposed method achieves very high recognition rates for 9 radar emitter signals over a wide range of signal-to-noise ratios, and that it is feasible and valid.
Funding (FedDPDR-DPML study): Supported in part by the National Natural Science Foundation of China (Nos. 62102311, 62202377, 62272385), in part by the Natural Science Basic Research Program of Shaanxi (Nos. 2022JQ-600, 2022JM-353, 2023-JC-QN-0327), in part by the Shaanxi Distinguished Youth Project (No. 2022JC-47), in part by the Scientific Research Program Funded by Shaanxi Provincial Education Department (No. 22JK0560), in part by Distinguished Youth Talents of Shaanxi Universities, and in part by the Youth Innovation Team of Shaanxi Universities.
Funding (rock blast fragmentation study): Project 2006BAB02A02 supported by the National Key Technology R&D Program during the 11th Five-Year Plan Period of China; Project CX2011B119 supported by the Graduate Students' Research and Innovation Fund of Hunan Province, China; Project 2009ssxt230 supported by the Central South University Innovation Fund, China.
Funding: Supported by the Second Tibetan Plateau Scientific Expedition and Research Program (Grant no. 2019QZKK0904), the Natural Science Foundation of Hebei Province (Grant no. D2022403032), and the S&T Program of Hebei (Grant no. E2021403001).
Abstract: Selecting important factors is crucial to obtaining reliable results in machine learning-based susceptibility assessments. In this study, metaheuristic optimization and feature selection techniques were applied to identify the most important input parameters for mapping debris flow susceptibility with machine learning algorithms in the southern mountain area of Chengde City, Hebei Province, China. In total, 133 historical debris flow records and 16 related factors were selected. The support vector machine (SVM) was first used as the base classifier, and a hybrid model was then introduced in a two-step process. First, the particle swarm optimization (PSO) algorithm was employed to select the SVM model hyperparameters. Second, two feature selection algorithms, namely principal component analysis (PCA) and PSO, were integrated into the PSO-based SVM model, which generated the PCA-PSO-SVM and FS-PSO-SVM models, respectively. Three statistical metrics (accuracy, recall, and specificity) and the area under the receiver operating characteristic curve (AUC) were employed to evaluate and validate the performance of the models. The results indicated that the feature selection-based models exhibited the best performance, followed by the PSO-based SVM and SVM models. Moreover, the FS-PSO-SVM model outperformed the PCA-PSO-SVM model, showing the highest AUC, accuracy, recall, and specificity values in both the training and testing processes. The selection of optimal features was found to be crucial to improving the reliability of debris flow susceptibility assessment results. The PSO algorithm was found to be not only an effective tool for hyperparameter optimization but also a useful feature selection algorithm for improving the prediction accuracy of debris flow susceptibility models. The high and very high debris flow susceptibility zones cover approximately 38.01% of the study area, where debris flows may occur under intensive human activities and heavy rainfall events.
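The three statistical metrics named in this abstract (accuracy, recall and specificity) can be computed from binary labels as in the sketch below. The function name and the convention that 1 marks a debris flow sample and 0 a non-debris flow sample are assumptions made for illustration; the paper's own evaluation code is not available here.

```python
def binary_metrics(y_true, y_pred):
    """Accuracy, recall and specificity for 0/1 labels (1 = positive class)."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("label lists must be non-empty and of equal length")
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present in y_true")
    return {
        "accuracy": (tp + tn) / len(pairs),
        "recall": tp / (tp + fn),        # share of positives that were found
        "specificity": tn / (tn + fp),   # share of negatives that were found
    }
```

AUC is not included because it needs continuous decision scores rather than hard labels.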
Funding: Project supported by the National Basic Research Program (973) of China (No. 2002CB312200) and the Center for Bioinformatics Program Grant of the Harvard Center of Neurodegeneration and Repair, Harvard Medical School, Harvard University, Boston, USA
Abstract: In microarray-based cancer classification, gene selection is an important issue owing to the large number of variables, the small number of samples and the non-linearity of the problem. It is difficult to obtain satisfactory results with conventional linear statistical methods. Recursive feature elimination based on support vector machines (SVM RFE) is an effective algorithm that integrates gene selection and cancer classification into a consistent framework. In this paper, we propose a new method for selecting the parameters of this algorithm when it is implemented with Gaussian kernel SVMs: a genetic algorithm searches for a pair of optimal parameters, as a better alternative to the common practice of selecting the apparently best parameters. Fast implementation issues for this method are also discussed for pragmatic reasons. The proposed method was tested on two representative datasets, hereditary breast cancer and acute leukaemia. The experimental results indicate that the proposed method performs well in selecting genes and achieves high classification accuracies with these genes.
Abstract: Support vector machines (SVMs) are a new artificial intelligence methodology derived from Vapnik's statistical learning theory, with better generalization than artificial neural networks. A C-support vector classifier based fault diagnostic model (CBFDM), which gives the 3 most probable fault causes, is constructed in this paper. Five-fold cross validation is chosen as the model selection method for CBFDM. The simulated data are generated from the PW4000-94 engine influence coefficient matrix at cruise, and the results show that the diagnostic accuracy of CBFDM is over 93% even when the standard deviation of the noise is 3 times larger than normal. This model can also be used for other diagnostic problems.
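The five-fold cross validation mentioned in this abstract can be sketched as an index split. This is a generic illustration, not the paper's procedure: the function name is hypothetical, folds are contiguous, and no shuffling is done, so samples would normally be shuffled beforehand.

```python
def k_fold_indices(n_samples, k=5):
    """Yield (train_indices, validation_indices) for each of k folds."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must satisfy 2 <= k <= n_samples")
    # The first (n_samples % k) folds get one extra sample
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0)
                  for i in range(k)]
    start = 0
    for size in fold_sizes:
        stop = start + size
        validation = list(range(start, stop))
        train = list(range(0, start)) + list(range(stop, n_samples))
        yield train, validation
        start = stop
```

For model selection, each candidate parameter setting would be trained on `train` and scored on `validation` for all five folds, and the setting with the best average score kept.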
Funding: Supported by University of Macao Research Grant, China (Grant No. RG057/08-09S/VCM/FST, Grant No. UL011/09-Y1/EME/WPK01/FST)
Abstract: Engine spark ignition is an important source of information for diagnosing engine faults. Based on the waveform of the ignition pattern, a mechanic can use experience and handbooks to guess which engine parts may be malfunctioning. However, this manual diagnostic method is imprecise because many spark ignition patterns are very similar. A diagnosis therefore needs many trials to identify the malfunctioning parts, and the mechanic must disassemble and reassemble engine parts for verification. To tackle this problem, an intelligent diagnosis system based on ignition patterns was established. First, the captured patterns were normalized and compressed. Then the wavelet packet transform (WPT) was employed to extract representative features of the ignition patterns. Finally, a classification system was constructed using multi-class support vector machines (SVMs) and the extracted features. The classification system can intelligently identify the most likely engine fault and thus reduce the number of diagnosis trials. Experimental results show that the SVM achieves higher diagnosis accuracy than the traditional multilayer feedforward neural network. This is the first attempt to combine WPT and SVM to analyze ignition patterns and diagnose automotive engines.
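The wavelet packet feature extraction step can be sketched as follows. The abstract does not name the wavelet or the features, so this sketch assumes the Haar wavelet and uses the energy of each terminal node as the feature; both choices, and the function names, are illustrative only.

```python
import math


def haar_step(x):
    """Split a sequence of even length into Haar approximation and detail."""
    s = math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) / s for i in range(0, len(x), 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, len(x), 2)]
    return approx, detail


def wavelet_packet_energies(signal, levels):
    """Energy of each of the 2**levels terminal wavelet packet nodes.

    Nodes are returned in natural (filter-bank) order, not frequency order.
    """
    if levels < 0 or not signal or len(signal) % (2 ** levels) != 0:
        raise ValueError("signal length must be a positive multiple of 2**levels")
    nodes = [list(signal)]
    for _ in range(levels):
        next_nodes = []
        for node in nodes:
            # Unlike the plain wavelet transform, detail nodes are split too
            approx, detail = haar_step(node)
            next_nodes.extend([approx, detail])
        nodes = next_nodes
    return [sum(v * v for v in node) for node in nodes]
```

Because the Haar step is orthonormal, the node energies sum to the energy of the input signal, which gives a quick sanity check on the decomposition.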
Funding: This project is supported by the 95 Pandeng Preselect Project (No. PD9521908) and the 973 Project (No. G199802320).
Abstract: Gears alternately mesh and detach while driving, so their working conditions change continually and they are prone to spalling and wear. Because of additive Gaussian measurement noise, the signal-to-noise ratio is low and the fault features are difficult to extract. This study proposes an approach to gear fault classification that uses cumulants and support vector machines. The cumulants eliminate the additive Gaussian noise and boost the signal-to-noise ratio. As classifiers, support vector machines apply the structural risk minimisation principle and therefore generalise better than conventional neural networks, which apply the traditional empirical risk minimisation principle. With support vector machines as the classifier and the third- and fourth-order cumulants as input, gear faults are successfully recognized. The experimental results show that fault classification combining cumulants with support vector machines is very effective.
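The role of the cumulants can be illustrated with sample estimates at zero lag. For a Gaussian signal, all cumulants above second order vanish, which is why additive Gaussian noise contributes little to them. The sketch below is an assumption-laden simplification: the abstract does not say which lags or estimators the authors used, and the function name is hypothetical.

```python
def zero_lag_cumulants(x):
    """Sample third- and fourth-order cumulants of x at zero lag."""
    n = len(x)
    if n == 0:
        raise ValueError("x must not be empty")
    mean = sum(x) / n
    centered = [v - mean for v in x]
    m2 = sum(v ** 2 for v in centered) / n  # second central moment
    m3 = sum(v ** 3 for v in centered) / n  # third central moment
    m4 = sum(v ** 4 for v in centered) / n  # fourth central moment
    c3 = m3                   # third-order cumulant equals the third central moment
    c4 = m4 - 3.0 * m2 ** 2   # fourth-order cumulant removes the Gaussian part
    return c3, c4
```

The pair `(c3, c4)`, computed per vibration record, is the kind of low-dimensional input a classifier could take.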
Funding: Sponsored by the National Natural Science Foundation of China (60374003)
Abstract: The endpoint parameters are very important to the EAF steel-making process, but they are difficult to measure on-line. Soft sensor technology is widely used to predict endpoint parameters. Based on an analysis of the EAF smelting process and the advantages of support vector machines, a soft sensor model for predicting the endpoint parameters was built using multiple support vector machines (MSVM). In this model, the input space was divided by subtractive clustering and a sub-model based on LS-SVM was built in each sub-space. To decrease the correlation among the sub-models and to improve the accuracy and robustness of the model, the sub-models were combined by principal components regression. The accuracy of the soft sensor model is significantly improved. The simulation results demonstrate the practicability and efficiency of the MSVM model for EAF endpoint prediction.
Funding: Financially supported by the National Natural Science Foundation of China (Grant No. 51279106) and the Special Research Fund for the Doctoral Program of Higher Education of China (Grant No. 20110073110009)
Abstract: Three methods based on support vector machines for modeling ship manoeuvring motion in 4 degrees of freedom are investigated: white-box, grey-box and black-box modeling. Zigzag tests and turning circle manoeuvres are simulated with the whole-ship mathematical model for ship manoeuvring motion, in which the hydrodynamic coefficients are obtained from a roll planar motion mechanism test. In white-box and grey-box modeling, training data taken every 5 s from the simulated 20°/20° zigzag test are used, while in black-box modeling, training data taken every 5 s from the simulated 15°/15° and 20°/20° zigzag tests and the 15° and 25° turning manoeuvres are used. The trained support vector machines are then used to predict the whole 20°/20° zigzag test. Comparisons between the simulated and predicted 20°/20° zigzag tests show the good predictive ability of the proposed methods. In addition, all mathematical models obtained by the proposed methods are used to predict the 10°/10° zigzag test and the 35° turning circle manoeuvre, and the predictions are compared with the simulation results to demonstrate the good generalization performance of the models. Finally, the proposed modeling methods are analyzed and compared in terms of application conditions, prediction accuracy and computation speed. The appropriate modeling method can be chosen according to the intended use of the mathematical model and the data available for system identification.
Abstract: Supplier selection in supply chain management (SCM) has attracted considerable research interest in recent years. Recent literature shows that neural networks achieve better performance than traditional statistical methods. However, neural networks have inherent drawbacks, such as locally optimal solutions, poor generalization and uncontrolled convergence. The support vector machine (SVM), a relatively new machine learning technique that overcomes these drawbacks, is introduced to provide a model with better explanatory power for selecting ideal supplier partners. In practice, supplier samples are often scarce, and SVMs are well suited to training and testing on small samples. The prediction accuracies of the BPNN and SVM methods are compared in choosing appropriate suppliers. Practical examples illustrate that SVM methods are superior to BPNN.
Funding: Supported by the National Natural Science Foundation of China (61433004, 61473069), the IAPI Fundamental Research Funds (2013ZCX14), the Development Project of Key Laboratory of Liaoning Province, and the Enterprise Postdoctoral Fund Projects of Liaoning Province
Abstract: Since the efficiency of photovoltaic (PV) power is closely related to the weather, many PV enterprises install weather instruments to monitor the working state of the PV power system. With the development of soft measurement technology, the instrumental method seems obsolete and involves high cost. This paper proposes a novel method for predicting the type of weather from PV power data and partial meteorological data. With this method, the weather types are deduced by data analysis instead of weather instruments. Better fault detection is obtained by using support vector machines (SVMs) and comparing the predicted and actual weather. The weather prediction model is established by a direct SVM for training multiclass predictors. Although SVMs are suitable for classification, the results depend on the type of kernel, the kernel parameters and the soft margin coefficient, which are difficult to choose. In this paper, these parameters are optimized by the particle swarm optimization (PSO) algorithm in order to achieve good prediction results. The prediction results show that this method is feasible and effective.
Abstract: Condition monitoring is a very important aspect of automated manufacturing processes. Any malfunction of a machining process degrades production quality and efficiency. This paper presents an application of support vector machines to grinding process monitoring. The paper starts with an overview of grinding behaviour. Grinding force is analysed with the Short Time Fourier Transform (STFT) to identify features for condition monitoring. The Support Vector Machine (SVM) methodology is introduced as a powerful tool for classifying different wheel wear situations. After training with the available signal data, the SVM is able to identify the state of a grinding process. The requirements and strategy for using SVMs in grinding process monitoring are discussed, and the results of the example illustrate how effective SVMs can be in determining wheel redress life.
Funding: Special Funds for Major State Basic Research of China (Project 973, G1998030415)
Abstract: The support vector machine (SVM) is a powerful machine learning method developed from statistical learning theory and is currently an active field in artificial intelligence. An SVM is sensitive to noise vectors near the hyperplane, since the hyperplane is determined by only a few support vectors. In this paper, a multi-SVM decision model (MSDM) is proposed. MSDM consists of multiple SVMs and makes decisions by synthesizing the information from them. MSDM is applied to heart disease diagnosis on a UCI benchmark data set and somewhat improves the robustness of the decision system.
Funding: This work has been supported by the National Outstanding Youth Science Foundation of China (No. 60025308) and the Teaching and Research Award Program for Outstanding Young Teachers in Higher Education Institutions of MOE, China.
Abstract: A predictive control algorithm based on a least squares support vector machine (LS-SVM) model is presented for a class of complex systems with strong nonlinearity. The nonlinear off-line model of the controlled plant is built by LS-SVM with a radial basis function (RBF) kernel. While the system is running, the off-line model is linearized at each sampling instant, and the generalized predictive control (GPC) algorithm is employed to control the plant. The resulting algorithm is applied to a boiler temperature control system with complicated nonlinearity and large time delay. The experimental results verify the effectiveness and merit of the algorithm.
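For reference, the standard LS-SVM regression formulation with an RBF kernel is summarized below. This is the textbook form, given as background; the abstract does not state the exact variant, the regularization constant \gamma or the kernel width \sigma used by the authors, so those symbols are assumptions.

```latex
% LS-SVM regression on training pairs (x_i, y_i), i = 1, ..., N
\min_{w,\,b,\,e}\ \tfrac{1}{2}\, w^{\top} w + \tfrac{\gamma}{2} \sum_{i=1}^{N} e_i^{2}
\quad \text{s.t.} \quad y_i = w^{\top} \varphi(x_i) + b + e_i

% Eliminating w and e leaves a linear system in (b, \alpha)
\begin{bmatrix} 0 & \mathbf{1}^{\top} \\ \mathbf{1} & \Omega + \gamma^{-1} I \end{bmatrix}
\begin{bmatrix} b \\ \alpha \end{bmatrix}
=
\begin{bmatrix} 0 \\ y \end{bmatrix},
\qquad
\Omega_{ij} = K(x_i, x_j) = \exp\!\left( -\frac{\lVert x_i - x_j \rVert^{2}}{2\sigma^{2}} \right)

% Resulting nonlinear model
\hat{y}(x) = \sum_{i=1}^{N} \alpha_i\, K(x, x_i) + b
```

Because training reduces to solving one linear system rather than a quadratic program, the model is cheap to fit off-line, and the kernel expansion is differentiable, which is what makes linearization at each sampling instant possible.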
Abstract: This paper presents a new algorithm for support vector machine (SVM) training, which trains a machine on the cluster centers of the errors made by the current machine. Experiments with various training sets show that the computation time of the new algorithm scales almost linearly with training set size, so that, in comparison with standard quadratic programming (QP) techniques, it may be applied to much larger training sets.
Funding: The Shanghai Committee of Science and Technology of China under contract No. 10510502800; the Graduate Student Education Innovation Program Foundation of Shanghai Municipal Education Commission of China; the National Key Science Foundation Research "973" Project of the Ministry of Science and Technology of China under contract No. 2012CB316200
Abstract: Harmonic analysis, the traditional tidal forecasting method, cannot take into account the impact of noncyclical factors, and tidal prediction models based on the BP neural network are easily limited by the amount of data. Taking into account the movement of celestial bodies and the fact that historical data affected by nonperiodic weather carry insufficient tidal characteristics, a tidal prediction method based on the support vector machine (SVM) is designed, and simulation experiments are carried out using tidal data from the Xiamen, Luchaogang and Weifang tide gauges individually. The results show that the model satisfactorily predicts tides influenced by noncyclical factors. They also show that, compared with the harmonic analysis method and the BP neural network method, the proposed method has faster modeling speed, higher prediction precision and stronger generalization ability.
Abstract: This paper presents a novel method for radar emitter signal recognition. First, the wavelet packet transform (WPT) is used to extract features from radar emitter signals. Then, rough set theory is used to select the optimal feature subset with good discriminability from the original feature set, and support vector machines (SVMs) are employed to design the classifiers. A large number of experimental results show that the proposed method achieves very high recognition rates for 9 radar emitter signals over a wide range of signal-to-noise ratios and is a feasible and valid method.