The distribution of data has a significant impact on classification results. When one class is scarce compared with another, data imbalance occurs, which increases outliers and noise and can greatly degrade the speed and performance of classification. To address these problems, this paper starts from the motivation and mathematical representation of classification and puts forward a new classification method based on the relationship between different classification formulations. Combining the vector characteristics of the actual problem with the choice of matrix characteristics, we first analyze ordinal regression and introduce slack variables to solve the constraint problem of the lone point. We then introduce fuzzy factors, on the basis of the support vector machine, to solve the problem of the gap between isolated points, and introduce cost control to solve the problem of sample skew. Finally, based on the bi-boundary support vector machine, a two-step weight-setting twin classifier is constructed. This helps to identify multitasks with feature-selected patterns without the need for additional optimizers, and solves the problem that large-scale classification cannot deal effectively with a very low category distribution gap.
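The "cost control" idea for sample skew can be illustrated with a minimal sketch. It is not the paper's twin classifier: it assumes the common heuristic of weighting each class by n_samples / (n_classes * class_count) and applies those weights to a plain hinge loss. The function names and the toy labels are hypothetical.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * cnt) for cls, cnt in counts.items()}


def weighted_hinge_loss(scores, labels, weights):
    """Mean hinge loss with each sample scaled by its class weight.

    Labels are +1 / -1; scores are raw decision values.
    """
    total = 0.0
    for score, label in zip(scores, labels):
        total += weights[label] * max(0.0, 1.0 - label * score)
    return total / len(labels)


# Toy data (assumed): 2 minority samples (+1) and 8 majority samples (-1).
labels = [1] * 2 + [-1] * 8
weights = balanced_class_weights(labels)
loss_at_zero = weighted_hinge_loss([0.0] * len(labels), labels, weights)
```

With these weights, a misclassified minority sample costs four times as much as a misclassified majority sample, so both classes contribute equally to the total loss.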
In this paper we apply nonlinear time series analysis to small-time-scale traffic measurement data. A prediction-based method is used to determine the embedding dimension of the traffic data. Based on the reconstructed phase space, the local support vector machine prediction method is used to predict the traffic measurement data, and a BIC-based neighbouring point selection method is used to choose the number of nearest neighbouring points for the local support vector machine regression model. The experimental results show that the local support vector machine prediction method with optimized neighbouring points can effectively predict small-time-scale traffic measurement data and can reproduce the statistical features of real traffic measurements.
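Phase-space reconstruction followed by local prediction can be sketched as follows. This is an illustration under stated assumptions, not the paper's method: the local model is a plain average over the k nearest delay vectors rather than a local support vector regression, k is fixed by hand rather than chosen by BIC, and the function names and toy series are hypothetical.

```python
import math


def delay_embed(series, dim, tau=1):
    """Build delay vectors [x[t], x[t - tau], ..., x[t - (dim - 1) * tau]]."""
    start = (dim - 1) * tau
    return [[series[t - j * tau] for j in range(dim)]
            for t in range(start, len(series))]


def local_predict(series, dim, k, tau=1):
    """Predict the next value from the k nearest neighbours in phase space.

    The query is the most recent delay vector; every earlier delay vector has
    a known successor in the series, and the prediction averages the
    successors of the k closest ones.
    """
    start = (dim - 1) * tau
    vectors = delay_embed(series, dim, tau)
    query, candidates = vectors[-1], vectors[:-1]
    ranked = sorted((math.dist(v, query), i) for i, v in enumerate(candidates))
    nearest = ranked[:k]
    # Candidate i corresponds to time start + i, so its successor is one step later.
    return sum(series[start + i + 1] for _, i in nearest) / len(nearest)


# Toy periodic series (assumed data): the value after "..., 1, 2" is always 0.
series = [0, 1, 2, 0, 1, 2, 0, 1, 2]
prediction = local_predict(series, dim=2, k=2)
```

Replacing the neighbour average with a regression model fitted only on the selected neighbours gives the "local" variant the abstract describes.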
Path loss prediction models are vital for accurately modelling signal propagation in wireless channels. Empirical and deterministic models used in path loss prediction have not produced optimal results. In this paper, we introduce machine learning algorithms to path loss prediction because they offer a flexible network architecture and can make use of extensive data. We apply support vector regression (SVR) and radial basis function (RBF) models to path loss prediction in the investigated environments. The SVR model was able to process several input parameters without introducing complexity to the network architecture, while the RBF model provides a good function approximation. Hyperparameter tuning of the machine learning models was carried out in order to achieve optimal results. The performances of the SVR and RBF models were compared and the results validated using the root-mean-squared error (RMSE). The two machine learning algorithms were also compared with the Cost-231, SUI, Egli, Freespace, and Cost-231 W-I models. The analytical models overpredicted path loss. Overall, the machine learning models predicted path loss with greater accuracy than the empirical models. The SVR model performed best across all the indices, with RMSE values of 1.378 dB, 1.4523 dB, and 2.1568 dB in rural, suburban and urban settings, respectively, and should therefore be adopted for signal propagation in the investigated environments and beyond.
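The RMSE validation step can be sketched with the free-space model as the baseline. The free-space formula used here is the standard one, FSPL(dB) = 20 log10(d_km) + 20 log10(f_MHz) + 32.44; the distances, frequency, and measured values are invented for illustration and are not taken from the paper.

```python
import math


def free_space_path_loss_db(distance_km, frequency_mhz):
    """Free-space path loss in dB for distance in km and frequency in MHz."""
    return 20.0 * math.log10(distance_km) + 20.0 * math.log10(frequency_mhz) + 32.44


def rmse(measured, predicted):
    """Root-mean-squared error between two equally long sequences."""
    if len(measured) != len(predicted) or not measured:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(m - p) ** 2 for m, p in zip(measured, predicted)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical measurement campaign at 1000 MHz (all values are assumptions).
distances_km = [1.0, 2.0, 4.0]
measured_db = [95.0, 101.5, 108.0]
baseline_db = [free_space_path_loss_db(d, 1000.0) for d in distances_km]
baseline_rmse = rmse(measured_db, baseline_db)
```

The same `rmse` call applied to a trained model's predictions gives the figure that would be compared against `baseline_rmse`.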
The computational approaches of support vector machine (SVM), support vector regression (SVR) and molecular docking have been widely used for the computation of active compounds. In this work, to improve the accuracy and reliability of prediction, the three approaches were combined to predict potential cytochrome P450 1A2 (CYP1A2) inhibitors. The accuracy of the optimal SVM qualitative model was 99.432%, 97.727%, and 91.667% for the training set, internal test set and external test set, respectively, showing that this model had high discrimination ability. The R² and mean square error of the optimal SVR quantitative model were 0.763 and 0.013 for the training set, and 0.753 and 0.056 for the test set, respectively, indicating that this SVR model has high predictive ability for the biological activities of compounds. According to the results of the SVM and SVR models, some types of descriptors were identified as essential to the bioactivity prediction of compounds, including the connectivity indices, constitutional descriptors and functional group counts. Moreover, molecular docking studies were used to reveal the binding poses and binding affinity of potential inhibitors interacting with CYP1A2. The amino acids THR124 and ASP320 could form key hydrogen bond interactions with active compounds, and the amino acids ALA317 and GLY316 could form strong hydrophobic interactions with active compounds. The models obtained above were applied to discover potential CYP1A2 inhibitors from natural products, which could predict CYP-mediated drug-drug interactions and provide useful guidance and reference for rational drug combination therapy. A set of 20 potential CYP1A2 inhibitors was obtained. Part of the results was consistent with the literature, which further indicates the accuracy of these models and the reliability of this combinatorial computation strategy.
Machine learning is currently one of the research hotspots in the field of landslide prediction. To clarify and evaluate the differences in characteristics and prediction performance of different machine learning models, Conghua District, the area most prone to landslide disasters in Guangzhou, was selected for landslide susceptibility evaluation. The evaluation factors were selected using correlation analysis and the variance inflation factor method. Landslide models were constructed with four machine learning methods, namely Logistic Regression (LR), Random Forest (RF), Support Vector Machines (SVM), and Extreme Gradient Boosting (XGB). Comparative analysis and evaluation of the models were conducted through statistical indices and receiver operating characteristic (ROC) curves. The results showed that the LR, RF, SVM, and XGB models have good predictive performance for landslide susceptibility, with area under curve (AUC) values of 0.752, 0.965, 0.996, and 0.998, respectively. The XGB model had the highest predictive ability, followed by the RF model, SVM model, and LR model. The frequency ratio (FR) accuracy of the LR, RF, SVM, and XGB models was 0.775, 0.842, 0.759, and 0.822, respectively. The RF and XGB models were superior to the LR and SVM models, indicating that ensemble algorithms have better predictive ability than a single classification algorithm in regional landslide classification problems.
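The AUC figures quoted above have a simple rank interpretation: the probability that a randomly chosen positive sample is scored above a randomly chosen negative one. A minimal sketch of that computation, with ties counted as half a win, is shown below; the labels and scores are invented and the function name is hypothetical.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison of positives and negatives."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("both classes must be present")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Toy susceptibility scores (assumed): 1 = landslide cell, 0 = stable cell.
labels = [1, 1, 0, 0]
scores = [0.9, 0.4, 0.5, 0.1]
auc = roc_auc(labels, scores)
```

The pairwise loop is quadratic in the number of samples, which is fine for illustration; a rank-based implementation would be preferable on large inventories.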
Because the reconnaissance satellite system (RSS) is one of the most important parts of the weapon system of systems (WSoS), its quantitative evaluation is indispensable during construction and application. Aiming at the problem of nonlinear effectiveness evaluation under small-sample conditions, we propose an evaluation method based on support vector regression (SVR) to address the defects of traditional methods. Because the performance of SVR is strongly influenced by the penalty factor, kernel type, and other parameters, the improved grey wolf optimizer (IGWO) is employed for parameter optimization. In the proposed IGWO algorithm, an opposition-based learning strategy is adopted to increase the probability of avoiding local optima, a mutation operator is used to escape from premature convergence, and differential convergence factors are applied to increase the rate of convergence. Numerical experiments on 14 test functions validate the applicability of the IGWO algorithm to global optimization. The index system and evaluation method are constructed based on the characteristics of RSS. To validate the proposed IGWO-SVR evaluation method, eight benchmark data sets and a combat simulation are employed to estimate the evaluation accuracy, convergence performance and computational complexity. According to the experimental results, the proposed method outperforms several prediction-based evaluation methods, which verifies its superiority and effectiveness in RSS operational effectiveness evaluation.
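The opposition-based learning strategy mentioned above can be sketched in its generic form: for a point x in the box [lower, upper], the opposite point is lower + upper - x, and the better of the two is kept. The abstract does not say how IGWO applies it, so the initialization routine, the sphere objective, and the bounds below are assumptions for illustration only.

```python
import random


def opposite(point, lower, upper):
    """Opposite point in a box-bounded search space: lower + upper - x per dimension."""
    return [lo + hi - x for x, lo, hi in zip(point, lower, upper)]


def opposition_init(objective, lower, upper, pop_size, rng):
    """Sample a random population, add all opposite points, keep the best pop_size."""
    population = [[rng.uniform(lo, hi) for lo, hi in zip(lower, upper)]
                  for _ in range(pop_size)]
    candidates = population + [opposite(p, lower, upper) for p in population]
    candidates.sort(key=objective)  # minimization: smallest objective first
    return candidates[:pop_size]


def sphere(x):
    """Simple test objective with its minimum at the origin."""
    return sum(v * v for v in x)


lower, upper = [-5.0, -5.0], [5.0, 5.0]
wolves = opposition_init(sphere, lower, upper, pop_size=6, rng=random.Random(0))
```

In a hyperparameter search, each position would instead encode SVR parameters such as the penalty factor, and the objective would be a validation error.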
Prostate cancer (PCa) symptoms are commonly confused with those of benign prostatic hyperplasia (BPH), particularly in the early stages, because the symptoms are similar, and in some instances PCa is underdiagnosed. Clinical methods have been used to diagnose PCa; however, at the full-blown stage, clinical methods usually present high risks of complicated side effects. Therefore, we propose the use of a support vector machine for early differential diagnosis of PCa (SVM-PCa-EDD). SVM was used to classify persons with and without PCa. We used the PCa dataset from the Kaggle Healthcare repository to develop and validate the SVM model for classification. The PCa dataset consisted of 250 features and one class of features. The attributes considered in this study were age, body mass index (BMI), race, family history, obesity, trouble urinating, urine stream force, blood in semen, bone pain, and erectile dysfunction. The SVM-PCa-EDD was used for preprocessing the PCa dataset, specifically dealing with class imbalance, and for dimensionality reduction. After eliminating class imbalance, the area under the receiver operating characteristic (ROC) curve (AUC) of the logistic regression (LR) model trained with the downsampled dataset was 58.4%, whereas the AUC of LR trained with the class-imbalanced dataset was 54.3%. The SVM-PCa-EDD achieved 90% accuracy, 80% sensitivity, and 80% specificity. Validation of SVM-PCa-EDD against random forest and LR showed that SVM-PCa-EDD performed better in early differential diagnosis of PCa. The proposed model can assist medical experts in the early diagnosis of PCa, particularly in resource-constrained healthcare settings, and in making further recommendations for PCa testing and treatment.
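Accuracy, sensitivity, and specificity all follow from the four cells of a binary confusion matrix. The sketch below shows the standard definitions; the labels and predictions are invented and do not reproduce the study's data, and the function name is hypothetical.

```python
def binary_metrics(y_true, y_pred):
    """Accuracy, sensitivity (true-positive rate) and specificity (true-negative rate).

    Labels use 1 for the positive class (disease present) and 0 for the negative class.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }


# Toy example (assumed data): 4 positive and 6 negative cases.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]
metrics = binary_metrics(y_true, y_pred)
```

On imbalanced data, sensitivity and specificity are usually more informative than accuracy, because accuracy is dominated by the majority class.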
In this study, we developed multiple hybrid machine-learning models to address parameter optimization limitations and enhance the spatial prediction of landslide susceptibility models. We created a geographic information system database, and our analysis results were used to prepare a landslide inventory map containing 359 landslide events identified from Google Earth, aerial photographs, and other validated sources. A support vector regression (SVR) machine-learning model was used to divide the landslide inventory into training (70%) and testing (30%) datasets. The landslide susceptibility map was produced using 14 causative factors. We applied the established gray wolf optimization (GWO) algorithm, bat algorithm (BA), and cuckoo optimization algorithm (COA) to fine-tune the parameters of the SVR model and improve its predictive accuracy. The resultant hybrid models, SVR-GWO, SVR-BA, and SVR-COA, were validated in terms of the area under curve (AUC) and root mean square error (RMSE). The AUC values for the SVR-GWO (0.733), SVR-BA (0.724), and SVR-COA (0.738) models indicate their good prediction rates for landslide susceptibility modeling. SVR-COA had the greatest accuracy, with an RMSE of 0.21687, and SVR-BA had the least accuracy, with an RMSE of 0.23046. The three optimized hybrid models outperformed the SVR model (AUC = 0.704, RMSE = 0.26689), confirming the ability of metaheuristic algorithms to improve model performance.
A multi-kernel support vector machine (SVM) model structure for nonlinear systems, together with its specific identification method, is proposed. The structure consists of an SVM with a linear kernel function followed in series by an SVM with a spline kernel function. With the help of this model, nonlinear model predictive control can be transformed into linear model predictive control, and consequently a unified analytical solution for the optimal input of multi-step-ahead predictive control can be derived. The algorithm does not require online iterative optimization, which makes it suitable for real-time control with less computation. Simulation results for a pH neutralization process and a CSTR reactor show the effectiveness and advantages of the presented algorithm.
A nonlinear-model multi-step-ahead optimizing predictive controller based on a support vector machine with a quadratic polynomial kernel function was presented. A support vector machine based predictive model was established by black-box identification, and a quadratic objective function with a receding horizon was selected to obtain the controller output. By solving a nonlinear optimization problem with an equality constraint on the model output and a boundary constraint on the controller output using the Nelder-Mead simplex direct search method, a sub-optimal control law was achieved in feature space. The effect of the controller was demonstrated on a recognized benchmark problem and a continuous stirred-tank reactor. The simulation results show that the multi-step-ahead predictive controller can be applied well to nonlinear systems, with better performance in reference trajectory tracking and disturbance rejection.
Soft sensors are widely used in industrial process control. They play an important role in improving product quality and assuring safety in production. The core of a soft sensor is the construction of the soft sensing model. A new soft sensing modeling method based on the support vector machine (SVM) is proposed. SVM is a machine learning method based on statistical learning theory and is powerful for problems characterized by small samples, nonlinearity, high dimension and local minima. The proposed method is applied to the estimation of the freezing point of light diesel oil in a distillation column. The estimated outputs of the SVM-based soft sensing model match the real values of the freezing point and follow its varying trend very well. Experimental results show that SVM provides a new and effective method for soft sensing modeling and has promising applications in industrial processes.
In this paper, we present a novel Support Vector Machine active learning algorithm for effective 3D model retrieval using the concept of relevance feedback. The proposed method learns from the most informative objects, which are marked by the user, and then creates a boundary separating the relevant models from the irrelevant ones. It needs only a small number of 3D models labelled by the user and can grasp the user's semantic knowledge rapidly and accurately. Experimental results showed that the proposed algorithm significantly improves retrieval effectiveness. Compared with four state-of-the-art query refinement schemes for 3D model retrieval, it provides superior retrieval performance after no more than two rounds of relevance feedback.
This paper proposes a semi-supervised regression model with a co-training algorithm based on the support vector machine, which was used for retrieving water quality variables from SPOT 5 remote sensing data. The model consists of two support vector regressors (SVRs). The nonlinear relationship between water quality variables and the SPOT 5 spectrum was described by the two SVRs, and a semi-supervised co-training algorithm for the SVRs was established. The model was used for retrieving the concentrations of four representative pollution indicators of the Weihe River in Shaanxi Province, China: permanganate index (CODmn), ammonia nitrogen (NH3-N), chemical oxygen demand (COD) and dissolved oxygen (DO). The spatial distribution map of those variables over a part of the Weihe River was also produced. SVR can readily implement any nonlinear mapping, and semi-supervised learning can make use of both labeled and unlabeled samples. By integrating the two SVRs and using semi-supervised learning, we provide an operational method for cases where paired samples are limited. The results show that the method performs much better than the multiple statistical regression method, can quickly provide an overall picture of water pollution conditions for management, and can be extended to hyperspectral remote sensing applications.
The support vector machine (SVM) has shown great potential in pattern recognition and regression estimation. Owing to industrial development demands, such as fermentation process modeling, improving training performance on increasingly large sample sets is an important problem. However, solving a large optimization problem is computationally intensive and memory intensive. In this paper, a geometric interpretation of SVM regression (SVR) is derived, and μ-SVM is extended to both L1-norm and L2-norm penalty SVR. Further, the Gilbert algorithm, a well-known geometric algorithm, is modified to solve SVR problems. Theoretical analysis indicates that the presented geometric algorithms for SVR training have the same convergence and almost identical computational cost as their corresponding algorithms for SVM classification. Experimental results show that the geometric methods are more efficient than conventional methods using quadratic programming and require much less memory.
Robustly stable multi-step-ahead model predictive control (MPC) based on parallel support vector machines (SVMs) with a linear kernel was proposed. First, an analytical solution for the optimal control laws of the parallel-SVM-based MPC was derived. Then the necessary and sufficient stability condition for the MPC closed loop was given according to the SVM model. Finally, a method of judging the discrepancy between the SVM model and the actual plant was presented, and the constraint sets that guarantee the stability condition remains robust to model/plant mismatch within given bounds were obtained by applying the small-gain theorem. Simulation experiments show that the proposed stability condition and robust constraint sets provide a convenient way of adjusting controller parameters to ensure a closed loop with a larger stability margin.
A new multiple models (MM) approach was proposed to model complex industrial processes by using Fuzzy Support Vector Machines (F-SVMs). Applied to a pH neutralization titration experiment, the F-SVMs MM approach not only provides satisfactory approximation and generalization properties, but also achieves superior performance to the USOCPN multiple modeling method and to the single modeling method based on standard SVMs.
Support vector-based learning methods are an important part of Computational Intelligence techniques. Recent efforts have dealt with the problem of learning from very large datasets. This paper reviews the most commonly used formulations of support vector machines for regression (SVRs), aiming to emphasize their usability in large-scale applications. We review the general concept of support vector machines (SVMs), address the state of the art in training methods for SVMs, and explain the fundamental principle of SVRs. The most common learning methods for SVRs are introduced, and linear programming-based SVR formulations are explained, emphasizing their suitability for large-scale learning. Finally, this paper also discusses some open problems and current trends.
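For reference, the fundamental principle of SVR mentioned above is usually stated as the standard ε-insensitive primal problem below. This is the textbook formulation, not necessarily the notation of the reviewed paper; the symbols (weight vector w, bias b, feature map φ, slack variables ξ and ξ*, penalty C, tube width ε) are the conventional ones.

```latex
\begin{aligned}
\min_{w,\,b,\,\xi,\,\xi^{*}} \quad & \tfrac{1}{2}\lVert w\rVert^{2} + C \sum_{i=1}^{n} \left(\xi_i + \xi_i^{*}\right) \\
\text{s.t.} \quad & y_i - \left(w^{\top}\phi(x_i) + b\right) \le \varepsilon + \xi_i, \\
& \left(w^{\top}\phi(x_i) + b\right) - y_i \le \varepsilon + \xi_i^{*}, \\
& \xi_i \ge 0,\quad \xi_i^{*} \ge 0, \qquad i = 1,\dots,n .
\end{aligned}
```

Residuals smaller than ε incur no penalty, so only samples on or outside the ε-tube become support vectors. Linear programming-based variants typically replace the quadratic regularizer with an L1-type term so that the problem can be solved by an LP solver.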
This paper proposes a design of internal model control systems for processes with delay using support vector regression (SVR). The proposed system makes full use of the excellent nonlinear estimation performance of SVR under the structural risk minimization principle. Closed-loop stability and steady-state error are analyzed in the presence of modeling errors. The simulations show that the proposed control systems achieve better control performance than those based on neural networks when the training samples are few and noisy.
To address the problems of machine learning in fault diagnosis, a diagnosis approach based on the hidden Markov model (HMM) and the support vector machine (SVM) is proposed. HMM usually describes intra-class measures well and is good at dealing with continuous dynamic signals, whereas SVM expresses inter-class differences effectively and has excellent classification ability. The approach builds on the merits of both HMM and SVM. An experiment was carried out on the transmission system of a helicopter. With features extracted from vibration signals in the gearbox, the HMM-SVM based diagnostic approach is trained and used to monitor and diagnose gearbox faults. The results show that this method achieves higher diagnostic accuracy than HMM-based and SVM-based diagnostic methods with small training samples.
In this paper, a new continuous variable called the core-ratio is defined to describe the probability that a residue lies in a binding site, replacing the previous binary (0 or 1) description of interface residues. We can therefore use the support vector machine regression method to fit the core-ratio value and predict protein binding sites. We also design a new group of physical and chemical descriptors to characterize the binding sites. The new descriptors are more effective when an averaging procedure is used. Our test shows that much better prediction results can be obtained by the support vector regression (SVR) method than by the support vector classification method.
Funding (imbalanced-data classification study): Hebei Province Key Research and Development Project (No. 20313701D); Hebei Province Key Research and Development Project (No. 19210404D); Mobile computing and universal equipment for the Beijing Key Laboratory Open Project; The National Social Science Fund of China (17AJL014); Beijing University of Posts and Telecommunications Construction of World-Class Disciplines and Characteristic Development Guidance Special Fund "Cultural Inheritance and Innovation" Project (No. 505019221); National Natural Science Foundation of China (No. U1536112); National Natural Science Foundation of China (No. 81673697); National Natural Science Foundation of China (61872046); The National Social Science Fund Key Project of China (No. 17AJL014); "Blue Fire Project" (Huizhou) University of Technology Joint Innovation Project (CXZJHZ201729); Industry-University Cooperation Cooperative Education Project of the Ministry of Education (No. 201902218004); Industry-University Cooperation Cooperative Education Project of the Ministry of Education (No. 201902024006); Industry-University Cooperation Cooperative Education Project of the Ministry of Education (No. 201901197007); Industry-University Cooperation Collaborative Education Project of the Ministry of Education (No. 201901199005); The Ministry of Education Industry-University Cooperation Collaborative Education Project (No. 201901197001); Shijiazhuang science and technology plan project (236240267A); Hebei Province key research and development plan project (20312701D).
Funding (traffic measurement prediction study): Project supported by the National Natural Science Foundation of China (Grant No 60573065), the Natural Science Foundation of Shandong Province, China (Grant No Y2007G33), and the Key Subject Research Foundation of Shandong Province, China (Grant No XTD0708).
Funding: Supported by the projects of the China Geological Survey (DD20221729, DD20190291) and the Zhuhai Urban Geological Survey (including informatization) (MZCD–2201–008).
Abstract: Machine learning is currently one of the research hotspots in the field of landslide prediction. To clarify and evaluate the differences in characteristics and predictive performance of different machine learning models, Conghua District, the most landslide-prone district in Guangzhou, was selected for landslide susceptibility evaluation. The evaluation factors were selected using correlation analysis and the variance inflation factor method. Landslide models were constructed using four machine learning methods: Logistic Regression (LR), Random Forest (RF), Support Vector Machines (SVM), and Extreme Gradient Boosting (XGB). The models were compared and evaluated using statistical indices and receiver operating characteristic (ROC) curves. The results showed that the LR, RF, SVM, and XGB models have good predictive performance for landslide susceptibility, with area under the curve (AUC) values of 0.752, 0.965, 0.996, and 0.998, respectively. The XGB model had the highest predictive ability, followed by the RF model, SVM model, and LR model. The frequency ratio (FR) accuracy of the LR, RF, SVM, and XGB models was 0.775, 0.842, 0.759, and 0.822, respectively. The RF and XGB models were superior to the LR and SVM models, indicating that ensemble algorithms have better predictive ability than a single classification algorithm in regional landslide classification problems.
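The AUC values reported above summarize how well each model ranks landslide cells above non-landslide cells. As a rough sketch, AUC can be computed as the fraction of (positive, negative) pairs that the model orders correctly, with ties counted as half. The function name `roc_auc` and the labels and scores below are made up for illustration and are not the study's data.

```python
def roc_auc(labels, scores):
    """AUC as the probability that a random positive outranks a random negative."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("both classes must be present")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # a tie counts as half a correct ordering
    return wins / (len(positives) * len(negatives))


# Hypothetical susceptibility scores: 1 = landslide, 0 = no landslide.
labels = [1, 1, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.7, 0.3, 0.6, 0.2]
auc = roc_auc(labels, scores)
print(f"AUC = {auc:.3f}")
```

The pairwise form is quadratic in the number of samples, which is fine for illustration; rank-based formulas give the same value more efficiently on large datasets.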
Funding: Supported by the National Defense Science and Technology Key Laboratory Fund of China (XM2020XT1023).
Abstract: The reconnaissance satellite system (RSS) is one of the most important parts of the weapon system of systems (WSoS), and its quantitative evaluation is indispensable during construction and application. To address nonlinear effectiveness evaluation under small-sample conditions, we propose an evaluation method based on support vector regression (SVR) that overcomes the defects of traditional methods. Because the performance of SVR is strongly influenced by the penalty factor, kernel type, and other parameters, the improved grey wolf optimizer (IGWO) is employed for parameter optimization. In the proposed IGWO algorithm, an opposition-based learning strategy is adopted to increase the probability of avoiding local optima, a mutation operator is used to escape from premature convergence, and differential convergence factors are applied to increase the rate of convergence. Numerical experiments on 14 test functions validate the applicability of the IGWO algorithm to global optimization. The index system and evaluation method are constructed based on the characteristics of RSS. To validate the proposed IGWO-SVR evaluation method, eight benchmark data sets and a combat simulation are employed to estimate the evaluation accuracy, convergence performance, and computational complexity. According to the experimental results, the proposed method outperforms several prediction-based evaluation methods, which verifies its superiority and effectiveness in RSS operational effectiveness evaluation.
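The abstract does not spell out how opposition-based learning is applied inside IGWO. The sketch below assumes the commonly used form, in which the opposite of a candidate x within bounds [lower, upper] is lower + upper - x, and the fitter half of the combined population is kept. The sphere test function and all names here are illustrative assumptions, not the authors' implementation.

```python
import random


def sphere(x):
    """Simple test function with its minimum of 0 at the origin."""
    return sum(v * v for v in x)


def opposite_point(x, lower, upper):
    """Mirror each coordinate of x within its bounds."""
    return [lo + hi - v for v, lo, hi in zip(x, lower, upper)]


def opposition_init(pop_size, lower, upper, fitness, rng):
    """Draw a random population, add its opposites, keep the best pop_size."""
    population = [
        [rng.uniform(lo, hi) for lo, hi in zip(lower, upper)]
        for _ in range(pop_size)
    ]
    opposites = [opposite_point(x, lower, upper) for x in population]
    ranked = sorted(population + opposites, key=fitness)  # lower is better
    return ranked[:pop_size]


rng = random.Random(0)
lower, upper = [-5.0, -5.0], [5.0, 5.0]
wolves = opposition_init(6, lower, upper, sphere, rng)
print(min(sphere(w) for w in wolves))
```

Evaluating each candidate together with its opposite doubles the fitness evaluations at initialization but covers the search space more evenly, which is the usual motivation for the strategy.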
Abstract: Prostate cancer (PCa) symptoms are commonly confused with those of benign prostate hyperplasia (BPH), particularly in the early stages, because the symptoms are similar, and in some instances PCa is underdiagnosed. Clinical methods have been used to diagnose PCa; however, at the full-blown stage, clinical methods usually carry high risks of complicated side effects. Therefore, we propose the use of a support vector machine for early differential diagnosis of PCa (SVM-PCa-EDD). SVM was used to classify persons with and without PCa. We used the PCa dataset from the Kaggle Healthcare repository to develop and validate the SVM model for classification. The PCa dataset consisted of 250 features and one class of features. The attributes considered in this study were age, body mass index (BMI), race, family history, obesity, trouble urinating, urine stream force, blood in semen, bone pain, and erectile dysfunction. The SVM-PCa-EDD was used for preprocessing the PCa dataset, specifically dealing with class imbalance, and for dimensionality reduction. After class imbalance was eliminated, the area under the receiver operating characteristic (ROC) curve (AUC) of the logistic regression (LR) model trained with the downsampled dataset was 58.4%, whereas the AUC of the LR model trained with the class-imbalanced dataset was 54.3%. The SVM-PCa-EDD achieved 90% accuracy, 80% sensitivity, and 80% specificity. Validation of SVM-PCa-EDD against random forest and LR showed that SVM-PCa-EDD performed better in early differential diagnosis of PCa. The proposed model can assist medical experts in the early diagnosis of PCa, particularly in resource-constrained healthcare settings, and in making further recommendations for PCa testing and treatment.
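The downsampling step mentioned above is not detailed in the abstract. A minimal sketch, assuming simple random undersampling of every class down to the size of the smallest class, could look like this. The record layout and the helper name `downsample` are hypothetical.

```python
import random


def downsample(records, label_of, rng):
    """Randomly undersample each class to the size of the smallest class."""
    by_class = {}
    for record in records:
        by_class.setdefault(label_of(record), []).append(record)
    target = min(len(rows) for rows in by_class.values())
    balanced = []
    for rows in by_class.values():
        balanced.extend(rng.sample(rows, target))  # sample without replacement
    rng.shuffle(balanced)
    return balanced


# Hypothetical records: (age, has_pca); 2 positive and 6 negative cases.
records = [(64, 1), (71, 1), (50, 0), (55, 0), (58, 0), (61, 0), (66, 0), (69, 0)]
balanced = downsample(records, lambda r: r[1], random.Random(42))
print(len(balanced))
```

Undersampling discards majority-class records, so on a small dataset it trades information for balance; whether that trade is worthwhile depends on the data at hand.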
Funding: Supported by the Basic Research Project of the Korea Institute of Geoscience and Mineral Resources (KIGAM) and the Project of Environmental Business Big Data Platform and Center Construction funded by the Ministry of Science and ICT.
Abstract: In this study, we developed multiple hybrid machine-learning models to address parameter optimization limitations and enhance the spatial prediction of landslide susceptibility models. We created a geographic information system database, and our analysis results were used to prepare a landslide inventory map containing 359 landslide events identified from Google Earth, aerial photographs, and other validated sources. A support vector regression (SVR) machine-learning model was used to divide the landslide inventory into training (70%) and testing (30%) datasets. The landslide susceptibility map was produced using 14 causative factors. We applied the established gray wolf optimization (GWO) algorithm, bat algorithm (BA), and cuckoo optimization algorithm (COA) to fine-tune the parameters of the SVR model and improve its predictive accuracy. The resultant hybrid models, SVR-GWO, SVR-BA, and SVR-COA, were validated in terms of the area under the curve (AUC) and root mean square error (RMSE). The AUC values for the SVR-GWO (0.733), SVR-BA (0.724), and SVR-COA (0.738) models indicate good prediction rates for landslide susceptibility modeling. SVR-COA had the greatest accuracy, with an RMSE of 0.21687, and SVR-BA had the least accuracy, with an RMSE of 0.23046. The three optimized hybrid models outperformed the SVR model (AUC = 0.704, RMSE = 0.26689), confirming the ability of metaheuristic algorithms to improve model performance.
Funding: Supported by the State Key Development Program for Basic Research of China (No. 2002CB312200) and the National Natural Science Foundation of China (No. 60574019).
Abstract: A multi-kernel-based support vector machine (SVM) model structure for nonlinear systems and its specific identification method are proposed. The model is composed of an SVM with a linear kernel function followed in series by an SVM with a spline kernel function. With the help of this model, nonlinear model predictive control can be transformed into linear model predictive control, and consequently a unified analytical solution for the optimal input of multi-step-ahead predictive control can be derived. Because the algorithm does not require online iterative optimization, it is suitable for real-time control with less computation. Simulation results for a pH neutralization process and a CSTR show the effectiveness and advantages of the presented algorithm.
Abstract: A nonlinear multi-step-ahead optimizing predictive controller based on a support vector machine with a quadratic polynomial kernel function is presented. A support vector machine based predictive model was established by black-box identification, and a quadratic objective function with a receding horizon was selected to obtain the controller output. By solving a nonlinear optimization problem, with an equality constraint on the model output and a boundary constraint on the controller output, using the Nelder-Mead simplex direct search method, a sub-optimal control law was obtained in feature space. The effect of the controller was demonstrated on a recognized benchmark problem and a continuous stirred-tank reactor. The simulation results show that the multi-step-ahead predictive controller can be applied well to nonlinear systems, with better performance in reference trajectory tracking and disturbance rejection.
Funding: This project is supported by the Special Foundation for Major State Basic Research of China (No. G1998030415).
Abstract: Soft sensors are widely used in industrial process control. They play an important role in improving product quality and assuring production safety. The core of a soft sensor is the construction of a soft sensing model. A new soft sensing modeling method based on the support vector machine (SVM) is proposed. SVM is a machine learning method based on statistical learning theory and is powerful for problems characterized by small samples, nonlinearity, high dimension, and local minima. The proposed method is applied to the estimation of the freezing point of light diesel oil in a distillation column. The estimated outputs of the SVM-based soft sensing model match the real values of the freezing point and follow its varying trend very well. Experimental results show that SVM provides a new, effective method for soft sensing modeling and has promising applications in industrial processes.
Funding: Supported by the National Basic Research Program (973) of China (No. 2004CB719401) and the National Research Foundation for the Doctoral Program of Higher Education of China (No. 20060003060).
Abstract: In this paper, we present a novel Support Vector Machine active learning algorithm for effective 3D model retrieval using the concept of relevance feedback. The proposed method learns from the most informative objects, which are marked by the user, and then creates a boundary separating the relevant models from the irrelevant ones. It needs only a small number of 3D models labelled by the user and can grasp the user's semantic knowledge rapidly and accurately. Experimental results show that the proposed algorithm significantly improves retrieval effectiveness. Compared with four state-of-the-art query refinement schemes for 3D model retrieval, it provides superior retrieval performance after no more than two rounds of relevance feedback.
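The abstract does not define which objects count as "most informative". A common choice in SVM active learning, assumed here, is to ask the user to label the unlabelled items whose decision value is closest to zero, that is, closest to the current boundary. The linear decision function, the weights, and the feature vectors below are all hypothetical.

```python
def decision_value(weights, bias, features):
    """Signed score of a linear decision function w . x + b."""
    return sum(w * x for w, x in zip(weights, features)) + bias


def most_informative(weights, bias, pool, k):
    """Indices of the k pool items closest to the decision boundary."""
    ranked = sorted(
        range(len(pool)),
        key=lambda i: abs(decision_value(weights, bias, pool[i])),
    )
    return ranked[:k]


# Hypothetical 2-D shape descriptors for four unlabelled 3D models.
pool = [[2.0, 0.0], [0.5, 0.4], [0.0, 3.0], [1.0, 1.2]]
to_label = most_informative([1.0, -1.0], 0.0, pool, k=2)
print(to_label)
```

In a relevance-feedback loop, the user would label the selected models, the classifier would be retrained with the enlarged labelled set, and the selection would be repeated in the next round.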
Funding: Under the auspices of the National Natural Science Foundation of China (No. 40671133) and the Fundamental Research Funds for the Central Universities (No. GK200902015).
Abstract: This paper proposes a semi-supervised regression model with a co-training algorithm based on the support vector machine, which is used for retrieving water quality variables from SPOT 5 remote sensing data. The model consists of two support vector regressors (SVRs). The nonlinear relationship between water quality variables and the SPOT 5 spectrum is described by the two SVRs, and a semi-supervised co-training algorithm for the SVRs is established. The model was used for retrieving the concentrations of four representative pollution indicators of the Weihe River in Shaanxi Province, China: permanganate index (CODmn), ammonia nitrogen (NH3-N), chemical oxygen demand (COD), and dissolved oxygen (DO). A spatial distribution map of those variables over a part of the Weihe River was also produced. SVR can readily implement nonlinear mappings, and semi-supervised learning can make use of both labeled and unlabeled samples. By integrating the two SVRs and using semi-supervised learning, we provide an operational method for cases where paired samples are limited. The results show that the method performs much better than the multiple statistical regression method, can quickly provide an overall picture of water pollution conditions for management, and can be extended to hyperspectral remote sensing applications.
Funding: Supported by the National Natural Science Foundation of China (20476007, 20676013).
Abstract: The support vector machine (SVM) has shown great potential in pattern recognition and regression estimation. Owing to the demands of industrial development, such as fermentation process modeling, improving training performance on increasingly large sample sets is an important problem. However, solving a large optimization problem is computationally and memory intensive. In this paper, a geometric interpretation of SVM regression (SVR) is derived, and μ-SVM is extended to both L1-norm and L2-norm penalty SVR. Further, the Gilbert algorithm, a well-known geometric algorithm, is modified to solve SVR problems. Theoretical analysis indicates that the presented geometric algorithms for SVR training have the same convergence and almost identical computational cost as their corresponding algorithms for SVM classification. Experimental results show that the geometric methods are more efficient than conventional methods using quadratic programming and require much less memory.
Funding: Project (2002CB312200) supported by the National Key Fundamental Research and Development Program of China; Project (60574019) supported by the National Natural Science Foundation of China.
Abstract: Robustly stable multi-step-ahead model predictive control (MPC) based on parallel support vector machines (SVMs) with a linear kernel is proposed. First, an analytical solution for the optimal control laws of the parallel-SVM-based MPC is derived. Then, the necessary and sufficient stability condition for the MPC closed loop is given according to the SVM model. Finally, a method of judging the discrepancy between the SVM model and the actual plant is presented, and the constraint sets that guarantee the stability condition remains robust to model/plant mismatch within given bounds are obtained by applying the small-gain theorem. Simulation experiments show that the proposed stability condition and robust constraint sets provide a convenient way of adjusting controller parameters to ensure a closed loop with a larger stability margin.
Funding: National High Technology Research and Development Program of China (Project 863, G2001AA413130)
Abstract: A new multiple models (MM) approach is proposed for modeling complex industrial processes using Fuzzy Support Vector Machines (F-SVMs). When applied to a pH neutralization titration experiment, the F-SVMs MM approach not only provides satisfactory approximation and generalization properties, but also achieves performance superior to both the USOCPN multiple modeling method and the single modeling method based on standard SVMs.
Abstract: Support vector-based learning methods are an important part of computational intelligence techniques. Recent efforts have addressed the problem of learning from very large datasets. This paper reviews the most commonly used formulations of support vector machines for regression (SVRs), emphasizing their usability in large-scale applications. We review the general concept of support vector machines (SVMs), survey the state of the art in training methods for SVMs, and explain the fundamental principle of SVRs. The most common learning methods for SVRs are introduced, and linear programming-based SVR formulations are explained with an emphasis on their suitability for large-scale learning. Finally, the paper discusses some open problems and current trends.
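The fundamental principle of SVR referred to above rests on the ε-insensitive loss: residuals smaller than ε cost nothing, and larger residuals are penalized linearly. The sketch below evaluates that loss and the standard quadratic primal objective, 0.5·||w||² + C·Σ loss, for a fixed linear model. It does not implement the linear programming formulations the review covers, and the data and parameter values are illustrative assumptions.

```python
def predict(weights, bias, features):
    """Linear model output w . x + b."""
    return sum(w * x for w, x in zip(weights, features)) + bias


def eps_insensitive_loss(y_true, y_pred, epsilon):
    """Zero inside the epsilon tube, linear outside it."""
    return max(0.0, abs(y_true - y_pred) - epsilon)


def svr_primal_objective(weights, bias, xs, ys, c, epsilon):
    """Regularization term plus C times the summed epsilon-insensitive loss."""
    regularizer = 0.5 * sum(w * w for w in weights)
    total_loss = sum(
        eps_insensitive_loss(y, predict(weights, bias, x), epsilon)
        for x, y in zip(xs, ys)
    )
    return regularizer + c * total_loss


# Hypothetical 1-D data evaluated with the fixed model y = 2x.
xs = [[1.0], [2.0], [3.0]]
ys = [2.05, 4.5, 6.0]
objective = svr_primal_objective([2.0], 0.0, xs, ys, c=10.0, epsilon=0.1)
print(f"objective = {objective:.3f}")
```

Only the second point lies outside the ε tube here, so it alone contributes to the loss term. Points strictly inside the tube do not become support vectors, which is what makes SVR solutions sparse.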
Abstract: This paper proposes a design of internal model control systems for processes with delay using support vector regression (SVR). The proposed system makes full use of the excellent nonlinear estimation performance of SVR under the structural risk minimization principle. Closed-loop stability and steady-state error are analyzed in the presence of modeling errors. The simulations show that the proposed control systems perform better than those based on neural networks when the training samples are small in size and noisy.
Funding: This project is supported by the National Natural Science Foundation of China (No. 50375153).
Abstract: To address the problems of machine learning in fault diagnosis, a diagnosis approach based on the hidden Markov model (HMM) and the support vector machine (SVM) is proposed. HMM usually describes intra-class similarity well and is good at dealing with continuous dynamic signals. SVM expresses inter-class differences effectively and has excellent classification ability. The approach builds on the merits of both HMM and SVM. An experiment was carried out on the transmission system of a helicopter. With features extracted from vibration signals in the gearbox, the HMM-SVM based diagnostic approach was trained and used to monitor and diagnose gearbox faults. The results show that this method achieves higher diagnostic accuracy than HMM-based and SVM-based diagnostic methods with small training samples.
Funding: Project supported by the National Natural Science Foundation of China (Grant Nos. 10674172 and 10874229).
Abstract: In this paper, a new continuous variable called the core-ratio is defined to describe the probability that a residue is in a binding site, replacing the previous binary (0 or 1) description of interface residues. This allows the support vector machine regression method to be used to fit the core-ratio value and predict protein binding sites. We also design a new group of physical and chemical descriptors to characterize the binding sites. The new descriptors, which use an averaging procedure, are more effective. Our test shows that much better prediction results can be obtained with the support vector regression (SVR) method than with the support vector classification method.