Robustly stable multi-step-ahead model predictive control (MPC) based on parallel support vector machines (SVMs) with linear kernel was proposed. First, an analytical solution of optimal control laws of parallel SVMs ...Robustly stable multi-step-ahead model predictive control (MPC) based on parallel support vector machines (SVMs) with linear kernel was proposed. First, an analytical solution of optimal control laws of parallel SVMs based MPC was derived, and then the necessary and sufficient stability condition for MPC closed loop was given according to SVM model, and finally a method of judging the discrepancy between SVM model and the actual plant was presented, and consequently the constraint sets, which can guarantee that the stability condition is still robust for model/plant mismatch within some given bounds, were obtained by applying small-gain theorem. Simulation experiments show the proposed stability condition and robust constraint sets can provide a convenient way of adjusting controller parameters to ensure a closed-loop with larger stable margin.展开更多
当前动态数据流下的实时分类问题存在3个难点:针对海量数据的实时处理;概念漂移的跟踪和模型的更新;模型的稳定和鲁棒性.针对上述问题,将极端支持向量机(extreme support vector machine,ESVM)与MapReduce框架结合,提出了带遗忘因子的鲁...当前动态数据流下的实时分类问题存在3个难点:针对海量数据的实时处理;概念漂移的跟踪和模型的更新;模型的稳定和鲁棒性.针对上述问题,将极端支持向量机(extreme support vector machine,ESVM)与MapReduce框架结合,提出了带遗忘因子的鲁棒ESVM算法.该方法通过构造残差权重矩阵,对残差进行修正,同时加入遗忘因子,提高新样本的作用,从而实现对海量数据处理问题的求解.实验结果显示,所提出方法能够快速有效地对动态数据流进行分类,且结果不易受到噪声干扰,稳定性强.展开更多
A dynamic parallel forecasting model is proposed, which is based on the problem of current forecasting models and their combined model. According to the process of the model, the fuzzy C-means clustering algorithm is ...A dynamic parallel forecasting model is proposed, which is based on the problem of current forecasting models and their combined model. According to the process of the model, the fuzzy C-means clustering algorithm is improved in outliers operation and distance in the clusters and among the clusters. Firstly, the input data sets are optimized and their coherence is ensured, the region scale algorithm is modified and non-isometric multi scale region fuzzy time series model is built. At the same time, the particle swarm optimization algorithm about the particle speed, location and inertia weight value is improved, this method is used to optimize the parameters of support vector machine, construct the combined forecast model, build the dynamic parallel forecast model, and calculate the dynamic weight values and regard the product of the weight value and forecast value to be the final forecast values. At last, the example shows the improved forecast model is effective and accurate.展开更多
Support vector machine (SVM) was introduced to analyze the reliability of the implicit performance function, which is difficult to implement by the classical methods such as the first order reliability method (FORM...Support vector machine (SVM) was introduced to analyze the reliability of the implicit performance function, which is difficult to implement by the classical methods such as the first order reliability method (FORM) and the Monte Carlo simulation (MCS). As a classification method where the underlying structural risk minimization inference rule is employed, SVM possesses excellent learning capacity with a small amount of information and good capability of generalization over the complete data. Hence, two approaches, i.e., SVM-based FORM and SVM-based MCS, were presented for the structural reliability analysis of the implicit limit state function. Compared to the conventional response surface method (RSM) and the artificial neural network (ANN), which are widely used to replace the implicit state function for alleviating the computation cost, the more important advantages of SVM are that it can approximate the implicit function with higher precision and better generalization under the small amount of information and avoid the "curse of dimensionality". The SVM-based reliability approaches can approximate the actual performance function over the complete sampling data with the decreased number of the implicit performance function analysis (usually finite element analysis), and the computational precision can satisfy the engineering requirement, which are demonstrated by illustrations.展开更多
Aiming at the reliability analysis of small sample data or implicit structural function,a novel structural reliability analysis model based on support vector machine(SVM)and neural network direct integration method(DN...Aiming at the reliability analysis of small sample data or implicit structural function,a novel structural reliability analysis model based on support vector machine(SVM)and neural network direct integration method(DNN)is proposed.Firstly,SVM with good small sample learning ability is used to train small sample data,fit structural performance functions and establish regular integration regions.Secondly,DNN is approximated the integral function to achieve multiple integration in the integration region.Finally,structural reliability was obtained by DNN.Numerical examples are investigated to demonstrate the effectiveness of the present method,which provides a feasible way for the structural reliability analysis.展开更多
The purpose of this paper is to present a novel way to building quantitative structure-property relationship(QSPR) models for predicting the gas-to-benzene solvation enthalpy(ΔHSolv) of 158 organic compounds based on...The purpose of this paper is to present a novel way to building quantitative structure-property relationship(QSPR) models for predicting the gas-to-benzene solvation enthalpy(ΔHSolv) of 158 organic compounds based on molecular descriptors calculated from the structure alone. Different kinds of descriptors were calculated for each compounds using dragon package. The variable selection technique of enhanced replacement method(ERM) was employed to select optimal subset of descriptors. Our investigation reveals that the dependence of physico-chemical properties on solvation enthalpy is a nonlinear observable fact and that ERM method is unable to model the solvation enthalpy accurately. The standard error value of prediction set for support vector machine(SVM) is 1.681 kJ ? mol^(-1) while it is 4.624 kJ ? mol^(-1) for ERM. The results established that the calculated ΔHSolvvalues by SVM were in good agreement with the experimental ones, and the performances of the SVM models were superior to those obtained by ERM one. This indicates that SVM can be used as an alternative modeling tool for QSPR studies.展开更多
The structure and function of proteins are closely related, and protein structure decides its function, therefore protein structure prediction is quite important.β-turns are important components of protein secondary ...The structure and function of proteins are closely related, and protein structure decides its function, therefore protein structure prediction is quite important.β-turns are important components of protein secondary structure. So development of an accurate prediction method ofβ-turn types is very necessary. In this paper, we used the composite vector with position conservation scoring function, increment of diversity and predictive secondary structure information as the input parameter of support vector machine algorithm for predicting theβ-turn types in the database of 426 protein chains, obtained the overall prediction accuracy of 95.6%, 97.8%, 97.0%, 98.9%, 99.2%, 91.8%, 99.4% and 83.9% with the Matthews Correlation Coefficient values of 0.74, 0.68, 0.20, 0.49, 0.23, 0.47, 0.49 and 0.53 for types I, II, VIII, I’, II’, IV, VI and nonturn respectively, which is better than other prediction.展开更多
Based on the research of predictingβ-hairpin motifs in proteins, we apply Random Forest and Support Vector Machine algorithm to predictβ-hairpin motifs in ArchDB40 dataset. The motifs with the loop length of 2 to 8 ...Based on the research of predictingβ-hairpin motifs in proteins, we apply Random Forest and Support Vector Machine algorithm to predictβ-hairpin motifs in ArchDB40 dataset. The motifs with the loop length of 2 to 8 amino acid residues are extracted as research object and thefixed-length pattern of 12 amino acids are selected. When using the same characteristic parameters and the same test method, Random Forest algorithm is more effective than Support Vector Machine. In addition, because of Random Forest algorithm doesn’t produce overfitting phenomenon while the dimension of characteristic parameters is higher, we use Random Forest based on higher dimension characteristic parameters to predictβ-hairpin motifs. The better prediction results are obtained;the overall accuracy and Matthew’s correlation coefficient of 5-fold cross-validation achieve 83.3% and 0.59, respectively.展开更多
A three-descriptor quantitative structure-property relationship (QSPR) model, based on the support vector machine (SVM) algorithm, was constructed to predict the glass transition temperatures (Tgs) ofpolyarylate...A three-descriptor quantitative structure-property relationship (QSPR) model, based on the support vector machine (SVM) algorithm, was constructed to predict the glass transition temperatures (Tgs) ofpolyarylates with complex structures. A total of 50 polyarylates were randomly divided into three sets, viz., the training set (30 polymers), validation set (10 polymers) and prediction set (10 polymers). By adjusting various parameters by trial and error, the final optimum SVM model based on Austin Model 1 (AM1) calculation is a polynomial kernel with the parameters C of 100, ε of 1.00E-05 and d of 2. The root-mean-square (RMS) errors obtained from the training set, validation set and prediction set are 19.4, 12.8 and 15.5 K, respectively. Research results show that the proposed SVM model has better statistical quality than the previous models. Thus, applying the SVM algorithm to predict Tgs of polymers is feasible.展开更多
Natural gas load forecasting is a key process to the efficient operation of pipeline network. An accurate forecast is required to guarantee a balanced network operation and ensure safe gas supply at a minimum cost.Mac...Natural gas load forecasting is a key process to the efficient operation of pipeline network. An accurate forecast is required to guarantee a balanced network operation and ensure safe gas supply at a minimum cost.Machine learning techniques have been increasingly applied to load forecasting. A novel regression technique based on the statistical learning theory, support vector machines (SVM), is investigated in this paper for natural gas shortterm load forecasting. SVM is based on the principle of structure risk minimization as opposed to the principle of empirical risk minimization in conventional regression techniques. Using a data set with 2 years load values we developed prediction model using SVM to obtain 31 days load predictions. The results on city natural gas short-term load forecasting show that SVM provides better prediction accuracy than neural network. The software package natural gas pipeline networks simulation and load forecasting (NGPNSLF) based on support vector regression prediction has been developed, which has also been applied in practice.展开更多
The support vector classification (SVC) was employed to make a model for classification of antifungal activities of 1-(1H-1,2,4-triazole-l-yl)-2-(2,4-difluorophenyl)-3-substituted-2-propanols triazole derivative...The support vector classification (SVC) was employed to make a model for classification of antifungal activities of 1-(1H-1,2,4-triazole-l-yl)-2-(2,4-difluorophenyl)-3-substituted-2-propanols triazole derivatives. The compounds with high antifungal activities and those with low antifungal activities were compared on the basis of the following molecular descriptors: net atomic charge on the atom N connecting with R, dipole moment and heat of formation, By using the SVC, a mathematical model was constructed, which can predict the antifungal activities of the triazole derivatives, with an accuracy of 91% on the basis of the leave-one-out cross-validation (LOOCV) test, The results indicate that the performance of the SVC model can exceed that of the principal component analysis (PCA) and K-Nearest Neighbor (KNN) models for this real world data set.展开更多
针对最小二乘孪生支持向量机(least squares twin support vector machine,LSTSVM)对噪声或是异常数据敏感和忽略数据内在结构信息的问题,提出了一种直觉模糊的结构化最小二乘孪生支持向量机(intuition fuzzy and structural least squa...针对最小二乘孪生支持向量机(least squares twin support vector machine,LSTSVM)对噪声或是异常数据敏感和忽略数据内在结构信息的问题,提出了一种直觉模糊的结构化最小二乘孪生支持向量机(intuition fuzzy and structural least squares twin support vector machine,IF-SLSTSVM)。首先采用孤立森林对输入样本点进行预处理;然后通过直觉模糊数的概念,赋予输入样本点不同的权重以减少噪声或是异常数据对分类超平面产生的影响;最后采用K-Means算法,以协方差的形式获取输入样本点之间的结构信息。IFSLSTSVM在LS-TSVM的基础上,考虑了输入样本点在特征空间中的分布信息及输入样本点之间的关系,提高了模型的鲁棒性。实验采取UCI数据集,在0%、5%、10%以及20%的不同比例噪声环境对IF-SLSTSVM算法的有效性进行验证。结果显示相较于6种对比算法,IF-SLSTSVM算法有更好的鲁棒性。展开更多
基金Project(2002CB312200) supported by the National Key Fundamental Research and Development Program of China project(60574019) supported by the National Natural Science Foundation of China
文摘Robustly stable multi-step-ahead model predictive control (MPC) based on parallel support vector machines (SVMs) with linear kernel was proposed. First, an analytical solution of optimal control laws of parallel SVMs based MPC was derived, and then the necessary and sufficient stability condition for MPC closed loop was given according to SVM model, and finally a method of judging the discrepancy between SVM model and the actual plant was presented, and consequently the constraint sets, which can guarantee that the stability condition is still robust for model/plant mismatch within some given bounds, were obtained by applying small-gain theorem. Simulation experiments show the proposed stability condition and robust constraint sets can provide a convenient way of adjusting controller parameters to ensure a closed-loop with larger stable margin.
文摘当前动态数据流下的实时分类问题存在3个难点:针对海量数据的实时处理;概念漂移的跟踪和模型的更新;模型的稳定和鲁棒性.针对上述问题,将极端支持向量机(extreme support vector machine,ESVM)与MapReduce框架结合,提出了带遗忘因子的鲁棒ESVM算法.该方法通过构造残差权重矩阵,对残差进行修正,同时加入遗忘因子,提高新样本的作用,从而实现对海量数据处理问题的求解.实验结果显示,所提出方法能够快速有效地对动态数据流进行分类,且结果不易受到噪声干扰,稳定性强.
基金supported by the National Defense Preliminary Research Program of China(A157167)the National Defense Fundamental of China(9140A19030314JB35275)
文摘A dynamic parallel forecasting model is proposed, which is based on the problem of current forecasting models and their combined model. According to the process of the model, the fuzzy C-means clustering algorithm is improved in outliers operation and distance in the clusters and among the clusters. Firstly, the input data sets are optimized and their coherence is ensured, the region scale algorithm is modified and non-isometric multi scale region fuzzy time series model is built. At the same time, the particle swarm optimization algorithm about the particle speed, location and inertia weight value is improved, this method is used to optimize the parameters of support vector machine, construct the combined forecast model, build the dynamic parallel forecast model, and calculate the dynamic weight values and regard the product of the weight value and forecast value to be the final forecast values. At last, the example shows the improved forecast model is effective and accurate.
基金Project supported by the National Natural Science Foundation of China (No.10572117)the National Astronautics Science Foundation of China (Nos.N3CH0502 and N5CH0001)Program for New Century Excellent Talent of Ministry of Education of China (No.NCET-05-0868)
文摘Support vector machine (SVM) was introduced to analyze the reliability of the implicit performance function, which is difficult to implement by the classical methods such as the first order reliability method (FORM) and the Monte Carlo simulation (MCS). As a classification method where the underlying structural risk minimization inference rule is employed, SVM possesses excellent learning capacity with a small amount of information and good capability of generalization over the complete data. Hence, two approaches, i.e., SVM-based FORM and SVM-based MCS, were presented for the structural reliability analysis of the implicit limit state function. Compared to the conventional response surface method (RSM) and the artificial neural network (ANN), which are widely used to replace the implicit state function for alleviating the computation cost, the more important advantages of SVM are that it can approximate the implicit function with higher precision and better generalization under the small amount of information and avoid the "curse of dimensionality". The SVM-based reliability approaches can approximate the actual performance function over the complete sampling data with the decreased number of the implicit performance function analysis (usually finite element analysis), and the computational precision can satisfy the engineering requirement, which are demonstrated by illustrations.
基金National Natural Science Foundation of China(Nos.11262014,11962021 and 51965051)Inner Mongolia Natural Science Foundation,China(No.2019MS05064)+1 种基金Inner Mongolia Earthquake Administration Director Fund Project,China(No.2019YB06)Inner Mongolia University of Technology Foundation,China(No.2020015)。
文摘Aiming at the reliability analysis of small sample data or implicit structural function,a novel structural reliability analysis model based on support vector machine(SVM)and neural network direct integration method(DNN)is proposed.Firstly,SVM with good small sample learning ability is used to train small sample data,fit structural performance functions and establish regular integration regions.Secondly,DNN is approximated the integral function to achieve multiple integration in the integration region.Finally,structural reliability was obtained by DNN.Numerical examples are investigated to demonstrate the effectiveness of the present method,which provides a feasible way for the structural reliability analysis.
文摘The purpose of this paper is to present a novel way to building quantitative structure-property relationship(QSPR) models for predicting the gas-to-benzene solvation enthalpy(ΔHSolv) of 158 organic compounds based on molecular descriptors calculated from the structure alone. Different kinds of descriptors were calculated for each compounds using dragon package. The variable selection technique of enhanced replacement method(ERM) was employed to select optimal subset of descriptors. Our investigation reveals that the dependence of physico-chemical properties on solvation enthalpy is a nonlinear observable fact and that ERM method is unable to model the solvation enthalpy accurately. The standard error value of prediction set for support vector machine(SVM) is 1.681 kJ ? mol^(-1) while it is 4.624 kJ ? mol^(-1) for ERM. The results established that the calculated ΔHSolvvalues by SVM were in good agreement with the experimental ones, and the performances of the SVM models were superior to those obtained by ERM one. This indicates that SVM can be used as an alternative modeling tool for QSPR studies.
文摘The structure and function of proteins are closely related, and protein structure decides its function, therefore protein structure prediction is quite important.β-turns are important components of protein secondary structure. So development of an accurate prediction method ofβ-turn types is very necessary. In this paper, we used the composite vector with position conservation scoring function, increment of diversity and predictive secondary structure information as the input parameter of support vector machine algorithm for predicting theβ-turn types in the database of 426 protein chains, obtained the overall prediction accuracy of 95.6%, 97.8%, 97.0%, 98.9%, 99.2%, 91.8%, 99.4% and 83.9% with the Matthews Correlation Coefficient values of 0.74, 0.68, 0.20, 0.49, 0.23, 0.47, 0.49 and 0.53 for types I, II, VIII, I’, II’, IV, VI and nonturn respectively, which is better than other prediction.
文摘Based on the research of predictingβ-hairpin motifs in proteins, we apply Random Forest and Support Vector Machine algorithm to predictβ-hairpin motifs in ArchDB40 dataset. The motifs with the loop length of 2 to 8 amino acid residues are extracted as research object and thefixed-length pattern of 12 amino acids are selected. When using the same characteristic parameters and the same test method, Random Forest algorithm is more effective than Support Vector Machine. In addition, because of Random Forest algorithm doesn’t produce overfitting phenomenon while the dimension of characteristic parameters is higher, we use Random Forest based on higher dimension characteristic parameters to predictβ-hairpin motifs. The better prediction results are obtained;the overall accuracy and Matthew’s correlation coefficient of 5-fold cross-validation achieve 83.3% and 0.59, respectively.
基金supported by the Open Project Program of Key Laboratory of Environmentally Friendly Chemistry and Applications of Ministry of Education,China (No.10HJYH06)
文摘A three-descriptor quantitative structure-property relationship (QSPR) model, based on the support vector machine (SVM) algorithm, was constructed to predict the glass transition temperatures (Tgs) ofpolyarylates with complex structures. A total of 50 polyarylates were randomly divided into three sets, viz., the training set (30 polymers), validation set (10 polymers) and prediction set (10 polymers). By adjusting various parameters by trial and error, the final optimum SVM model based on Austin Model 1 (AM1) calculation is a polynomial kernel with the parameters C of 100, ε of 1.00E-05 and d of 2. The root-mean-square (RMS) errors obtained from the training set, validation set and prediction set are 19.4, 12.8 and 15.5 K, respectively. Research results show that the proposed SVM model has better statistical quality than the previous models. Thus, applying the SVM algorithm to predict Tgs of polymers is feasible.
文摘Natural gas load forecasting is a key process to the efficient operation of pipeline network. An accurate forecast is required to guarantee a balanced network operation and ensure safe gas supply at a minimum cost.Machine learning techniques have been increasingly applied to load forecasting. A novel regression technique based on the statistical learning theory, support vector machines (SVM), is investigated in this paper for natural gas shortterm load forecasting. SVM is based on the principle of structure risk minimization as opposed to the principle of empirical risk minimization in conventional regression techniques. Using a data set with 2 years load values we developed prediction model using SVM to obtain 31 days load predictions. The results on city natural gas short-term load forecasting show that SVM provides better prediction accuracy than neural network. The software package natural gas pipeline networks simulation and load forecasting (NGPNSLF) based on support vector regression prediction has been developed, which has also been applied in practice.
基金Project supported by the National Natural Science Foundation of China (Grant Nos.20373040, 20503015)
文摘The support vector classification (SVC) was employed to make a model for classification of antifungal activities of 1-(1H-1,2,4-triazole-l-yl)-2-(2,4-difluorophenyl)-3-substituted-2-propanols triazole derivatives. The compounds with high antifungal activities and those with low antifungal activities were compared on the basis of the following molecular descriptors: net atomic charge on the atom N connecting with R, dipole moment and heat of formation, By using the SVC, a mathematical model was constructed, which can predict the antifungal activities of the triazole derivatives, with an accuracy of 91% on the basis of the leave-one-out cross-validation (LOOCV) test, The results indicate that the performance of the SVC model can exceed that of the principal component analysis (PCA) and K-Nearest Neighbor (KNN) models for this real world data set.
文摘针对最小二乘孪生支持向量机(least squares twin support vector machine,LSTSVM)对噪声或是异常数据敏感和忽略数据内在结构信息的问题,提出了一种直觉模糊的结构化最小二乘孪生支持向量机(intuition fuzzy and structural least squares twin support vector machine,IF-SLSTSVM)。首先采用孤立森林对输入样本点进行预处理;然后通过直觉模糊数的概念,赋予输入样本点不同的权重以减少噪声或是异常数据对分类超平面产生的影响;最后采用K-Means算法,以协方差的形式获取输入样本点之间的结构信息。IFSLSTSVM在LS-TSVM的基础上,考虑了输入样本点在特征空间中的分布信息及输入样本点之间的关系,提高了模型的鲁棒性。实验采取UCI数据集,在0%、5%、10%以及20%的不同比例噪声环境对IF-SLSTSVM算法的有效性进行验证。结果显示相较于6种对比算法,IF-SLSTSVM算法有更好的鲁棒性。