In the objective world,how to deal with the complexity and uncertainty of big data efficiently and accurately has become the premise and key to machine learning.Fuzzy support vector machine(FSVM)not only deals with th...In the objective world,how to deal with the complexity and uncertainty of big data efficiently and accurately has become the premise and key to machine learning.Fuzzy support vector machine(FSVM)not only deals with the classifi-cation problems for training samples with fuzzy information,but also assigns a fuzzy membership degree to each training sample,allowing different training samples to contribute differently in predicting an optimal hyperplane to separate two classes with maximum margin,reducing the effect of outliers and noise,Quantum computing has super parallel computing capabilities and holds the pro-mise of faster algorithmic processing of data.However,FSVM and quantum com-puting are incapable of dealing with the complexity and uncertainty of big data in an efficient and accurate manner.This paper research and propose an efficient and accurate quantum fuzzy support vector machine(QFSVM)algorithm based on the fact that quantum computing can efficiently process large amounts of data and FSVM is easy to deal with the complexity and uncertainty problems.The central idea of the proposed algorithm is to use the quantum algorithm for solving linear systems of equations(HHL algorithm)and the least-squares method to solve the quadratic programming problem in the FSVM.The proposed algorithm can deter-mine whether a sample belongs to the positive or negative class while also achiev-ing a good generalization performance.Furthermore,this paper applies QFSVM to handwritten character recognition and demonstrates that QFSVM can be run on quantum computers,and achieve accurate classification of handwritten characters.When compared to FSVM,QFSVM’s computational complexity decreases expo-nentially with the number of training samples.展开更多
This study describes a classification methodology based on support vector machines(SVMs),which offer superior classification performance for fault diagnosis in chemical process engineering.The method incorporates an e...This study describes a classification methodology based on support vector machines(SVMs),which offer superior classification performance for fault diagnosis in chemical process engineering.The method incorporates an efficient parameter tuning procedure(based on minimization of radius/margin bound for SVM's leave-one-out errors)into a multi-class classification strategy using a fuzzy decision factor,which is named fuzzy support vector machine(FSVM).The datasets generated from the Tennessee Eastman process(TEP)simulator were used to evaluate the clas-sification performance.To decrease the negative influence of the auto-correlated and irrelevant variables,a key vari-able identification procedure using recursive feature elimination,based on the SVM is implemented,with time lags incorporated,before every classifier is trained,and the number of relatively important variables to every classifier is basically determined by 10-fold cross-validation.Performance comparisons are implemented among several kinds of multi-class decision machines,by which the effectiveness of the proposed approach is proved.展开更多
A new multiple models(MM) approach was proposed to model complex industrial process by using Fuzzy Support Vector Machines(F -SVMs). By applying the proposed approach to a pH neutralization titration experiment, F -SV...A new multiple models(MM) approach was proposed to model complex industrial process by using Fuzzy Support Vector Machines(F -SVMs). By applying the proposed approach to a pH neutralization titration experiment, F -SVMs MM not only provides satisfactory approximation and generalization property, but also achieves superior performance to USOCPN multiple modeling method and single modeling method based on standard SVMs.展开更多
In order to measure the backhoe vibratory excavating resistance of a hydraulic excavator fast and precisely,the influences of vibratory excavating depth,angle,vibratory frequency,amplitude,bucket inserting velocity an...In order to measure the backhoe vibratory excavating resistance of a hydraulic excavator fast and precisely,the influences of vibratory excavating depth,angle,vibratory frequency,amplitude,bucket inserting velocity and soil type on the vibratory excavating resistance were analyzed.Simulation analysis was carded out to establish the bucket inserting velocity,amplitude and vibratory frequency considered as secondary variables and excavating resistance as primary variable.A fttzzy membership function was introduced to improve the anti-noise capacity of support vector machine,which is a soft-sensing model on the hydraulic excavator's backhoe vibratory excavating resistance based on fuzzy support vector machine.The simulation result reveals that its maximum relative training and testing error are nearly 0.68% and-0.47%,respectively.It is concluded that the model has quite high modeling precision and generalization capacity,and it can measure the vibratory excavating resistance accurately,reliably and fast in an indirect way.展开更多
The distribution of data has a significant impact on the results of classification.When the distribution of one class is insignificant compared to the distribution of another class,data imbalance occurs.This will resu...The distribution of data has a significant impact on the results of classification.When the distribution of one class is insignificant compared to the distribution of another class,data imbalance occurs.This will result in rising outlier values and noise.Therefore,the speed and performance of classification could be greatly affected.Given the above problems,this paper starts with the motivation and mathematical representing of classification,puts forward a new classification method based on the relationship between different classification formulations.Combined with the vector characteristics of the actual problem and the choice of matrix characteristics,we firstly analyze the orderly regression to introduce slack variables to solve the constraint problem of the lone point.Then we introduce the fuzzy factors to solve the problem of the gap between the isolated points on the basis of the support vector machine.We introduce the cost control to solve the problem of sample skew.Finally,based on the bi-boundary support vector machine,a twostep weight setting twin classifier is constructed.This can help to identify multitasks with feature-selected patterns without the need for additional optimizers,which solves the problem of large-scale classification that can’t deal effectively with the very low category distribution gap.展开更多
With the widespread data collection and processing,privacy-preserving machine learning has become increasingly important in addressing privacy risks related to individuals.Support vector machine(SVM)is one of the most...With the widespread data collection and processing,privacy-preserving machine learning has become increasingly important in addressing privacy risks related to individuals.Support vector machine(SVM)is one of the most elementary learning models of machine learning.Privacy issues surrounding SVM classifier training have attracted increasing attention.In this paper,we investigate Differential Privacy-compliant Federated Machine Learning with Dimensionality Reduction,called FedDPDR-DPML,which greatly improves data utility while providing strong privacy guarantees.Considering in distributed learning scenarios,multiple participants usually hold unbalanced or small amounts of data.Therefore,FedDPDR-DPML enables multiple participants to collaboratively learn a global model based on weighted model averaging and knowledge aggregation and then the server distributes the global model to each participant to improve local data utility.Aiming at high-dimensional data,we adopt differential privacy in both the principal component analysis(PCA)-based dimensionality reduction phase and SVM classifiers training phase,which improves model accuracy while achieving strict differential privacy protection.Besides,we train Differential privacy(DP)-compliant SVM classifiers by adding noise to the objective function itself,thus leading to better data utility.Extensive experiments on three high-dimensional datasets demonstrate that FedDPDR-DPML can achieve high accuracy while ensuring strong privacy protection.展开更多
A fault diagnosis model is proposed based on fuzzy support vector machine (FSVM) combined with fuzzy clustering (FC).Considering the relationship between the sample point and non-self class,FC algorithm is applied to ...A fault diagnosis model is proposed based on fuzzy support vector machine (FSVM) combined with fuzzy clustering (FC).Considering the relationship between the sample point and non-self class,FC algorithm is applied to generate fuzzy memberships.In the algorithm,sample weights based on a distribution density function of data point and genetic algorithm (GA) are introduced to enhance the performance of FC.Then a multi-class FSVM with radial basis function kernel is established according to directed acyclic graph algorithm,the penalty factor and kernel parameter of which are optimized by GA.Finally,the model is executed for multi-class fault diagnosis of rolling element bearings.The results show that the presented model achieves high performances both in identifying fault types and fault degrees.The performance comparisons of the presented model with SVM and distance-based FSVM for noisy case demonstrate the capacity of dealing with noise and generalization.展开更多
Support vector machine(SVM)is easily affected by noises and outliers,and its training time dramatically increases with the growing in number of training samples.Satellite cloud image may easily be deteriorated by nois...Support vector machine(SVM)is easily affected by noises and outliers,and its training time dramatically increases with the growing in number of training samples.Satellite cloud image may easily be deteriorated by noises and intensity non-uniformity with a huge amount of data needs to be processed regularly,so it is hard to detect convective clouds in satellite image using traditional SVM.To deal with this problem,a novel method for detection of convective clouds was proposed based on fast fuzzy support vector machine(FFSVM).FFSVM was constructed by eliminating feeble samples and designing new membership function as two aspects.Firstly,according to the distribution characteristics of fuzzy inseparable sample set and the fact that the classification hyper-plane is only determined by support vectors,this paper uses SVDD,Gaussian model and border vector extraction model comprehensively to design a sample selection method in three steps,which can eliminate most of redundant samples and keep possible support vectors.Then,by defining adaptive parameters related to attenuation rate and critical membership on the basis of the distribution characteristics of training set,an adaptive membership function is designed.Finally,the FFSVM is trained by the remaining samples using adaptive membership function to detect convective clouds.The experiments on FY-2D satellite images show that the proposed method,compared with traditional FSVM,not only remarkably reduces training time,but also further improves the accuracy of convective clouds detection.展开更多
Concluding the conformity of XBRL(eXtensible Business Reporting Language)instance documents law to the Benford's law yields different results before and after a company's financial distress.A new idea of apply...Concluding the conformity of XBRL(eXtensible Business Reporting Language)instance documents law to the Benford's law yields different results before and after a company's financial distress.A new idea of applying the machine learning technique to redefine the way conventional auditors work is therefore proposed since the unacceptable conformity implies a large likelihood of a fraudulent document.Fuzzy support vector machines models are developed to implement such an idea.The dependent variable is a fuzzy variable quantifying the conformity of an XBRL instance document to the Benford's law;whereas,independent variables are financial ratios.The interval factor method is introduced to express the fuzziness in input data.It is found the range of a fuzzy support vector machines model is controlled by maximum and minimum dependent and independent variables.Therefore,defining any member function to describe the fuzziness in input data is unnecessary.The results of this study indicate that the price-to-book ratio versus equity ratio is suitable to classify the priority of auditing XBRL instance documents with the less than 30%misclassification rate.In conclusion,the machine learning technique may be used to redefine the way conventional auditors work.This study provides the main evidence of applying a future project of training smart auditors.展开更多
Soil quality determination and estimation is an important issue not only for terrestrial ecosystems but also for sustainable management of soils.In this study,soil quality was determined by linear and nonlinear standa...Soil quality determination and estimation is an important issue not only for terrestrial ecosystems but also for sustainable management of soils.In this study,soil quality was determined by linear and nonlinear standard scoring function methods integrated with a neutrosophic fuzzy analytic hierarchy process in the micro catchment.In addition,soil quality values were estimated using a support vector machine(SVM)in machine learning algorithms.In order to generate spatial distribution maps of soil quality indice values,different interpolation methods were evaluated to detect the most suitable semivariogram model.While the soil quality index values obtained by the linear method were determined between 0.458-0.717,the soil quality index with the nonlinear method showed variability at the levels of 0.433-0.651.There was no statistical difference between the two methods,and they were determined to be similar.In the estimation of soil quality with SVM,the normalized root means square error(NRMSE)values obtained in the linear and nonlinear method estimation were determined as 0.057 and 0.047,respectively.The spherical model of simple kriging was determined as the interpolation method with the lowest RMSE value in the actual and predicted values of the linear method while,in the nonlinear method,the lowest error in the distribution maps was determined with exponential of the simple kriging.展开更多
In order to enhance measuring precision of the real complex electromechanical system,complex industrial system and complex ecological & management system with characteristics of multi-variable,non-liner,strong cou...In order to enhance measuring precision of the real complex electromechanical system,complex industrial system and complex ecological & management system with characteristics of multi-variable,non-liner,strong coupling and large time-delay,in terms of the fuzzy character of this real complex system,a fuzzy least squares support vector machine(FLS-SVM) soft measurement model was established and its parameters were optimized by using adaptive mutative scale chaos immune algorithm.The simulation results reveal that fuzzy least squares support vector machines soft measurement model is of better approximation accuracy and robustness.And application results show that the relative errors of the soft measurement model are less than 3.34%.展开更多
According to the chaotic and non-linear characters of power load data,the time series matrix is established with the theory of phase-space reconstruction,and then Lyapunov exponents with chaotic time series are comput...According to the chaotic and non-linear characters of power load data,the time series matrix is established with the theory of phase-space reconstruction,and then Lyapunov exponents with chaotic time series are computed to determine the time delay and the embedding dimension.Due to different features of the data,data mining algorithm is conducted to classify the data into different groups.Redundant information is eliminated by the advantage of data mining technology,and the historical loads that have highly similar features with the forecasting day are searched by the system.As a result,the training data can be decreased and the computing speed can also be improved when constructing support vector machine(SVM) model.Then,SVM algorithm is used to predict power load with parameters that get in pretreatment.In order to prove the effectiveness of the new model,the calculation with data mining SVM algorithm is compared with that of single SVM and back propagation network.It can be seen that the new DSVM algorithm effectively improves the forecast accuracy by 0.75%,1.10% and 1.73% compared with SVM for two random dimensions of 11-dimension,14-dimension and BP network,respectively.This indicates that the DSVM gains perfect improvement effect in the short-term power load forecasting.展开更多
Smooth support vector machine (SSVM) changs the normal support vector machine (SVM) into the unconstrained op- timization by using the smooth sigmoid function. The method can be solved under the Broyden-Fletcher-G...Smooth support vector machine (SSVM) changs the normal support vector machine (SVM) into the unconstrained op- timization by using the smooth sigmoid function. The method can be solved under the Broyden-Fletcher-Goldfarb-Shanno (BFGS) algorithm and the Newdon-Armijio (NA) algorithm easily, however the accuracy of sigmoid function is not as good as that of polyno- mial smooth function. Furthermore, the method cannot reduce the influence of outliers or noise in dataset. A fuzzy smooth support vector machine (FSSVM) with fuzzy membership and polynomial smooth functions is introduced into the SVM. The fuzzy member- ship considers the contribution rate of each sample to the optimal separating hyperplane and makes the optimization problem more accurate at the inflection point. Those changes play a positive role on trials. The results of the experiments show that those FSSVMs can obtain a better accuracy and consume the shorter time than SSVM and lagrange support vector machine (LSVM).展开更多
Workers’exposure to excessive noise is a big universal work-related challenges.One of the major consequences of exposure to noise is permanent or transient hearing loss.The current study sought to utilize audiometric...Workers’exposure to excessive noise is a big universal work-related challenges.One of the major consequences of exposure to noise is permanent or transient hearing loss.The current study sought to utilize audiometric data to weigh and prioritize the factors affecting workers’hearing loss based using the Support Vector Machine(SVM)algorithm.This cross sectional-descriptive study was conducted in 2017 in a mining industry in southeast Iran.The participating workers(n=150)were divided into three groups of 50 based on the sound pressure level to which they were exposed(two experimental groups and one control group).Audiometric tests were carried out for all members of each group.The study generally entailed the following steps:(1)selecting predicting variables to weigh and prioritize factors affecting hearing loss;(2)conducting audiometric tests and assessing permanent hearing loss in each ear and then evaluating total hearing loss;(3)categorizing different types of hearing loss;(4)weighing and prioritizing factors that affect hearing loss based on the SVM algorithm;and(5)assessing the error rate and accuracy of the models.The collected data were fed into SPSS 18,followed by conducting linear regression and paired samples t-test.It was revealed that,in the first model(SPL<70 dBA),the frequency of 8 KHz had the greatest impact(with a weight of 33%),while noise had the smallest influence(with a weight of 5%).The accuracy of this model was 100%.In the second model(70<SPL<80 dBA),the frequency of 4 KHz had the most profound effect(with a weight of 21%),whereas the frequency of 250 Hz had the lowest impact(with a weight of 6%).The accuracy of this model was 100%too.In the third model(SPL>85 dBA),the frequency of 4 KHz had the highest impact(with a weight of 22%),while the frequency of 250 Hz had the smallest influence(with a weight of 3%).The accuracy of this model was 100%too.In the fourth model,the frequency of 4 KHz had the greatest effect(with a weight of 24%),while the frequency of 500 Hz had the smallest effect(with a weight of 4%).The accuracy of this model was found to be 94%.According to the modeling conducted using the SVM algorithm,the frequency of 4 KHz has the most profound effect on predicting changes in hearing loss.Given the high accuracy of the obtained model,this algorithm is an appropriate and powerful tool to predict and model hearing loss.展开更多
Support vector machines (SVM) have been widely used in chaotic time series predictions in recent years. In order to enhance the prediction efficiency of this method and implement it in hardware, the sigmoid kernel i...Support vector machines (SVM) have been widely used in chaotic time series predictions in recent years. In order to enhance the prediction efficiency of this method and implement it in hardware, the sigmoid kernel in SVM is drawn in a more natural way by using the fuzzy logic method proposed in this paper. This method provides easy hardware implementation and straightforward interpretability. Experiments on two typical chaotic time series predictions have been carried out and the obtained results show that the average CPU time can be reduced significantly at the cost of a small decrease in prediction accuracy, which is favourable for the hardware implementation for chaotic time series prediction.展开更多
Hybrid data assimilation (DA) is a method seeing more use in recent hydrology and water resources research. In this study, a DA method coupled with the support vector machines (SVMs) and the ensemble Kalman filter...Hybrid data assimilation (DA) is a method seeing more use in recent hydrology and water resources research. In this study, a DA method coupled with the support vector machines (SVMs) and the ensemble Kalman filter (EnKF) technology was used for the prediction of soil moisture in different soil layers: 0-5 cm, 30 cm, 50 cm, 100 cm, 200 cm, and 300 cm. The SVM methodology was first used to train the ground measurements of soil moisture and meteorological parameters from the Meilin study area, in East China, to construct soil moisture statistical prediction models. Subsequent observations and their statistics were used for predictions, with two approaches: the SVM predictor and the SVM-EnKF model made by coupling the SVM model with the EnKF technique using the DA method. Validation results showed that the proposed SVM-EnKF model can improve the prediction results of soil moisture in different layers, from the surface to the root zone.展开更多
Multi-source multi-class classification methods based on multi-class Support Vector Machines and data fusion strategies are proposed in this paper. The centralized and distributed fusion schemes are applied to combine...Multi-source multi-class classification methods based on multi-class Support Vector Machines and data fusion strategies are proposed in this paper. The centralized and distributed fusion schemes are applied to combine information from several data sources. In the centralized scheme, all information from several data sources is centralized to construct an input space. Then a multi-class Support Vector Machine classifier is trained. In the distributed schemes, the individual data sources are proc-essed separately and modelled by using the multi-class Support Vector Machine. Then new data fusion strategies are proposed to combine the information from the individual multi-class Support Vector Machine models. Our proposed fusion strategies take into account that an Support Vector Machine (SVM) classifier achieves classification by finding the optimal classification hyperplane with maximal margin. The proposed methods are applied for fault diagnosis of a diesel engine. The experimental results showed that almost all the proposed approaches can largely improve the diagnostic accuracy. The robustness of diagnosis is also improved because of the implementation of data fusion strategies. The proposed methods can also be applied in other fields.展开更多
This paper firstly proposes a new support vector machine regression (SVR) with a robust loss function, and designs a gradient based algorithm for implementation of the SVR, then uses the SVR to extract fuzzy rules and...This paper firstly proposes a new support vector machine regression (SVR) with a robust loss function, and designs a gradient based algorithm for implementation of the SVR, then uses the SVR to extract fuzzy rules and designs fuzzy rule-based system. Simulations show that fuzzy rule-based system technique based on robust SVR achieves superior performance to the conventional fuzzy inference method, the proposed method provides satisfactory performance with excellent approximation and generalization property than the existing algorithm.展开更多
The internal combustion engine is the main power source of current large⁃scale machinery and equipment.Overhaul and maintenance of its faults are important conditions for ensuring the safe and stable operation of mach...The internal combustion engine is the main power source of current large⁃scale machinery and equipment.Overhaul and maintenance of its faults are important conditions for ensuring the safe and stable operation of machinery and equipment,and the identification of faults is a prerequisite.Therefore,the fault identification of internal combustion engines is one of the important directions of current research.In order to further improve the accuracy of the fault recognition of internal combustion engines,this paper takes a certain type of internal combustion engine as the research object,and constructs a support vector machine and a fuzzy neural network fault recognition model.The binary tree multi⁃class classification algorithm is used to determine the priority,and then the fuzzy neural network is verified.The feasibility of the model is proved through experiments,which can quickly identify the failure of the internal combustion engine and improve the failure processing efficiency.展开更多
Machine learning method has been widely used in various geotechnical engineering risk analysis in recent years. However, the overfitting problem often occurs due to the small number of samples obtained in history. Thi...Machine learning method has been widely used in various geotechnical engineering risk analysis in recent years. However, the overfitting problem often occurs due to the small number of samples obtained in history. This paper proposes the FuzzySVM(support vector machine) geotechnical engineering risk analysis method based on the Bayesian network. The proposed method utilizes the fuzzy set theory to build a Bayesian network to reflect prior knowledge, and utilizes the SVM to build a Bayesian network to reflect historical samples. Then a Bayesian network for evaluation is built in Bayesian estimation method by combining prior knowledge with historical samples. Taking seismic damage evaluation of slopes as an example, the steps of the method are stated in detail. The proposed method is used to evaluate the seismic damage of 96 slopes along roads in the area affected by the Wenchuan earthquake. The evaluation results show that the method can solve the overfitting problem, which often occurs if the machine learning methods are used to evaluate risk of geotechnical engineering, and the performance of the method is much better than that of the previous machine learning methods. Moreover,the proposed method can also effectively evaluate various geotechnical engineering risks in the absence of some influencing factors.展开更多
基金supported by the National Natural Science Foundation of China(No.62076042)the Key Research and Development Project of Sichuan Province(No.2021YFSY0012,No.2020YFG0307,No.2021YFG0332)+3 种基金the Science and Technology Innovation Project of Sichuan(No.2020017)the Key Research and Development Project of Chengdu(No.2019-YF05-02028-GX)the Innovation Team of Quantum Security Communication of Sichuan Province(No.17TD0009)the Academic and Technical Leaders Training Funding Support Projects of Sichuan Province(No.2016120080102643).
文摘In the objective world,how to deal with the complexity and uncertainty of big data efficiently and accurately has become the premise and key to machine learning.Fuzzy support vector machine(FSVM)not only deals with the classifi-cation problems for training samples with fuzzy information,but also assigns a fuzzy membership degree to each training sample,allowing different training samples to contribute differently in predicting an optimal hyperplane to separate two classes with maximum margin,reducing the effect of outliers and noise,Quantum computing has super parallel computing capabilities and holds the pro-mise of faster algorithmic processing of data.However,FSVM and quantum com-puting are incapable of dealing with the complexity and uncertainty of big data in an efficient and accurate manner.This paper research and propose an efficient and accurate quantum fuzzy support vector machine(QFSVM)algorithm based on the fact that quantum computing can efficiently process large amounts of data and FSVM is easy to deal with the complexity and uncertainty problems.The central idea of the proposed algorithm is to use the quantum algorithm for solving linear systems of equations(HHL algorithm)and the least-squares method to solve the quadratic programming problem in the FSVM.The proposed algorithm can deter-mine whether a sample belongs to the positive or negative class while also achiev-ing a good generalization performance.Furthermore,this paper applies QFSVM to handwritten character recognition and demonstrates that QFSVM can be run on quantum computers,and achieve accurate classification of handwritten characters.When compared to FSVM,QFSVM’s computational complexity decreases expo-nentially with the number of training samples.
基金Supported by the Special Funds for Major State Basic Research Program of China (973 Program,No.2002CB312200)the Na-tional Natural Science Foundation of China (No.60574019,No.60474045)+1 种基金the Key Technologies R&D Program of Zhejiang Province (No.2005C21087)the Academician Foundation of Zhejiang Province (No.2005A1001-13).
文摘This study describes a classification methodology based on support vector machines(SVMs),which offer superior classification performance for fault diagnosis in chemical process engineering.The method incorporates an efficient parameter tuning procedure(based on minimization of radius/margin bound for SVM's leave-one-out errors)into a multi-class classification strategy using a fuzzy decision factor,which is named fuzzy support vector machine(FSVM).The datasets generated from the Tennessee Eastman process(TEP)simulator were used to evaluate the clas-sification performance.To decrease the negative influence of the auto-correlated and irrelevant variables,a key vari-able identification procedure using recursive feature elimination,based on the SVM is implemented,with time lags incorporated,before every classifier is trained,and the number of relatively important variables to every classifier is basically determined by 10-fold cross-validation.Performance comparisons are implemented among several kinds of multi-class decision machines,by which the effectiveness of the proposed approach is proved.
基金National High Technology Research andDevelopment Program of China( Project 863 G2 0 0 1AA413 13 0
文摘A new multiple models(MM) approach was proposed to model complex industrial process by using Fuzzy Support Vector Machines(F -SVMs). By applying the proposed approach to a pH neutralization titration experiment, F -SVMs MM not only provides satisfactory approximation and generalization property, but also achieves superior performance to USOCPN multiple modeling method and single modeling method based on standard SVMs.
基金Project(2003AA430200)supported by the National High Technology Research and Development Program of China
文摘In order to measure the backhoe vibratory excavating resistance of a hydraulic excavator fast and precisely,the influences of vibratory excavating depth,angle,vibratory frequency,amplitude,bucket inserting velocity and soil type on the vibratory excavating resistance were analyzed.Simulation analysis was carded out to establish the bucket inserting velocity,amplitude and vibratory frequency considered as secondary variables and excavating resistance as primary variable.A fttzzy membership function was introduced to improve the anti-noise capacity of support vector machine,which is a soft-sensing model on the hydraulic excavator's backhoe vibratory excavating resistance based on fuzzy support vector machine.The simulation result reveals that its maximum relative training and testing error are nearly 0.68% and-0.47%,respectively.It is concluded that the model has quite high modeling precision and generalization capacity,and it can measure the vibratory excavating resistance accurately,reliably and fast in an indirect way.
基金Hebei Province Key Research and Development Project(No.20313701D)Hebei Province Key Research and Development Project(No.19210404D)+13 种基金Mobile computing and universal equipment for the Beijing Key Laboratory Open Project,The National Social Science Fund of China(17AJL014)Beijing University of Posts and Telecommunications Construction of World-Class Disciplines and Characteristic Development Guidance Special Fund “Cultural Inheritance and Innovation”Project(No.505019221)National Natural Science Foundation of China(No.U1536112)National Natural Science Foundation of China(No.81673697)National Natural Science Foundation of China(61872046)The National Social Science Fund Key Project of China(No.17AJL014)“Blue Fire Project”(Huizhou)University of Technology Joint Innovation Project(CXZJHZ201729)Industry-University Cooperation Cooperative Education Project of the Ministry of Education(No.201902218004)Industry-University Cooperation Cooperative Education Project of the Ministry of Education(No.201902024006)Industry-University Cooperation Cooperative Education Project of the Ministry of Education(No.201901197007)Industry-University Cooperation Collaborative Education Project of the Ministry of Education(No.201901199005)The Ministry of Education Industry-University Cooperation Collaborative Education Project(No.201901197001)Shijiazhuang science and technology plan project(236240267A)Hebei Province key research and development plan project(20312701D)。
文摘The distribution of data has a significant impact on the results of classification.When the distribution of one class is insignificant compared to the distribution of another class,data imbalance occurs.This will result in rising outlier values and noise.Therefore,the speed and performance of classification could be greatly affected.Given the above problems,this paper starts with the motivation and mathematical representing of classification,puts forward a new classification method based on the relationship between different classification formulations.Combined with the vector characteristics of the actual problem and the choice of matrix characteristics,we firstly analyze the orderly regression to introduce slack variables to solve the constraint problem of the lone point.Then we introduce the fuzzy factors to solve the problem of the gap between the isolated points on the basis of the support vector machine.We introduce the cost control to solve the problem of sample skew.Finally,based on the bi-boundary support vector machine,a twostep weight setting twin classifier is constructed.This can help to identify multitasks with feature-selected patterns without the need for additional optimizers,which solves the problem of large-scale classification that can’t deal effectively with the very low category distribution gap.
基金supported in part by National Natural Science Foundation of China(Nos.62102311,62202377,62272385)in part by Natural Science Basic Research Program of Shaanxi(Nos.2022JQ-600,2022JM-353,2023-JC-QN-0327)+2 种基金in part by Shaanxi Distinguished Youth Project(No.2022JC-47)in part by Scientific Research Program Funded by Shaanxi Provincial Education Department(No.22JK0560)in part by Distinguished Youth Talents of Shaanxi Universities,and in part by Youth Innovation Team of Shaanxi Universities.
文摘With the widespread data collection and processing,privacy-preserving machine learning has become increasingly important in addressing privacy risks related to individuals.Support vector machine(SVM)is one of the most elementary learning models of machine learning.Privacy issues surrounding SVM classifier training have attracted increasing attention.In this paper,we investigate Differential Privacy-compliant Federated Machine Learning with Dimensionality Reduction,called FedDPDR-DPML,which greatly improves data utility while providing strong privacy guarantees.Considering in distributed learning scenarios,multiple participants usually hold unbalanced or small amounts of data.Therefore,FedDPDR-DPML enables multiple participants to collaboratively learn a global model based on weighted model averaging and knowledge aggregation and then the server distributes the global model to each participant to improve local data utility.Aiming at high-dimensional data,we adopt differential privacy in both the principal component analysis(PCA)-based dimensionality reduction phase and SVM classifiers training phase,which improves model accuracy while achieving strict differential privacy protection.Besides,we train Differential privacy(DP)-compliant SVM classifiers by adding noise to the objective function itself,thus leading to better data utility.Extensive experiments on three high-dimensional datasets demonstrate that FedDPDR-DPML can achieve high accuracy while ensuring strong privacy protection.
基金Supported by the joint fund of National Natural Science Foundation of China and Civil Aviation Administration Foundation of China(No.U1233201)
文摘A fault diagnosis model is proposed based on fuzzy support vector machine (FSVM) combined with fuzzy clustering (FC).Considering the relationship between the sample point and non-self class,FC algorithm is applied to generate fuzzy memberships.In the algorithm,sample weights based on a distribution density function of data point and genetic algorithm (GA) are introduced to enhance the performance of FC.Then a multi-class FSVM with radial basis function kernel is established according to directed acyclic graph algorithm,the penalty factor and kernel parameter of which are optimized by GA.Finally,the model is executed for multi-class fault diagnosis of rolling element bearings.The results show that the presented model achieves high performances both in identifying fault types and fault degrees.The performance comparisons of the presented model with SVM and distance-based FSVM for noisy case demonstrate the capacity of dealing with noise and generalization.
基金supported in part by the National Natural Science Foundation of China under Grants (61471212)Natural Science Foundation of Zhejiang Province under Grants (LY16F010001)+1 种基金Science and Technology Program of Zhejiang Meteorological Bureau under Grants (2016YB01)Natural Science Foundation of Ningbo under Grants(2016A610091,2017A610297)
文摘Support vector machine(SVM)is easily affected by noises and outliers,and its training time dramatically increases with the growing in number of training samples.Satellite cloud image may easily be deteriorated by noises and intensity non-uniformity with a huge amount of data needs to be processed regularly,so it is hard to detect convective clouds in satellite image using traditional SVM.To deal with this problem,a novel method for detection of convective clouds was proposed based on fast fuzzy support vector machine(FFSVM).FFSVM was constructed by eliminating feeble samples and designing new membership function as two aspects.Firstly,according to the distribution characteristics of fuzzy inseparable sample set and the fact that the classification hyper-plane is only determined by support vectors,this paper uses SVDD,Gaussian model and border vector extraction model comprehensively to design a sample selection method in three steps,which can eliminate most of redundant samples and keep possible support vectors.Then,by defining adaptive parameters related to attenuation rate and critical membership on the basis of the distribution characteristics of training set,an adaptive membership function is designed.Finally,the FFSVM is trained by the remaining samples using adaptive membership function to detect convective clouds.The experiments on FY-2D satellite images show that the proposed method,compared with traditional FSVM,not only remarkably reduces training time,but also further improves the accuracy of convective clouds detection.
文摘Concluding the conformity of XBRL(eXtensible Business Reporting Language)instance documents law to the Benford's law yields different results before and after a company's financial distress.A new idea of applying the machine learning technique to redefine the way conventional auditors work is therefore proposed since the unacceptable conformity implies a large likelihood of a fraudulent document.Fuzzy support vector machines models are developed to implement such an idea.The dependent variable is a fuzzy variable quantifying the conformity of an XBRL instance document to the Benford's law;whereas,independent variables are financial ratios.The interval factor method is introduced to express the fuzziness in input data.It is found the range of a fuzzy support vector machines model is controlled by maximum and minimum dependent and independent variables.Therefore,defining any member function to describe the fuzziness in input data is unnecessary.The results of this study indicate that the price-to-book ratio versus equity ratio is suitable to classify the priority of auditing XBRL instance documents with the less than 30%misclassification rate.In conclusion,the machine learning technique may be used to redefine the way conventional auditors work.This study provides the main evidence of applying a future project of training smart auditors.
文摘Soil quality determination and estimation is an important issue not only for terrestrial ecosystems but also for sustainable management of soils.In this study,soil quality was determined by linear and nonlinear standard scoring function methods integrated with a neutrosophic fuzzy analytic hierarchy process in the micro catchment.In addition,soil quality values were estimated using a support vector machine(SVM)in machine learning algorithms.In order to generate spatial distribution maps of soil quality indice values,different interpolation methods were evaluated to detect the most suitable semivariogram model.While the soil quality index values obtained by the linear method were determined between 0.458-0.717,the soil quality index with the nonlinear method showed variability at the levels of 0.433-0.651.There was no statistical difference between the two methods,and they were determined to be similar.In the estimation of soil quality with SVM,the normalized root means square error(NRMSE)values obtained in the linear and nonlinear method estimation were determined as 0.057 and 0.047,respectively.The spherical model of simple kriging was determined as the interpolation method with the lowest RMSE value in the actual and predicted values of the linear method while,in the nonlinear method,the lowest error in the distribution maps was determined with exponential of the simple kriging.
基金Project(51176045)supported by the National Natural Science Foundation of ChinaProject(2011ZK2032)supported by the Major Soft Science Program of Science and Technology Ministry of Hunan Province,China
文摘In order to enhance measuring precision of the real complex electromechanical system,complex industrial system and complex ecological & management system with characteristics of multi-variable,non-liner,strong coupling and large time-delay,in terms of the fuzzy character of this real complex system,a fuzzy least squares support vector machine(FLS-SVM) soft measurement model was established and its parameters were optimized by using adaptive mutative scale chaos immune algorithm.The simulation results reveal that fuzzy least squares support vector machines soft measurement model is of better approximation accuracy and robustness.And application results show that the relative errors of the soft measurement model are less than 3.34%.
基金Project(70671039) supported by the National Natural Science Foundation of China
文摘According to the chaotic and non-linear characters of power load data,the time series matrix is established with the theory of phase-space reconstruction,and then Lyapunov exponents with chaotic time series are computed to determine the time delay and the embedding dimension.Due to different features of the data,data mining algorithm is conducted to classify the data into different groups.Redundant information is eliminated by the advantage of data mining technology,and the historical loads that have highly similar features with the forecasting day are searched by the system.As a result,the training data can be decreased and the computing speed can also be improved when constructing support vector machine(SVM) model.Then,SVM algorithm is used to predict power load with parameters that get in pretreatment.In order to prove the effectiveness of the new model,the calculation with data mining SVM algorithm is compared with that of single SVM and back propagation network.It can be seen that the new DSVM algorithm effectively improves the forecast accuracy by 0.75%,1.10% and 1.73% compared with SVM for two random dimensions of 11-dimension,14-dimension and BP network,respectively.This indicates that the DSVM gains perfect improvement effect in the short-term power load forecasting.
基金supported by the National Natural Science Foundation of China (60974082)
文摘Smooth support vector machine (SSVM) changs the normal support vector machine (SVM) into the unconstrained op- timization by using the smooth sigmoid function. The method can be solved under the Broyden-Fletcher-Goldfarb-Shanno (BFGS) algorithm and the Newdon-Armijio (NA) algorithm easily, however the accuracy of sigmoid function is not as good as that of polyno- mial smooth function. Furthermore, the method cannot reduce the influence of outliers or noise in dataset. A fuzzy smooth support vector machine (FSSVM) with fuzzy membership and polynomial smooth functions is introduced into the SVM. The fuzzy member- ship considers the contribution rate of each sample to the optimal separating hyperplane and makes the optimization problem more accurate at the inflection point. Those changes play a positive role on trials. The results of the experiments show that those FSSVMs can obtain a better accuracy and consume the shorter time than SSVM and lagrange support vector machine (LSVM).
基金This study stemmed from a research project(code number:96000838)which was sponsored by the Institute for Futures Studies in Health at Kerman University of Medical Sciences.
文摘Workers’exposure to excessive noise is a big universal work-related challenges.One of the major consequences of exposure to noise is permanent or transient hearing loss.The current study sought to utilize audiometric data to weigh and prioritize the factors affecting workers’hearing loss based using the Support Vector Machine(SVM)algorithm.This cross sectional-descriptive study was conducted in 2017 in a mining industry in southeast Iran.The participating workers(n=150)were divided into three groups of 50 based on the sound pressure level to which they were exposed(two experimental groups and one control group).Audiometric tests were carried out for all members of each group.The study generally entailed the following steps:(1)selecting predicting variables to weigh and prioritize factors affecting hearing loss;(2)conducting audiometric tests and assessing permanent hearing loss in each ear and then evaluating total hearing loss;(3)categorizing different types of hearing loss;(4)weighing and prioritizing factors that affect hearing loss based on the SVM algorithm;and(5)assessing the error rate and accuracy of the models.The collected data were fed into SPSS 18,followed by conducting linear regression and paired samples t-test.It was revealed that,in the first model(SPL<70 dBA),the frequency of 8 KHz had the greatest impact(with a weight of 33%),while noise had the smallest influence(with a weight of 5%).The accuracy of this model was 100%.In the second model(70<SPL<80 dBA),the frequency of 4 KHz had the most profound effect(with a weight of 21%),whereas the frequency of 250 Hz had the lowest impact(with a weight of 6%).The accuracy of this model was 100%too.In the third model(SPL>85 dBA),the frequency of 4 KHz had the highest impact(with a weight of 22%),while the frequency of 250 Hz had the smallest influence(with a weight of 3%).The accuracy of this model was 100%too.In the fourth model,the frequency of 4 KHz had the greatest effect(with a weight of 24%),while the frequency of 500 Hz had the smallest effect(with a weight of 4%).The accuracy of this model was found to be 94%.According to the modeling conducted using the SVM algorithm,the frequency of 4 KHz has the most profound effect on predicting changes in hearing loss.Given the high accuracy of the obtained model,this algorithm is an appropriate and powerful tool to predict and model hearing loss.
文摘Support vector machines (SVM) have been widely used in chaotic time series predictions in recent years. In order to enhance the prediction efficiency of this method and implement it in hardware, the sigmoid kernel in SVM is drawn in a more natural way by using the fuzzy logic method proposed in this paper. This method provides easy hardware implementation and straightforward interpretability. Experiments on two typical chaotic time series predictions have been carried out and the obtained results show that the average CPU time can be reduced significantly at the cost of a small decrease in prediction accuracy, which is favourable for the hardware implementation for chaotic time series prediction.
基金supported by the National Basic Research Program of China (the 973 Program,Grant No.2010CB951101)the Program for Changjiang Scholars and Innovative Research Teams in Universities,the Ministry of Education,China (Grant No. IRT0717)
文摘Hybrid data assimilation (DA) is a method seeing more use in recent hydrology and water resources research. In this study, a DA method coupled with the support vector machines (SVMs) and the ensemble Kalman filter (EnKF) technology was used for the prediction of soil moisture in different soil layers: 0-5 cm, 30 cm, 50 cm, 100 cm, 200 cm, and 300 cm. The SVM methodology was first used to train the ground measurements of soil moisture and meteorological parameters from the Meilin study area, in East China, to construct soil moisture statistical prediction models. Subsequent observations and their statistics were used for predictions, with two approaches: the SVM predictor and the SVM-EnKF model made by coupling the SVM model with the EnKF technique using the DA method. Validation results showed that the proposed SVM-EnKF model can improve the prediction results of soil moisture in different layers, from the surface to the root zone.
文摘Multi-source multi-class classification methods based on multi-class Support Vector Machines and data fusion strategies are proposed in this paper. The centralized and distributed fusion schemes are applied to combine information from several data sources. In the centralized scheme, all information from several data sources is centralized to construct an input space. Then a multi-class Support Vector Machine classifier is trained. In the distributed schemes, the individual data sources are proc-essed separately and modelled by using the multi-class Support Vector Machine. Then new data fusion strategies are proposed to combine the information from the individual multi-class Support Vector Machine models. Our proposed fusion strategies take into account that an Support Vector Machine (SVM) classifier achieves classification by finding the optimal classification hyperplane with maximal margin. The proposed methods are applied for fault diagnosis of a diesel engine. The experimental results showed that almost all the proposed approaches can largely improve the diagnostic accuracy. The robustness of diagnosis is also improved because of the implementation of data fusion strategies. The proposed methods can also be applied in other fields.
基金Supported by Zhejiang Province Nature Science Fund (No.Y106259)
文摘This paper firstly proposes a new support vector machine regression (SVR) with a robust loss function, and designs a gradient based algorithm for implementation of the SVR, then uses the SVR to extract fuzzy rules and designs fuzzy rule-based system. Simulations show that fuzzy rule-based system technique based on robust SVR achieves superior performance to the conventional fuzzy inference method, the proposed method provides satisfactory performance with excellent approximation and generalization property than the existing algorithm.
文摘The internal combustion engine is the main power source of current large⁃scale machinery and equipment.Overhaul and maintenance of its faults are important conditions for ensuring the safe and stable operation of machinery and equipment,and the identification of faults is a prerequisite.Therefore,the fault identification of internal combustion engines is one of the important directions of current research.In order to further improve the accuracy of the fault recognition of internal combustion engines,this paper takes a certain type of internal combustion engine as the research object,and constructs a support vector machine and a fuzzy neural network fault recognition model.The binary tree multi⁃class classification algorithm is used to determine the priority,and then the fuzzy neural network is verified.The feasibility of the model is proved through experiments,which can quickly identify the failure of the internal combustion engine and improve the failure processing efficiency.
基金supported by the National Key Research and Development Program (Grant No. 2017YFC0504901)Sichuan Traffic Construction Science and Technology Project(Grant No. 2016B2–2)Doctoral Innovation Fund Program of Southwest Jiaotong University(Grant No. D-CX201804)
文摘Machine learning method has been widely used in various geotechnical engineering risk analysis in recent years. However, the overfitting problem often occurs due to the small number of samples obtained in history. This paper proposes the FuzzySVM(support vector machine) geotechnical engineering risk analysis method based on the Bayesian network. The proposed method utilizes the fuzzy set theory to build a Bayesian network to reflect prior knowledge, and utilizes the SVM to build a Bayesian network to reflect historical samples. Then a Bayesian network for evaluation is built in Bayesian estimation method by combining prior knowledge with historical samples. Taking seismic damage evaluation of slopes as an example, the steps of the method are stated in detail. The proposed method is used to evaluate the seismic damage of 96 slopes along roads in the area affected by the Wenchuan earthquake. The evaluation results show that the method can solve the overfitting problem, which often occurs if the machine learning methods are used to evaluate risk of geotechnical engineering, and the performance of the method is much better than that of the previous machine learning methods. Moreover,the proposed method can also effectively evaluate various geotechnical engineering risks in the absence of some influencing factors.