A semi-supervised vector machine is a relatively new learning method using both labeled and unlabeled data in classifi- cation. Since the objective function of the model for an unstrained semi-supervised vector machin...A semi-supervised vector machine is a relatively new learning method using both labeled and unlabeled data in classifi- cation. Since the objective function of the model for an unstrained semi-supervised vector machine is not smooth, many fast opti- mization algorithms cannot be applied to solve the model. In order to overcome the difficulty of dealing with non-smooth objective functions, new methods that can solve the semi-supervised vector machine with desired classification accuracy are in great demand. A quintic spline function with three-times differentiability at the ori- gin is constructed by a general three-moment method, which can be used to approximate the symmetric hinge loss function. The approximate accuracy of the quintic spiine function is estimated. Moreover, a quintic spline smooth semi-support vector machine is obtained and the convergence accuracy of the smooth model to the non-smooth one is analyzed. Three experiments are performed to test the efficiency of the model. The experimental results show that the new model outperforms other smooth models, in terms of classification performance. Furthermore, the new model is not sensitive to the increasing number of the labeled samples, which means that the new model is more efficient.展开更多
In this work, support vector classification (SVC) algorithm was used to build structure-activity relationship (SAR) model of the 5-hydroxytryptamine type 3 (5-HT3 ) receptor antagonists with 26 compounds. In a b...In this work, support vector classification (SVC) algorithm was used to build structure-activity relationship (SAR) model of the 5-hydroxytryptamine type 3 (5-HT3 ) receptor antagonists with 26 compounds. In a benchmark test, SVC was compared with several techniques of machine learning currently used in the field. The prediction performance of the model was discussed on the basis of the leave-one-out cross-validation. The results show that the accuracy of prediction of SVC model was higher than those of back propagation artificial neural network (BP ANN), K-nearest neighbor (KNN) and Fisher methods.展开更多
Engine spark ignition is an important source for diagnosis of engine faults.Based on the waveform of the ignition pattern,a mechanic can guess what may be the potential malfunctioning parts of an engine with his/her e...Engine spark ignition is an important source for diagnosis of engine faults.Based on the waveform of the ignition pattern,a mechanic can guess what may be the potential malfunctioning parts of an engine with his/her experience and handbooks.However,this manual diagnostic method is imprecise because many spark ignition patterns are very similar.Therefore,a diagnosis needs many trials to identify the malfunctioning parts.Meanwhile the mechanic needs to disassemble and assemble the engine parts for verification.To tackle this problem,an intelligent diagnosis system was established based on ignition patterns.First,the captured patterns were normalized and compressed.Then wavelet packet transform(WPT) was employed to extract the representative features of the ignition patterns.Finally,a classification system was constructed by using multi-class support vector machines(SVM) and the extracted features.The classification system can intelligently classify the most likely engine fault so as to reduce the number of diagnosis trials.Experimental results show that SVM produces higher diagnosis accuracy than the traditional multilayer feedforward neural network.This is the first trial on the combination of WPT and SVM to analyze ignition patterns and diagnose automotive engines.展开更多
The paper is related to the error analysis of Multicategory Support Vector Machine (MSVM) classifiers based on reproducing kernel Hilbert spaces. We choose the polynomial kernel as Mercer kernel and give the error e...The paper is related to the error analysis of Multicategory Support Vector Machine (MSVM) classifiers based on reproducing kernel Hilbert spaces. We choose the polynomial kernel as Mercer kernel and give the error estimate with De La Vall6e Poussin means. We also introduce the standard estimation of sample error, and derive the explicit learning rate.展开更多
A comprehensive assessment of the spatial-aware supervised learning algorithms for hyper-spectral image(HSI)classification was presented.For this purpose,standard support vector machines(SVMs),multinomial logistic reg...A comprehensive assessment of the spatial-aware supervised learning algorithms for hyper-spectral image(HSI)classification was presented.For this purpose,standard support vector machines(SVMs),multinomial logistic regression(MLR)and sparse representation(SR) based supervised learning algorithm were compared both theoretically and experimentally.Performance of the discussed techniques was evaluated in terms of overall accuracy,average accuracy,kappa statistic coefficients,and sparsity of the solutions.Execution time,the computational burden,and the capability of the methods were investigated by using probabilistic analysis.For validating the accuracy a classical benchmark AVIRIS Indian pines data set was used.Experiments show that integrating spectral-spatial context can further improve the accuracy,reduce the misclassification error although the cost of computational time will be increased.展开更多
In this study, a Discriminator Model for Glaucoma Diagnosis (DMGD)using soft computing techniques is presented. As the biomedical images such asfundus images are often acquired in high resolution, the Region of Intere...In this study, a Discriminator Model for Glaucoma Diagnosis (DMGD)using soft computing techniques is presented. As the biomedical images such asfundus images are often acquired in high resolution, the Region of Interest (ROI)for glaucoma diagnosis must be selected at first to reduce the complexity of anysystem. The DMGD system uses a series of pre-processing;initial cropping by thegreen channel’s intensity, Spatially Weighted Fuzzy C Means (SWFCM), bloodvessel detection and removal by Gaussian Derivative Filters (GDF) and inpaintingalgorithms. Once the ROI has been selected, the numerical features such as colour, spatial domain features from Local Binary Pattern (LBP) and frequencydomain features from LAWS are generated from the corresponding ROI forfurther classification using kernel based Support Vector Machine (SVM). TheDMGD system performances are validated using four fundus image databases;ORIGA, RIM-ONE, DRISHTI-GS1, and HRF with four different kernels;LinearKernel (LK), Polynomial Kernel (PK), Radial Basis Function (RBFK) kernel,Quadratic Kernel (QK) based SVM classifiers. Results show that the DMGD system classifies the fundus images accurately using the multiple features and kernelbased classifies from the properly segmented ROI.展开更多
Support vector machines and a Kalman-like observer are used for fault detection and isolation in a variable speed horizontalaxis wind turbine composed of three blades and a full converter. The support vector approach ...Support vector machines and a Kalman-like observer are used for fault detection and isolation in a variable speed horizontalaxis wind turbine composed of three blades and a full converter. The support vector approach is data-based and is therefore robust to process knowledge. It is based on structural risk minimization which enhances generalization even with small training data set and it allows for process nonlinearity by using flexible kernels. In this work, a radial basis function is used as the kernel. Different parts of the process are investigated including actuators and sensors faults. With duplicated sensors, sensor faults in blade pitch positions,generator and rotor speeds can be detected. Faults of type stuck measurements can be detected in 2 sampling periods. The detection time of offset/scaled measurements depends on the severity of the fault and on the process dynamics when the fault occurs. The converter torque actuator fault can be detected within 2 sampling periods. Faults in the actuators of the pitch systems represents a higher difficulty for fault detection which is due to the fact that such faults only affect the transitory state(which is very fast) but not the final stationary state. Therefore, two methods are considered and compared for fault detection and isolation of this fault: support vector machines and a Kalman-like observer. Advantages and disadvantages of each method are discussed. On one hand, support vector machines training of transitory states would require a big amount of data in different situations, but the fault detection and isolation results are robust to variations in the input/operating point. On the other hand, the observer is model-based, and therefore does not require training, and it allows identification of the fault level, which is interesting for fault reconfiguration. But the observability of the system is ensured under specific conditions, related to the dynamics of the inputs and outputs. The whole fault detection and isolation scheme is evaluated using a wind turbine benchmark with a real sequence of wind speed.展开更多
Advances in the technology of astronomical spectra acquisition have resulted in an enormous amount of data available in world-wide telescope archives. It is no longer feasible to analyze them using classical approache...Advances in the technology of astronomical spectra acquisition have resulted in an enormous amount of data available in world-wide telescope archives. It is no longer feasible to analyze them using classical approaches, so a new astronomical discipline,astroinformatics, has emerged. We describe the initial experiments in the investigation of spectral line profiles of emission line stars using machine learning with attempt to automatically identify Be and B[e] stars spectra in large archives and classify their types in an automatic manner. Due to the size of spectra collections, the dimension reduction techniques based on wavelet transformation are studied as well. The result clearly justifies that machine learning is able to distinguish different shapes of line profiles even after drastic dimension reduction.展开更多
Network traffic anomalies are unusual changes in a network,so diagnosing anomalies is important for network management.Feature-based anomaly detection models (ab)normal network traffic behavior by analyzing packet h...Network traffic anomalies are unusual changes in a network,so diagnosing anomalies is important for network management.Feature-based anomaly detection models (ab)normal network traffic behavior by analyzing packet header features.PCA-subspace method (Principal Component Analysis) has been verified as an efficient feature-based way in network-wide anomaly detection.Despite the powerful ability of PCA-subspace method for network-wide traffic detection,it cannot be effectively used for detection on a single link.In this paper,different from most works focusing on detection on flow-level traffic,based on observations of six traffic features for packet-level traffic,we propose a new approach B6SVM to detect anomalies for packet-level traffic on a single link.The basic idea of B6-SVM is to diagnose anomalies in a multi-dimensional view of traffic features using Support Vector Machine (SVM).Through two-phase classification,B6-SVM can detect anomalies with high detection rate and low false alarm rate.The test results demonstrate the effectiveness and potential of our technique in diagnosing anomalies.Further,compared to previous feature-based anomaly detection approaches,B6-SVM provides a framework to automatically identify possible anomalous types.The framework of B6-SVM is generic and therefore,we expect the derived insights will be helpful for similar future research efforts.展开更多
基金supported by the Fundamental Research Funds for University of Science and Technology Beijing(FRF-BR-12-021)
文摘A semi-supervised vector machine is a relatively new learning method using both labeled and unlabeled data in classifi- cation. Since the objective function of the model for an unstrained semi-supervised vector machine is not smooth, many fast opti- mization algorithms cannot be applied to solve the model. In order to overcome the difficulty of dealing with non-smooth objective functions, new methods that can solve the semi-supervised vector machine with desired classification accuracy are in great demand. A quintic spline function with three-times differentiability at the ori- gin is constructed by a general three-moment method, which can be used to approximate the symmetric hinge loss function. The approximate accuracy of the quintic spiine function is estimated. Moreover, a quintic spline smooth semi-support vector machine is obtained and the convergence accuracy of the smooth model to the non-smooth one is analyzed. Three experiments are performed to test the efficiency of the model. The experimental results show that the new model outperforms other smooth models, in terms of classification performance. Furthermore, the new model is not sensitive to the increasing number of the labeled samples, which means that the new model is more efficient.
基金Project supported by National Natural Science Foundation of China( Grant No. 20373040)
文摘In this work, support vector classification (SVC) algorithm was used to build structure-activity relationship (SAR) model of the 5-hydroxytryptamine type 3 (5-HT3 ) receptor antagonists with 26 compounds. In a benchmark test, SVC was compared with several techniques of machine learning currently used in the field. The prediction performance of the model was discussed on the basis of the leave-one-out cross-validation. The results show that the accuracy of prediction of SVC model was higher than those of back propagation artificial neural network (BP ANN), K-nearest neighbor (KNN) and Fisher methods.
基金supported by University of Macao Research Grant,China (Grant No. RG057/08-09S/VCM/FST, Grant No. UL011/09-Y1/ EME/ WPK01/FST)
文摘Engine spark ignition is an important source for diagnosis of engine faults.Based on the waveform of the ignition pattern,a mechanic can guess what may be the potential malfunctioning parts of an engine with his/her experience and handbooks.However,this manual diagnostic method is imprecise because many spark ignition patterns are very similar.Therefore,a diagnosis needs many trials to identify the malfunctioning parts.Meanwhile the mechanic needs to disassemble and assemble the engine parts for verification.To tackle this problem,an intelligent diagnosis system was established based on ignition patterns.First,the captured patterns were normalized and compressed.Then wavelet packet transform(WPT) was employed to extract the representative features of the ignition patterns.Finally,a classification system was constructed by using multi-class support vector machines(SVM) and the extracted features.The classification system can intelligently classify the most likely engine fault so as to reduce the number of diagnosis trials.Experimental results show that SVM produces higher diagnosis accuracy than the traditional multilayer feedforward neural network.This is the first trial on the combination of WPT and SVM to analyze ignition patterns and diagnose automotive engines.
文摘The paper is related to the error analysis of Multicategory Support Vector Machine (MSVM) classifiers based on reproducing kernel Hilbert spaces. We choose the polynomial kernel as Mercer kernel and give the error estimate with De La Vall6e Poussin means. We also introduce the standard estimation of sample error, and derive the explicit learning rate.
基金National Key Research and Development Program of China(No.2016YFF0103604)National Natural Science Foundations of China(Nos.61171165,11431015,61571230)+1 种基金National Scientific Equipment Developing Project of China(No.2012YQ050250)Natural Science Foundation of Jiangsu Province,China(No.BK20161500)
文摘A comprehensive assessment of the spatial-aware supervised learning algorithms for hyper-spectral image(HSI)classification was presented.For this purpose,standard support vector machines(SVMs),multinomial logistic regression(MLR)and sparse representation(SR) based supervised learning algorithm were compared both theoretically and experimentally.Performance of the discussed techniques was evaluated in terms of overall accuracy,average accuracy,kappa statistic coefficients,and sparsity of the solutions.Execution time,the computational burden,and the capability of the methods were investigated by using probabilistic analysis.For validating the accuracy a classical benchmark AVIRIS Indian pines data set was used.Experiments show that integrating spectral-spatial context can further improve the accuracy,reduce the misclassification error although the cost of computational time will be increased.
文摘In this study, a Discriminator Model for Glaucoma Diagnosis (DMGD)using soft computing techniques is presented. As the biomedical images such asfundus images are often acquired in high resolution, the Region of Interest (ROI)for glaucoma diagnosis must be selected at first to reduce the complexity of anysystem. The DMGD system uses a series of pre-processing;initial cropping by thegreen channel’s intensity, Spatially Weighted Fuzzy C Means (SWFCM), bloodvessel detection and removal by Gaussian Derivative Filters (GDF) and inpaintingalgorithms. Once the ROI has been selected, the numerical features such as colour, spatial domain features from Local Binary Pattern (LBP) and frequencydomain features from LAWS are generated from the corresponding ROI forfurther classification using kernel based Support Vector Machine (SVM). TheDMGD system performances are validated using four fundus image databases;ORIGA, RIM-ONE, DRISHTI-GS1, and HRF with four different kernels;LinearKernel (LK), Polynomial Kernel (PK), Radial Basis Function (RBFK) kernel,Quadratic Kernel (QK) based SVM classifiers. Results show that the DMGD system classifies the fundus images accurately using the multiple features and kernelbased classifies from the properly segmented ROI.
文摘Support vector machines and a Kalman-like observer are used for fault detection and isolation in a variable speed horizontalaxis wind turbine composed of three blades and a full converter. The support vector approach is data-based and is therefore robust to process knowledge. It is based on structural risk minimization which enhances generalization even with small training data set and it allows for process nonlinearity by using flexible kernels. In this work, a radial basis function is used as the kernel. Different parts of the process are investigated including actuators and sensors faults. With duplicated sensors, sensor faults in blade pitch positions,generator and rotor speeds can be detected. Faults of type stuck measurements can be detected in 2 sampling periods. The detection time of offset/scaled measurements depends on the severity of the fault and on the process dynamics when the fault occurs. The converter torque actuator fault can be detected within 2 sampling periods. Faults in the actuators of the pitch systems represents a higher difficulty for fault detection which is due to the fact that such faults only affect the transitory state(which is very fast) but not the final stationary state. Therefore, two methods are considered and compared for fault detection and isolation of this fault: support vector machines and a Kalman-like observer. Advantages and disadvantages of each method are discussed. On one hand, support vector machines training of transitory states would require a big amount of data in different situations, but the fault detection and isolation results are robust to variations in the input/operating point. On the other hand, the observer is model-based, and therefore does not require training, and it allows identification of the fault level, which is interesting for fault reconfiguration. But the observability of the system is ensured under specific conditions, related to the dynamics of the inputs and outputs. The whole fault detection and isolation scheme is evaluated using a wind turbine benchmark with a real sequence of wind speed.
基金supported by Czech Science Foundation(No.GACR13-08195S)the project Central Register of Research Intentions CEZMSM0021630528 Security-oriented Research in Information Technology,the specific research(No.FIT-S-11-2)+2 种基金the project RVO:67985815the Technological agency of the Czech Republic(TACR)project V3C(No.TE01020415)Grant Agency of the Czech Republic-GACR P103/13/08195S
文摘Advances in the technology of astronomical spectra acquisition have resulted in an enormous amount of data available in world-wide telescope archives. It is no longer feasible to analyze them using classical approaches, so a new astronomical discipline,astroinformatics, has emerged. We describe the initial experiments in the investigation of spectral line profiles of emission line stars using machine learning with attempt to automatically identify Be and B[e] stars spectra in large archives and classify their types in an automatic manner. Due to the size of spectra collections, the dimension reduction techniques based on wavelet transformation are studied as well. The result clearly justifies that machine learning is able to distinguish different shapes of line profiles even after drastic dimension reduction.
基金supported by the National Basic Research 973 Program of China under Grant No. 2009CB320505the National Science and Technology Supporting Plan of China under Grant No. 2008BAH37B05+2 种基金the National Natural Science Foundation of China under Grant No. 61170211the Ph.D. Programs Foundation of Ministry of Education of China under Grant No. 20110002110056the National High Technology Research and Development 863 Program of China under Grant Nos. 2008AA01A303 and 2009AA01Z251
文摘Network traffic anomalies are unusual changes in a network,so diagnosing anomalies is important for network management.Feature-based anomaly detection models (ab)normal network traffic behavior by analyzing packet header features.PCA-subspace method (Principal Component Analysis) has been verified as an efficient feature-based way in network-wide anomaly detection.Despite the powerful ability of PCA-subspace method for network-wide traffic detection,it cannot be effectively used for detection on a single link.In this paper,different from most works focusing on detection on flow-level traffic,based on observations of six traffic features for packet-level traffic,we propose a new approach B6SVM to detect anomalies for packet-level traffic on a single link.The basic idea of B6-SVM is to diagnose anomalies in a multi-dimensional view of traffic features using Support Vector Machine (SVM).Through two-phase classification,B6-SVM can detect anomalies with high detection rate and low false alarm rate.The test results demonstrate the effectiveness and potential of our technique in diagnosing anomalies.Further,compared to previous feature-based anomaly detection approaches,B6-SVM provides a framework to automatically identify possible anomalous types.The framework of B6-SVM is generic and therefore,we expect the derived insights will be helpful for similar future research efforts.