A fault diagnosis model is proposed based on fuzzy support vector machine (FSVM) combined with fuzzy clustering (FC).Considering the relationship between the sample point and non-self class,FC algorithm is applied to ...A fault diagnosis model is proposed based on fuzzy support vector machine (FSVM) combined with fuzzy clustering (FC).Considering the relationship between the sample point and non-self class,FC algorithm is applied to generate fuzzy memberships.In the algorithm,sample weights based on a distribution density function of data point and genetic algorithm (GA) are introduced to enhance the performance of FC.Then a multi-class FSVM with radial basis function kernel is established according to directed acyclic graph algorithm,the penalty factor and kernel parameter of which are optimized by GA.Finally,the model is executed for multi-class fault diagnosis of rolling element bearings.The results show that the presented model achieves high performances both in identifying fault types and fault degrees.The performance comparisons of the presented model with SVM and distance-based FSVM for noisy case demonstrate the capacity of dealing with noise and generalization.展开更多
Turbopump condition monitoring is a significant approach to ensure the safety of liquid rocket engine (LRE).Because of lack of fault samples,a monitoring system cannot be trained on all possible condition patterns.T...Turbopump condition monitoring is a significant approach to ensure the safety of liquid rocket engine (LRE).Because of lack of fault samples,a monitoring system cannot be trained on all possible condition patterns.Thus it is important to differentiate abnormal or unknown patterns from normal pattern with novelty detection methods.One-class support vector machine (OCSVM) that has been commonly used for novelty detection cannot deal well with large scale samples.In order to model the normal pattern of the turbopump with OCSVM and so as to monitor the condition of the turbopump,a monitoring method that integrates OCSVM with incremental clustering is presented.In this method,the incremental clustering is used for sample reduction by extracting representative vectors from a large training set.The representative vectors are supposed to distribute uniformly in the object region and fulfill the region.And training OCSVM on these representative vectors yields a novelty detector.By applying this method to the analysis of the turbopump's historical test data,it shows that the incremental clustering algorithm can extract 91 representative points from more than 36 000 training vectors,and the OCSVM detector trained on these 91 representative points can recognize spikes in vibration signals caused by different abnormal events such as vane shedding,rub-impact and sensor faults.This monitoring method does not need fault samples during training as classical recognition methods.The method resolves the learning problem of large samples and is an alternative method for condition monitoring of the LRE turbopump.展开更多
Based on the framework of support vector machines( SVM) using one-against-one( OAO) strategy, a new multi-class kernel method based on directed acyclic graph( DAG) and probabilistic distance is proposed to raise the m...Based on the framework of support vector machines( SVM) using one-against-one( OAO) strategy, a new multi-class kernel method based on directed acyclic graph( DAG) and probabilistic distance is proposed to raise the multi-class classification accuracies. The topology structure of DAG is constructed by rearranging the nodes' sequence in the graph. DAG is equivalent to guided operating SVM on a list,and the classification performance depends on the nodes' sequence in the graph. Jeffries-Matusita distance( JMD) is introduced to estimate the separability of each class,and the implementation list is initialized with all classes organized according to certain sequence in the list. To testify the effectiveness of the proposed method,numerical analysis is conducted on UCI data and hyperspectral data. Meanwhile,comparative studies using standard OAO and DAG classification methods are also conducted and the results illustrate better performance and higher accuracy of the proposed JMD-DAG method.展开更多
It is a challenging topic to develop an efficient algorithm for large scale classification problems in many applications of machine learning. In this paper, a hierarchical clustering and fixed-layer local learning (HC...It is a challenging topic to develop an efficient algorithm for large scale classification problems in many applications of machine learning. In this paper, a hierarchical clustering and fixed-layer local learning (HCFLL) based support vector machine(SVM) algorithm is proposed to deal with this problem. Firstly, HCFLL hierarchically clusters a given dataset into a modified clustering feature tree based on the ideas of unsupervised clustering and supervised clustering. Then it locally trains SVM on each labeled subtree at a fixed-layer of the tree. The experimental results show that compared with the existing popular algorithms such as core vector machine and decision-tree support vector machine, HCFLL can significantly improve the training and testing speeds with comparable testing accuracy.展开更多
During the last two decades signicant work has been reported in the eld of cursive language’s recognition especially,in the Arabic,the Urdu and the Persian languages.The unavailability of such work in the Pashto lang...During the last two decades signicant work has been reported in the eld of cursive language’s recognition especially,in the Arabic,the Urdu and the Persian languages.The unavailability of such work in the Pashto language is because of:the absence of a standard database and of signicant research work that ultimately acts as a big barrier for the research community.The slight change in the Pashto characters’shape is an additional challenge for researchers.This paper presents an efcient OCR system for the handwritten Pashto characters based on multi-class enabled support vector machine using manifold feature extraction techniques.These feature extraction techniques include,tools such as zoning feature extractor,discrete cosine transform,discrete wavelet transform,and Gabor lters and histogram of oriented gradients.A hybrid feature map is developed by combining the manifold feature maps.This research work is performed by developing a medium-sized dataset of handwritten Pashto characters that encapsulate 200 handwritten samples for each 44 characters in the Pashto language.Recognition results are generated for the proposed model based on a manifold and hybrid feature map.An overall accuracy rates of 63.30%,65.13%,68.55%,68.28%,67.02%and 83%are generated based on a zoning technique,HoGs,Gabor lter,DCT,DWT and hybrid feature maps respectively.Applicability of the proposed model is also tested by comparing its results with a convolution neural network model.The convolution neural network-based model generated an accuracy rate of 81.02%smaller than the multi-class support vector machine.The highest accuracy rate of 83%for the multi-class SVM model based on a hybrid feature map reects the applicability of the proposed model.展开更多
A new algorithm named kernel bisecting k-means and sample removal(KBK-SR) is proposed as sampling preprocessing for support vector machine(SVM) training to improve the efficiency.The proposed algorithm tends to quickl...A new algorithm named kernel bisecting k-means and sample removal(KBK-SR) is proposed as sampling preprocessing for support vector machine(SVM) training to improve the efficiency.The proposed algorithm tends to quickly produce balanced clusters of similar sizes in the kernel feature space,which makes it efficient and effective for reducing training samples.Theoretical analysis and experimental results on three UCI real data benchmarks both show that,with very short sampling time,the proposed algorithm dramatically accelerates SVM sampling and training while maintaining high test accuracy.展开更多
In this paper, a support vector machine-based multi-model predictive control is proposed, in which SVM classification combines well with SVM regression. At first, each working environment is modeled by SVM regression ...In this paper, a support vector machine-based multi-model predictive control is proposed, in which SVM classification combines well with SVM regression. At first, each working environment is modeled by SVM regression and the support vector machine network-based model predictive control (SVMN-MPC) algorithm corresponding to each environment is developed, and then a multi-class SVM model is established to recognize multiple operating conditions. As for control, the current environment is identified by the multi-class SVM model and then the corresponding SVMN-MPC controller is activated at each sampling instant. The proposed modeling, switching and controller design is demonstrated in simulation results.展开更多
The least squares support vector machine (LS-SVM) is used to study the nonlinear time series prediction.First, the parameter γ and multi-step prediction capabilities of the LS-SVM network are discussed. Then we emplo...The least squares support vector machine (LS-SVM) is used to study the nonlinear time series prediction.First, the parameter γ and multi-step prediction capabilities of the LS-SVM network are discussed. Then we employ clustering method in the model to prune the number of the support values. The learning rate and the capabilities of filtering noise for LS-SVM are all greatly improved.展开更多
Tensor representation is useful to reduce the overfitting problem in vector-based learning algorithm in pattern recognition.This is mainly because the structure information of objects in pattern analysis is a reasonab...Tensor representation is useful to reduce the overfitting problem in vector-based learning algorithm in pattern recognition.This is mainly because the structure information of objects in pattern analysis is a reasonable constraint to reduce the number of unknown parameters used to model a classifier.In this paper, we generalize the vector-based learning algorithm TWin Support Vector Machine(TWSVM) to the tensor-based method TWin Support Tensor Machines(TWSTM), which accepts general tensors as input.To examine the effectiveness of TWSTM, we implement the TWSTM method for Microcalcification Clusters(MCs) detection.In the tensor subspace domain, the MCs detection procedure is formulated as a supervised learning and classification problem, and TWSTM is used as a classifier to make decision for the presence of MCs or not.A large number of experiments were carried out to evaluate and compare the performance of the proposed MCs detection algorithm.By comparison with TWSVM, the tensor version reduces the overfitting problem.展开更多
Support Vector Clustering (SVC) is a kernel-based unsupervised learning clustering method. The main drawback of SVC is its high computational complexity in getting the adjacency matrix describing the connectivity for ...Support Vector Clustering (SVC) is a kernel-based unsupervised learning clustering method. The main drawback of SVC is its high computational complexity in getting the adjacency matrix describing the connectivity for each pairs of points. Based on the proximity graph model [3], the Euclidean distance in Hilbert space is calculated using a Gaussian kernel, which is the right criterion to generate a minimum spanning tree using Kruskal's algorithm. Then the connectivity estimation is lowered by only checking the linkages between the edges that construct the main stem of the MST (Minimum Spanning Tree), in which the non-compatibility degree is originally defined to support the edge selection during linkage estimations. This new approach is experimentally analyzed. The results show that the revised algorithm has a better performance than the proximity graph model with faster speed, optimized clustering quality and strong ability to noise suppression, which makes SVC scalable to large data sets.展开更多
Disease recognition in plants is one of the essential problems in agricultural image processing.This article focuses on designing a framework that can recognize and classify diseases on pomegranate plants exactly.The ...Disease recognition in plants is one of the essential problems in agricultural image processing.This article focuses on designing a framework that can recognize and classify diseases on pomegranate plants exactly.The framework utilizes image processing techniques such as image acquisition,image resizing,image enhancement,image segmentation,ROI extraction(region of interest),and feature extraction.An image dataset related to pomegranate leaf disease is utilized to implement the framework,divided into a training set and a test set.In the implementation process,techniques such as image enhancement and image segmentation are primarily used for identifying ROI and features.An image classification will then be implemented by combining a supervised learning model with a support vector machine.The proposed framework is developed based on MATLAB with a graphical user interface.According to the experimental results,the proposed framework can achieve 98.39%accuracy for classifying diseased and healthy leaves.Moreover,the framework can achieve an accuracy of 98.07%for classifying diseases on pomegranate leaves.展开更多
Focusing on strip steel surface defects classification, a novel support vector machine with adjustable hyper-sphere (AHSVM) is formulated. Meanwhile, a new multi-class classification method is proposed. Originated f...Focusing on strip steel surface defects classification, a novel support vector machine with adjustable hyper-sphere (AHSVM) is formulated. Meanwhile, a new multi-class classification method is proposed. Originated from support vector data description, AHSVM adopts hyper-sphere to solve classification problem. AHSVM can obey two principles: the margin maximization and inner-class dispersion minimization. Moreover, the hyper-sphere of AHSVM is adjustable, which makes the final classification hyper-sphere optimal for training dataset. On the other hand, AHSVM is combined with binary tree to solve multi-class classification for steel surface defects. A scheme of samples pruning in mapped feature space is provided, which can reduce the number of training samples under the premise of classification accuracy, resulting in the improvements of classification speed. Finally, some testing experiments are done for eight types of strip steel surface defects. Experimental results show that multi-class AHSVM classifier exhibits satisfactory results in classification accuracy and efficiency.展开更多
针对最小二乘孪生支持向量机(least squares twin support vector machine,LSTSVM)对噪声或是异常数据敏感和忽略数据内在结构信息的问题,提出了一种直觉模糊的结构化最小二乘孪生支持向量机(intuition fuzzy and structural least squa...针对最小二乘孪生支持向量机(least squares twin support vector machine,LSTSVM)对噪声或是异常数据敏感和忽略数据内在结构信息的问题,提出了一种直觉模糊的结构化最小二乘孪生支持向量机(intuition fuzzy and structural least squares twin support vector machine,IF-SLSTSVM)。首先采用孤立森林对输入样本点进行预处理;然后通过直觉模糊数的概念,赋予输入样本点不同的权重以减少噪声或是异常数据对分类超平面产生的影响;最后采用K-Means算法,以协方差的形式获取输入样本点之间的结构信息。IFSLSTSVM在LS-TSVM的基础上,考虑了输入样本点在特征空间中的分布信息及输入样本点之间的关系,提高了模型的鲁棒性。实验采取UCI数据集,在0%、5%、10%以及20%的不同比例噪声环境对IF-SLSTSVM算法的有效性进行验证。结果显示相较于6种对比算法,IF-SLSTSVM算法有更好的鲁棒性。展开更多
基金Supported by the joint fund of National Natural Science Foundation of China and Civil Aviation Administration Foundation of China(No.U1233201)
文摘A fault diagnosis model is proposed based on fuzzy support vector machine (FSVM) combined with fuzzy clustering (FC).Considering the relationship between the sample point and non-self class,FC algorithm is applied to generate fuzzy memberships.In the algorithm,sample weights based on a distribution density function of data point and genetic algorithm (GA) are introduced to enhance the performance of FC.Then a multi-class FSVM with radial basis function kernel is established according to directed acyclic graph algorithm,the penalty factor and kernel parameter of which are optimized by GA.Finally,the model is executed for multi-class fault diagnosis of rolling element bearings.The results show that the presented model achieves high performances both in identifying fault types and fault degrees.The performance comparisons of the presented model with SVM and distance-based FSVM for noisy case demonstrate the capacity of dealing with noise and generalization.
基金supported by National Natural Science Foundation of China (Grant No. 50675219)Hu’nan Provincial Science Committee Excellent Youth Foundation of China (Grant No. 08JJ1008)
文摘Turbopump condition monitoring is a significant approach to ensure the safety of liquid rocket engine (LRE).Because of lack of fault samples,a monitoring system cannot be trained on all possible condition patterns.Thus it is important to differentiate abnormal or unknown patterns from normal pattern with novelty detection methods.One-class support vector machine (OCSVM) that has been commonly used for novelty detection cannot deal well with large scale samples.In order to model the normal pattern of the turbopump with OCSVM and so as to monitor the condition of the turbopump,a monitoring method that integrates OCSVM with incremental clustering is presented.In this method,the incremental clustering is used for sample reduction by extracting representative vectors from a large training set.The representative vectors are supposed to distribute uniformly in the object region and fulfill the region.And training OCSVM on these representative vectors yields a novelty detector.By applying this method to the analysis of the turbopump's historical test data,it shows that the incremental clustering algorithm can extract 91 representative points from more than 36 000 training vectors,and the OCSVM detector trained on these 91 representative points can recognize spikes in vibration signals caused by different abnormal events such as vane shedding,rub-impact and sensor faults.This monitoring method does not need fault samples during training as classical recognition methods.The method resolves the learning problem of large samples and is an alternative method for condition monitoring of the LRE turbopump.
基金Sponsored by the National Natural Science Foundation of China(Grant No.61201310)the Fundamental Research Funds for the Central Universities(Grant No.HIT.NSRIF.201160)the China Postdoctoral Science Foundation(Grant No.20110491067)
文摘Based on the framework of support vector machines( SVM) using one-against-one( OAO) strategy, a new multi-class kernel method based on directed acyclic graph( DAG) and probabilistic distance is proposed to raise the multi-class classification accuracies. The topology structure of DAG is constructed by rearranging the nodes' sequence in the graph. DAG is equivalent to guided operating SVM on a list,and the classification performance depends on the nodes' sequence in the graph. Jeffries-Matusita distance( JMD) is introduced to estimate the separability of each class,and the implementation list is initialized with all classes organized according to certain sequence in the list. To testify the effectiveness of the proposed method,numerical analysis is conducted on UCI data and hyperspectral data. Meanwhile,comparative studies using standard OAO and DAG classification methods are also conducted and the results illustrate better performance and higher accuracy of the proposed JMD-DAG method.
基金National Natural Science Foundation of China ( No. 61070033 )Fundamental Research Funds for the Central Universities,China( No. 2012ZM0061)
文摘It is a challenging topic to develop an efficient algorithm for large scale classification problems in many applications of machine learning. In this paper, a hierarchical clustering and fixed-layer local learning (HCFLL) based support vector machine(SVM) algorithm is proposed to deal with this problem. Firstly, HCFLL hierarchically clusters a given dataset into a modified clustering feature tree based on the ideas of unsupervised clustering and supervised clustering. Then it locally trains SVM on each labeled subtree at a fixed-layer of the tree. The experimental results show that compared with the existing popular algorithms such as core vector machine and decision-tree support vector machine, HCFLL can significantly improve the training and testing speeds with comparable testing accuracy.
基金funded by Qatar University Internal Grant under Grant No.IRCC-2020-009.The ndings achieved herein are solely the responsibility of the authors。
文摘During the last two decades signicant work has been reported in the eld of cursive language’s recognition especially,in the Arabic,the Urdu and the Persian languages.The unavailability of such work in the Pashto language is because of:the absence of a standard database and of signicant research work that ultimately acts as a big barrier for the research community.The slight change in the Pashto characters’shape is an additional challenge for researchers.This paper presents an efcient OCR system for the handwritten Pashto characters based on multi-class enabled support vector machine using manifold feature extraction techniques.These feature extraction techniques include,tools such as zoning feature extractor,discrete cosine transform,discrete wavelet transform,and Gabor lters and histogram of oriented gradients.A hybrid feature map is developed by combining the manifold feature maps.This research work is performed by developing a medium-sized dataset of handwritten Pashto characters that encapsulate 200 handwritten samples for each 44 characters in the Pashto language.Recognition results are generated for the proposed model based on a manifold and hybrid feature map.An overall accuracy rates of 63.30%,65.13%,68.55%,68.28%,67.02%and 83%are generated based on a zoning technique,HoGs,Gabor lter,DCT,DWT and hybrid feature maps respectively.Applicability of the proposed model is also tested by comparing its results with a convolution neural network model.The convolution neural network-based model generated an accuracy rate of 81.02%smaller than the multi-class support vector machine.The highest accuracy rate of 83%for the multi-class SVM model based on a hybrid feature map reects the applicability of the proposed model.
基金National Natural Science Foundation of China (No. 60975083)Key Grant Project,Ministry of Education,China(No. 104145)
文摘A new algorithm named kernel bisecting k-means and sample removal(KBK-SR) is proposed as sampling preprocessing for support vector machine(SVM) training to improve the efficiency.The proposed algorithm tends to quickly produce balanced clusters of similar sizes in the kernel feature space,which makes it efficient and effective for reducing training samples.Theoretical analysis and experimental results on three UCI real data benchmarks both show that,with very short sampling time,the proposed algorithm dramatically accelerates SVM sampling and training while maintaining high test accuracy.
基金the 973 Program of China (No.2002CB312200)the National Science Foundation of China (No.60574019)
文摘In this paper, a support vector machine-based multi-model predictive control is proposed, in which SVM classification combines well with SVM regression. At first, each working environment is modeled by SVM regression and the support vector machine network-based model predictive control (SVMN-MPC) algorithm corresponding to each environment is developed, and then a multi-class SVM model is established to recognize multiple operating conditions. As for control, the current environment is identified by the multi-class SVM model and then the corresponding SVMN-MPC controller is activated at each sampling instant. The proposed modeling, switching and controller design is demonstrated in simulation results.
文摘The least squares support vector machine (LS-SVM) is used to study the nonlinear time series prediction.First, the parameter γ and multi-step prediction capabilities of the LS-SVM network are discussed. Then we employ clustering method in the model to prune the number of the support values. The learning rate and the capabilities of filtering noise for LS-SVM are all greatly improved.
基金Supported by the National Natural Science Foundation of China (No. 60771068)the Natural Science Basic Research Plan in Shaanxi Province of China (No. 2007F248)
文摘Tensor representation is useful to reduce the overfitting problem in vector-based learning algorithm in pattern recognition.This is mainly because the structure information of objects in pattern analysis is a reasonable constraint to reduce the number of unknown parameters used to model a classifier.In this paper, we generalize the vector-based learning algorithm TWin Support Vector Machine(TWSVM) to the tensor-based method TWin Support Tensor Machines(TWSTM), which accepts general tensors as input.To examine the effectiveness of TWSTM, we implement the TWSTM method for Microcalcification Clusters(MCs) detection.In the tensor subspace domain, the MCs detection procedure is formulated as a supervised learning and classification problem, and TWSTM is used as a classifier to make decision for the presence of MCs or not.A large number of experiments were carried out to evaluate and compare the performance of the proposed MCs detection algorithm.By comparison with TWSVM, the tensor version reduces the overfitting problem.
基金TheNationalHighTechnologyResearchandDevelopmentProgramofChina (No .86 3 5 11 930 0 0 9)
文摘Support Vector Clustering (SVC) is a kernel-based unsupervised learning clustering method. The main drawback of SVC is its high computational complexity in getting the adjacency matrix describing the connectivity for each pairs of points. Based on the proximity graph model [3], the Euclidean distance in Hilbert space is calculated using a Gaussian kernel, which is the right criterion to generate a minimum spanning tree using Kruskal's algorithm. Then the connectivity estimation is lowered by only checking the linkages between the edges that construct the main stem of the MST (Minimum Spanning Tree), in which the non-compatibility degree is originally defined to support the edge selection during linkage estimations. This new approach is experimentally analyzed. The results show that the revised algorithm has a better performance than the proximity graph model with faster speed, optimized clustering quality and strong ability to noise suppression, which makes SVC scalable to large data sets.
文摘Disease recognition in plants is one of the essential problems in agricultural image processing.This article focuses on designing a framework that can recognize and classify diseases on pomegranate plants exactly.The framework utilizes image processing techniques such as image acquisition,image resizing,image enhancement,image segmentation,ROI extraction(region of interest),and feature extraction.An image dataset related to pomegranate leaf disease is utilized to implement the framework,divided into a training set and a test set.In the implementation process,techniques such as image enhancement and image segmentation are primarily used for identifying ROI and features.An image classification will then be implemented by combining a supervised learning model with a support vector machine.The proposed framework is developed based on MATLAB with a graphical user interface.According to the experimental results,the proposed framework can achieve 98.39%accuracy for classifying diseased and healthy leaves.Moreover,the framework can achieve an accuracy of 98.07%for classifying diseases on pomegranate leaves.
文摘Focusing on strip steel surface defects classification, a novel support vector machine with adjustable hyper-sphere (AHSVM) is formulated. Meanwhile, a new multi-class classification method is proposed. Originated from support vector data description, AHSVM adopts hyper-sphere to solve classification problem. AHSVM can obey two principles: the margin maximization and inner-class dispersion minimization. Moreover, the hyper-sphere of AHSVM is adjustable, which makes the final classification hyper-sphere optimal for training dataset. On the other hand, AHSVM is combined with binary tree to solve multi-class classification for steel surface defects. A scheme of samples pruning in mapped feature space is provided, which can reduce the number of training samples under the premise of classification accuracy, resulting in the improvements of classification speed. Finally, some testing experiments are done for eight types of strip steel surface defects. Experimental results show that multi-class AHSVM classifier exhibits satisfactory results in classification accuracy and efficiency.
文摘针对最小二乘孪生支持向量机(least squares twin support vector machine,LSTSVM)对噪声或是异常数据敏感和忽略数据内在结构信息的问题,提出了一种直觉模糊的结构化最小二乘孪生支持向量机(intuition fuzzy and structural least squares twin support vector machine,IF-SLSTSVM)。首先采用孤立森林对输入样本点进行预处理;然后通过直觉模糊数的概念,赋予输入样本点不同的权重以减少噪声或是异常数据对分类超平面产生的影响;最后采用K-Means算法,以协方差的形式获取输入样本点之间的结构信息。IFSLSTSVM在LS-TSVM的基础上,考虑了输入样本点在特征空间中的分布信息及输入样本点之间的关系,提高了模型的鲁棒性。实验采取UCI数据集,在0%、5%、10%以及20%的不同比例噪声环境对IF-SLSTSVM算法的有效性进行验证。结果显示相较于6种对比算法,IF-SLSTSVM算法有更好的鲁棒性。