One-class classification problem has become a popular problem in many fields, with a wide range of applications in anomaly detection, fault diagnosis, and face recognition. We investigate the one-class classification ...One-class classification problem has become a popular problem in many fields, with a wide range of applications in anomaly detection, fault diagnosis, and face recognition. We investigate the one-class classification problem for second-order tensor data. Traditional vector-based one-class classification methods such as one-class support vector machine (OCSVM) and least squares one-class support vector machine (LSOCSVM) have limitations when tensor is used as input data, so we propose a new tensor one-class classification method, LSOCSTM, which directly uses tensor as input data. On one hand, using tensor as input data not only enables to classify tensor data, but also for vector data, classifying it after high dimensionalizing it into tensor still improves the classification accuracy and overcomes the over-fitting problem. On the other hand, different from one-class support tensor machine (OCSTM), we use squared loss instead of the original loss function so that we solve a series of linear equations instead of quadratic programming problems. Therefore, we use the distance to the hyperplane as a metric for classification, and the proposed method is more accurate and faster compared to existing methods. The experimental results show the high efficiency of the proposed method compared with several state-of-the-art methods.展开更多
Turbopump condition monitoring is a significant approach to ensure the safety of liquid rocket engine (LRE).Because of lack of fault samples,a monitoring system cannot be trained on all possible condition patterns.T...Turbopump condition monitoring is a significant approach to ensure the safety of liquid rocket engine (LRE).Because of lack of fault samples,a monitoring system cannot be trained on all possible condition patterns.Thus it is important to differentiate abnormal or unknown patterns from normal pattern with novelty detection methods.One-class support vector machine (OCSVM) that has been commonly used for novelty detection cannot deal well with large scale samples.In order to model the normal pattern of the turbopump with OCSVM and so as to monitor the condition of the turbopump,a monitoring method that integrates OCSVM with incremental clustering is presented.In this method,the incremental clustering is used for sample reduction by extracting representative vectors from a large training set.The representative vectors are supposed to distribute uniformly in the object region and fulfill the region.And training OCSVM on these representative vectors yields a novelty detector.By applying this method to the analysis of the turbopump's historical test data,it shows that the incremental clustering algorithm can extract 91 representative points from more than 36 000 training vectors,and the OCSVM detector trained on these 91 representative points can recognize spikes in vibration signals caused by different abnormal events such as vane shedding,rub-impact and sensor faults.This monitoring method does not need fault samples during training as classical recognition methods.The method resolves the learning problem of large samples and is an alternative method for condition monitoring of the LRE turbopump.展开更多
Machine learning algorithms operating in an unsupervised fashion has emerged as promising tools for detecting structural damage in an automated fashion.Its essence relies on selecting appropriate features to train the...Machine learning algorithms operating in an unsupervised fashion has emerged as promising tools for detecting structural damage in an automated fashion.Its essence relies on selecting appropriate features to train the model using the reference data set collected from the healthy structure and employing the trained model to identify outlier conditions representing the damaged state.In this paper,the coefficients and the residuals of the autoregressive model with exogenous input created using only the measured output signals are extracted as damage features.These features obtained at the baseline state for each sensor cluster are then utilized to train the one class support vector machine,an unsupervised classifier generating a decision function using only patterns belonging to this baseline state.Structural damage,once detected by the trained machine,a damage index based on comparison of the residuals between the trained class and the outlier state is implemented for localizing damage.The two-step damage assessment framework is first implemented on an eight degree-of-freedom numerical model with the effects of measurement noise integrated.Subsequently,vibration data collected from a one-story one-bay reinforced concrete frame inflicted with progressive levels of damage have been utilized to verify the accuracy and robustness of the proposed methodology.展开更多
Invasive breast carcinoma(BRCA)is associated with poor prognosis and high risk of mortality.Therefore,it is critical to identify novel biomarkers for the prognostic assessment of BRCA.Methods:The expression data of po...Invasive breast carcinoma(BRCA)is associated with poor prognosis and high risk of mortality.Therefore,it is critical to identify novel biomarkers for the prognostic assessment of BRCA.Methods:The expression data of polo-like kinase 1(PLK1)in BRCA and the corresponding clinical information were extracted from TCGA and GEO databases.PLK1 expression was validated in diverse breast cancer cell lines by quantitative real-time polymerase chain reaction(qRT-PCR)and western blotting.Single sample gene set enrichment analysis(ssGSEA)was performed to evaluate immune infiltration in the BRCA microenvironment,and the random forest(RF)and support vector machine(SVM)algorithms were used to screen for the hub infiltrating cells and calculate the immunophenoscore(IPS).The RF algorithm and COX regression model were applied to calculate survival risk scores based on the PLK1 expression and immune cell infiltration.Finally,a prognostic nomogram was constructed with the risk score and pathological stage,and its clinical potential was evaluated by plotting calibration charts and DCA curves.The application of the nomogram was further validated in an immunotherapy cohort.Results:PLK1 expression was significantly higher in the tumor samples in TCGA-BRCA cohort.Furthermore,PLK1 expression level,age and stage were identified as independent prognostic factors of BRCA.While the IPS was unaffected by PLK1 expression,the TMB and MATH scores were higher in the PLK1-high group,and the TIDE scores were higher for the PLK1-low patients.We also identified 6 immune cell types with high infiltration,along with 11 immune cell types with low infiltration in the PLK1-high tumors.A risk score was devised using PLK1 expression and hub immune cells,which predicted the prognosis of BRCA patients.In addition,a nomogram was constructed based on the risk score and pathological staging,and showed good predictive performance.Conclusions:PLK1 expression and immune cell infiltration can predict post-immunotherapy prognosis of BRCA patients.展开更多
One-class support vector machines (one-class SVMs) are powerful tools that are widely used in many applications. This paper describes a semi-supervised one-class SVM that uses supervision in terms of relative compar...One-class support vector machines (one-class SVMs) are powerful tools that are widely used in many applications. This paper describes a semi-supervised one-class SVM that uses supervision in terms of relative comparisons. The analysis uses a hypersphere version of one-class SVMs with a penalty term appended to the objective function. The method simultaneously finds the minimum sphere in the feature space that encloses most of the target points and considers the relative comparisons. The result is a standard convex quadratic programming problem, which can be solved by adapting standard methods for SVM training, i.e., sequential minimal optimization. This one-class SVM can be applied to semi-supervised clustering and multi-classification problems. Tests show that this method achieves higher accuracy and better generalization performance than previous SVMs.展开更多
One-class support vector machine (OCSVM) and support vector data description (SVDD) are two main domain-based one-class (kernel) classifiers. To reveal their relationship with density estimation in the case of t...One-class support vector machine (OCSVM) and support vector data description (SVDD) are two main domain-based one-class (kernel) classifiers. To reveal their relationship with density estimation in the case of the Gaussian kernel, OCSVM and SVDD are firstly unified into the framework of kernel density estimation, and the essential relationship between them is explicitly revealed. Then the result proves that the density estimation induced by OCSVM or SVDD is in agreement with the true density. Meanwhile, it can also reduce the integrated squared error (ISE). Finally, experiments on several simulated datasets verify the revealed relationships.展开更多
归一化植被指数(normalized difference vegetation index,NDVI)时间序列已广泛应用于植被信息提取研究,然而目前NDVI时间序列的研究主要集中于中低分辨率遥感影像,从而影响了植被信息提取的精度。随着中国高分专项首颗卫星高分一号(GF...归一化植被指数(normalized difference vegetation index,NDVI)时间序列已广泛应用于植被信息提取研究,然而目前NDVI时间序列的研究主要集中于中低分辨率遥感影像,从而影响了植被信息提取的精度。随着中国高分专项首颗卫星高分一号(GF-1)的发射,为高分辨率NDVI时间序列的构建提供了可能。该文尝试利用GF-1卫星16 m宽覆盖(wide field of view,WFV)影像,构建16 m分辨率NDVI时间序列,以河北省唐山市南部区域为研究区,开展作物分类研究。该文采用覆盖作物完整生长期的GF-1数据构建NDVI时间序列,避免了利用自然年(1-12月)数据构建NDVI时间序列的不足,有助于作物信息的提取。通过分析样地的NDVI时序曲线,发现GF-1/WFV NDVI时间序列能够清晰地区分不同作物的物候差异,捕捉作物特有的生长特性,而且能够识别研究区当年的作物种植模式。该文分别采用最大似然法、马氏距离、最小距离、神经网络分类、支持向量机(support vector machine,SVM)等分类方法,基于GF-1/WFV NDVI时间序列对研究区作物进行分类,研究结果表明SVM分类方法总体精度最高,达到96.33%。同时该文还采用时间序列谐波分析法(harmonic analysis of time series,HANTS)对NDVI时间序列进行了平滑处理,结果表明处理后的NDVI时间序列能更好地描述作物的物候特性,作物分类精度得到进一步提高。展开更多
Power quality assessment is an important performance measurement in smart grids.Utility companies are interested in power quality monitoring even in the low level distribution side such as smart meters.Addressing this...Power quality assessment is an important performance measurement in smart grids.Utility companies are interested in power quality monitoring even in the low level distribution side such as smart meters.Addressing this issue,in this study,we propose segregation of the power disturbance from regular values using one-class support vector machine(OCSVM).To precisely detect the power disturbances of a voltage wave,some practical wavelet filters are applied.Considering the unlimited types of waveform abnormalities,OCSVM is picked as a semisupervised machine learning algorithm which needs to be trained solely on a relatively large sample of normal data.This model is able to automatically detect the existence of any types of disturbances in real time,even unknown types which are not available in the training time.In the case of existence,the disturbances are further classified into different types such as sag,swell,transients and unbalanced.Being light weighted and fast,the proposed technique can be integrated into smart grid devices such as smart meter in order to perform a real-time disturbance monitoring.The continuous monitoring of power quality in smart meters will give helpful insight for quality power transmission and management.展开更多
文摘One-class classification problem has become a popular problem in many fields, with a wide range of applications in anomaly detection, fault diagnosis, and face recognition. We investigate the one-class classification problem for second-order tensor data. Traditional vector-based one-class classification methods such as one-class support vector machine (OCSVM) and least squares one-class support vector machine (LSOCSVM) have limitations when tensor is used as input data, so we propose a new tensor one-class classification method, LSOCSTM, which directly uses tensor as input data. On one hand, using tensor as input data not only enables to classify tensor data, but also for vector data, classifying it after high dimensionalizing it into tensor still improves the classification accuracy and overcomes the over-fitting problem. On the other hand, different from one-class support tensor machine (OCSTM), we use squared loss instead of the original loss function so that we solve a series of linear equations instead of quadratic programming problems. Therefore, we use the distance to the hyperplane as a metric for classification, and the proposed method is more accurate and faster compared to existing methods. The experimental results show the high efficiency of the proposed method compared with several state-of-the-art methods.
基金supported by National Natural Science Foundation of China (Grant No. 50675219)Hu’nan Provincial Science Committee Excellent Youth Foundation of China (Grant No. 08JJ1008)
文摘Turbopump condition monitoring is a significant approach to ensure the safety of liquid rocket engine (LRE).Because of lack of fault samples,a monitoring system cannot be trained on all possible condition patterns.Thus it is important to differentiate abnormal or unknown patterns from normal pattern with novelty detection methods.One-class support vector machine (OCSVM) that has been commonly used for novelty detection cannot deal well with large scale samples.In order to model the normal pattern of the turbopump with OCSVM and so as to monitor the condition of the turbopump,a monitoring method that integrates OCSVM with incremental clustering is presented.In this method,the incremental clustering is used for sample reduction by extracting representative vectors from a large training set.The representative vectors are supposed to distribute uniformly in the object region and fulfill the region.And training OCSVM on these representative vectors yields a novelty detector.By applying this method to the analysis of the turbopump's historical test data,it shows that the incremental clustering algorithm can extract 91 representative points from more than 36 000 training vectors,and the OCSVM detector trained on these 91 representative points can recognize spikes in vibration signals caused by different abnormal events such as vane shedding,rub-impact and sensor faults.This monitoring method does not need fault samples during training as classical recognition methods.The method resolves the learning problem of large samples and is an alternative method for condition monitoring of the LRE turbopump.
基金funding provided by the Scientific and Technological Research Council of Türkiye(TÜBİTAK).
文摘Machine learning algorithms operating in an unsupervised fashion has emerged as promising tools for detecting structural damage in an automated fashion.Its essence relies on selecting appropriate features to train the model using the reference data set collected from the healthy structure and employing the trained model to identify outlier conditions representing the damaged state.In this paper,the coefficients and the residuals of the autoregressive model with exogenous input created using only the measured output signals are extracted as damage features.These features obtained at the baseline state for each sensor cluster are then utilized to train the one class support vector machine,an unsupervised classifier generating a decision function using only patterns belonging to this baseline state.Structural damage,once detected by the trained machine,a damage index based on comparison of the residuals between the trained class and the outlier state is implemented for localizing damage.The two-step damage assessment framework is first implemented on an eight degree-of-freedom numerical model with the effects of measurement noise integrated.Subsequently,vibration data collected from a one-story one-bay reinforced concrete frame inflicted with progressive levels of damage have been utilized to verify the accuracy and robustness of the proposed methodology.
基金funded by the Natural Science Foundation of Higher Education Institutions of Auhui Province(Grant No.KJ2021A0352)the Research Fund Project of Anhui Medical University(Grant No.2020xkj236)Applied Medicine Research Project of Hefei Health Commission(Grant No.HWKJ2019-172-14).
文摘Invasive breast carcinoma(BRCA)is associated with poor prognosis and high risk of mortality.Therefore,it is critical to identify novel biomarkers for the prognostic assessment of BRCA.Methods:The expression data of polo-like kinase 1(PLK1)in BRCA and the corresponding clinical information were extracted from TCGA and GEO databases.PLK1 expression was validated in diverse breast cancer cell lines by quantitative real-time polymerase chain reaction(qRT-PCR)and western blotting.Single sample gene set enrichment analysis(ssGSEA)was performed to evaluate immune infiltration in the BRCA microenvironment,and the random forest(RF)and support vector machine(SVM)algorithms were used to screen for the hub infiltrating cells and calculate the immunophenoscore(IPS).The RF algorithm and COX regression model were applied to calculate survival risk scores based on the PLK1 expression and immune cell infiltration.Finally,a prognostic nomogram was constructed with the risk score and pathological stage,and its clinical potential was evaluated by plotting calibration charts and DCA curves.The application of the nomogram was further validated in an immunotherapy cohort.Results:PLK1 expression was significantly higher in the tumor samples in TCGA-BRCA cohort.Furthermore,PLK1 expression level,age and stage were identified as independent prognostic factors of BRCA.While the IPS was unaffected by PLK1 expression,the TMB and MATH scores were higher in the PLK1-high group,and the TIDE scores were higher for the PLK1-low patients.We also identified 6 immune cell types with high infiltration,along with 11 immune cell types with low infiltration in the PLK1-high tumors.A risk score was devised using PLK1 expression and hub immune cells,which predicted the prognosis of BRCA patients.In addition,a nomogram was constructed based on the risk score and pathological staging,and showed good predictive performance.Conclusions:PLK1 expression and immune cell infiltration can predict post-immunotherapy prognosis of BRCA patients.
基金Supported by the National Natural Science Foundation of China(No. 60872070)
文摘One-class support vector machines (one-class SVMs) are powerful tools that are widely used in many applications. This paper describes a semi-supervised one-class SVM that uses supervision in terms of relative comparisons. The analysis uses a hypersphere version of one-class SVMs with a penalty term appended to the objective function. The method simultaneously finds the minimum sphere in the feature space that encloses most of the target points and considers the relative comparisons. The result is a standard convex quadratic programming problem, which can be solved by adapting standard methods for SVM training, i.e., sequential minimal optimization. This one-class SVM can be applied to semi-supervised clustering and multi-classification problems. Tests show that this method achieves higher accuracy and better generalization performance than previous SVMs.
基金Supported by the National Natural Science Foundation of China(60603029)the Natural Science Foundation of Jiangsu Province(BK2007074)the Natural Science Foundation for Colleges and Universities in Jiangsu Province(06KJB520132)~~
文摘One-class support vector machine (OCSVM) and support vector data description (SVDD) are two main domain-based one-class (kernel) classifiers. To reveal their relationship with density estimation in the case of the Gaussian kernel, OCSVM and SVDD are firstly unified into the framework of kernel density estimation, and the essential relationship between them is explicitly revealed. Then the result proves that the density estimation induced by OCSVM or SVDD is in agreement with the true density. Meanwhile, it can also reduce the integrated squared error (ISE). Finally, experiments on several simulated datasets verify the revealed relationships.
文摘归一化植被指数(normalized difference vegetation index,NDVI)时间序列已广泛应用于植被信息提取研究,然而目前NDVI时间序列的研究主要集中于中低分辨率遥感影像,从而影响了植被信息提取的精度。随着中国高分专项首颗卫星高分一号(GF-1)的发射,为高分辨率NDVI时间序列的构建提供了可能。该文尝试利用GF-1卫星16 m宽覆盖(wide field of view,WFV)影像,构建16 m分辨率NDVI时间序列,以河北省唐山市南部区域为研究区,开展作物分类研究。该文采用覆盖作物完整生长期的GF-1数据构建NDVI时间序列,避免了利用自然年(1-12月)数据构建NDVI时间序列的不足,有助于作物信息的提取。通过分析样地的NDVI时序曲线,发现GF-1/WFV NDVI时间序列能够清晰地区分不同作物的物候差异,捕捉作物特有的生长特性,而且能够识别研究区当年的作物种植模式。该文分别采用最大似然法、马氏距离、最小距离、神经网络分类、支持向量机(support vector machine,SVM)等分类方法,基于GF-1/WFV NDVI时间序列对研究区作物进行分类,研究结果表明SVM分类方法总体精度最高,达到96.33%。同时该文还采用时间序列谐波分析法(harmonic analysis of time series,HANTS)对NDVI时间序列进行了平滑处理,结果表明处理后的NDVI时间序列能更好地描述作物的物候特性,作物分类精度得到进一步提高。
基金supported in part through U.S.National Science Foundation(No.1553494).
文摘Power quality assessment is an important performance measurement in smart grids.Utility companies are interested in power quality monitoring even in the low level distribution side such as smart meters.Addressing this issue,in this study,we propose segregation of the power disturbance from regular values using one-class support vector machine(OCSVM).To precisely detect the power disturbances of a voltage wave,some practical wavelet filters are applied.Considering the unlimited types of waveform abnormalities,OCSVM is picked as a semisupervised machine learning algorithm which needs to be trained solely on a relatively large sample of normal data.This model is able to automatically detect the existence of any types of disturbances in real time,even unknown types which are not available in the training time.In the case of existence,the disturbances are further classified into different types such as sag,swell,transients and unbalanced.Being light weighted and fast,the proposed technique can be integrated into smart grid devices such as smart meter in order to perform a real-time disturbance monitoring.The continuous monitoring of power quality in smart meters will give helpful insight for quality power transmission and management.