Principal Component Analysis (PCA) is a widely used technique for data analysis and dimensionality reduction, but its sensitivity to feature scale and outliers limits its applicability. Robust Principal Component Anal...Principal Component Analysis (PCA) is a widely used technique for data analysis and dimensionality reduction, but its sensitivity to feature scale and outliers limits its applicability. Robust Principal Component Analysis (RPCA) addresses these limitations by decomposing data into a low-rank matrix capturing the underlying structure and a sparse matrix identifying outliers, enhancing robustness against noise and outliers. This paper introduces a novel RPCA variant, Robust PCA Integrating Sparse and Low-rank Priors (RPCA-SL). Each prior targets a specific aspect of the data’s underlying structure and their combination allows for a more nuanced and accurate separation of the main data components from outliers and noise. Then RPCA-SL is solved by employing a proximal gradient algorithm for improved anomaly detection and data decomposition. Experimental results on simulation and real data demonstrate significant advancements.展开更多
In this paper, we present continuous iteratively reweighted least squares algorithm (CIRLS) for solving the linear models problem by convex relaxation, and prove the convergence of this algorithm. Under some condition...In this paper, we present continuous iteratively reweighted least squares algorithm (CIRLS) for solving the linear models problem by convex relaxation, and prove the convergence of this algorithm. Under some conditions, we give an error bound for the algorithm. In addition, the numerical result shows the efficiency of the algorithm.展开更多
The principal component analysis (PCA) algorithm is widely applied in a diverse range of fields for performance assessment, fault detection, and diagnosis. However, in the presence of noise and gross errors, the non...The principal component analysis (PCA) algorithm is widely applied in a diverse range of fields for performance assessment, fault detection, and diagnosis. However, in the presence of noise and gross errors, the nonlinear PCA (NLPCA) using autoassociative bottle-neck neural networks is so sensitive that the obtained model differs significantly from the underlying system. In this paper, a robust version of NLPCA is introduced by replacing the generally used error criterion mean squared error with a mean log squared error. This is followed by a concise analysis of the corresponding training method. A novel multivariate statistical process monitoring (MSPM) scheme incorporating the proposed robust NLPCA technique is then investigated and its efficiency is assessed through application to an industrial fluidized catalytic cracking plant. The results demonstrate that, compared with NLPCA, the proposed approach can effectively reduce the number of false alarms and is, hence, expected to better monitor real-world processes.展开更多
In order to reduce the variations of the product quality in batch processes, multivariate statistical process control methods according to multi-way principal component analysis (MPCA) or multi-way projection to laten...In order to reduce the variations of the product quality in batch processes, multivariate statistical process control methods according to multi-way principal component analysis (MPCA) or multi-way projection to latent structure (MPLS) were proposed for on-line batch process monitoring. However, they are based on the decomposition of relative covariance matrix and strongly affected by outlying observations. In this paper, in view of an efficient projection pursuit algorithm, a robust statistical batch process monitoring (RSBPM) framework,which is resistant to outliers, is proposed to reduce the high demand for modeling data. The construction of robust normal operating condition model and robust control limits are discussed in detail. It is evaluated on monitoring an industrial streptomycin fermentation process and compared with the conventional MPCA. The results show that the RSBPM framework is resistant to possible outliers and the robustness is confirmed.展开更多
The existing recommendation algorithms have lower robustness in facing of shilling attacks. Considering this problem, we present a robust recommendation algorithm based on kernel principal component analysis and fuzzy...The existing recommendation algorithms have lower robustness in facing of shilling attacks. Considering this problem, we present a robust recommendation algorithm based on kernel principal component analysis and fuzzy c-means clustering. Firstly, we use kernel principal component analysis method to reduce the dimensionality of the original rating matrix, which can extract the effective features of users and items. Then, according to the dimension-reduced rating matrix and the high correlation characteristic between attack profiles, we use fuzzy c-means clustering method to cluster user profiles, which can realize the effective separation of genuine profiles and attack profiles. Finally, we construct an indicator function based on the attack detection results to decrease the influence of attack profiles on the recommendation, and incorporate it into the matrix factorization technology to design the corresponding robust recommendation algorithm. Experiment results indicate that the proposed algorithm is superior to the existing methods in both recommendation accuracy and robustness.展开更多
Discriminating internal layers by radio echo sounding is important in analyzing the thickness and ice deposits in the Antarctic ice sheet.The signal processing method of synthesis aperture radar(SAR)has been widely us...Discriminating internal layers by radio echo sounding is important in analyzing the thickness and ice deposits in the Antarctic ice sheet.The signal processing method of synthesis aperture radar(SAR)has been widely used for improving the signal to noise ratio(SNR)and discriminating internal layers by radio echo sounding data of ice sheets.This method is not efficient when we use edge detection operators to obtain accurate information of the layers,especially the ice-bed interface.This paper presents a new image processing method via a combined robust principal component analysis-total variation(RPCA-TV)approach for discriminating internal layers of ice sheets by radio echo sounding data.The RPCA-based method is adopted to project the high-dimensional observations to low-dimensional subspace structure to accelerate the operation of the TV-based method,which is used to discriminate the internal layers.The efficiency of the presented method has been tested on simulation data and the dataset of the Institute of Electronics,Chinese Academy of Sciences,collected during CHINARE 28.The results show that the new method is more efficient than the previous method in discriminating internal layers of ice sheets by radio echo sounding data.展开更多
Robust principal component analysis(PCA) is widely used in many applications, such as image processing, data mining and bioinformatics. The existing methods for solving the robust PCA are mostly based on nuclear norm ...Robust principal component analysis(PCA) is widely used in many applications, such as image processing, data mining and bioinformatics. The existing methods for solving the robust PCA are mostly based on nuclear norm minimization. Those methods simultaneously minimize all the singular values, and thus the rank cannot be well approximated in practice. We extend the idea of truncated nuclear norm regularization(TNNR) to the robust PCA and consider truncated nuclear norm minimization(TNNM) instead of nuclear norm minimization(NNM). This method only minimizes the smallest N-r singular values to preserve the low-rank components, where N is the number of singular values and r is the matrix rank. Moreover, we propose an effective way to determine r via the shrinkage operator. Then we develop an effective iterative algorithm based on the alternating direction method to solve this optimization problem. Experimental results demonstrate the efficiency and accuracy of the TNNM method. Moreover, this method is much more robust in terms of the rank of the reconstructed matrix and the sparsity of the error.展开更多
Gauge duality theory was originated by Preund (1987), and was recently further investigated by Friedlander et al. (2014). When solving some matrix optimization problems via gauge dual, one is usually able to avoid...Gauge duality theory was originated by Preund (1987), and was recently further investigated by Friedlander et al. (2014). When solving some matrix optimization problems via gauge dual, one is usually able to avoid full matrix decompositions such as singular value and/or eigenvalue decompositions. In such an approach, a gauge dual problem is solved in the first stage, and then an optimal solution to the primal problem can be recovered from the dual optimal solution obtained in the first stage. Recently, this theory has been applied to a class of semidefinite programming (SDP) problems with promising numerical results by Friedlander and Mac^to (2016). We establish some theoretical results on applying the gauge duality theory to robust principal component analysis (PCA) and general SDP. For each problem, we present its gauge dual problem, characterize the optimality conditions for the primal-dual gauge pair, and validate a way to recover a primal optimal solution from a dual one. These results are extensions of Friedlander and Macedo (2016) from nuclear norm regularization to robust PCA and from a special class of SDP which requires the coefficient matrix in the linear objective to be positive definite to SDP problems without this restriction. Our results provide further understanding in the potential advantages and disadvantages of the gauge duality theory.展开更多
文摘Principal Component Analysis (PCA) is a widely used technique for data analysis and dimensionality reduction, but its sensitivity to feature scale and outliers limits its applicability. Robust Principal Component Analysis (RPCA) addresses these limitations by decomposing data into a low-rank matrix capturing the underlying structure and a sparse matrix identifying outliers, enhancing robustness against noise and outliers. This paper introduces a novel RPCA variant, Robust PCA Integrating Sparse and Low-rank Priors (RPCA-SL). Each prior targets a specific aspect of the data’s underlying structure and their combination allows for a more nuanced and accurate separation of the main data components from outliers and noise. Then RPCA-SL is solved by employing a proximal gradient algorithm for improved anomaly detection and data decomposition. Experimental results on simulation and real data demonstrate significant advancements.
文摘In this paper, we present continuous iteratively reweighted least squares algorithm (CIRLS) for solving the linear models problem by convex relaxation, and prove the convergence of this algorithm. Under some conditions, we give an error bound for the algorithm. In addition, the numerical result shows the efficiency of the algorithm.
基金Supported by the National High-Tech Research and Development (863) Program of China (No. 2001AA413320)
文摘The principal component analysis (PCA) algorithm is widely applied in a diverse range of fields for performance assessment, fault detection, and diagnosis. However, in the presence of noise and gross errors, the nonlinear PCA (NLPCA) using autoassociative bottle-neck neural networks is so sensitive that the obtained model differs significantly from the underlying system. In this paper, a robust version of NLPCA is introduced by replacing the generally used error criterion mean squared error with a mean log squared error. This is followed by a concise analysis of the corresponding training method. A novel multivariate statistical process monitoring (MSPM) scheme incorporating the proposed robust NLPCA technique is then investigated and its efficiency is assessed through application to an industrial fluidized catalytic cracking plant. The results demonstrate that, compared with NLPCA, the proposed approach can effectively reduce the number of false alarms and is, hence, expected to better monitor real-world processes.
文摘In order to reduce the variations of the product quality in batch processes, multivariate statistical process control methods according to multi-way principal component analysis (MPCA) or multi-way projection to latent structure (MPLS) were proposed for on-line batch process monitoring. However, they are based on the decomposition of relative covariance matrix and strongly affected by outlying observations. In this paper, in view of an efficient projection pursuit algorithm, a robust statistical batch process monitoring (RSBPM) framework,which is resistant to outliers, is proposed to reduce the high demand for modeling data. The construction of robust normal operating condition model and robust control limits are discussed in detail. It is evaluated on monitoring an industrial streptomycin fermentation process and compared with the conventional MPCA. The results show that the RSBPM framework is resistant to possible outliers and the robustness is confirmed.
基金Supported by the Scientific Research Foundation of Liaoning Provincial Education Department(L2015240)the National Natural Science Foundation of China(61379116,61503169)the Joint Fund of the Science and Technology Department of Liaoning Province(20170540448)
文摘The existing recommendation algorithms have lower robustness in facing of shilling attacks. Considering this problem, we present a robust recommendation algorithm based on kernel principal component analysis and fuzzy c-means clustering. Firstly, we use kernel principal component analysis method to reduce the dimensionality of the original rating matrix, which can extract the effective features of users and items. Then, according to the dimension-reduced rating matrix and the high correlation characteristic between attack profiles, we use fuzzy c-means clustering method to cluster user profiles, which can realize the effective separation of genuine profiles and attack profiles. Finally, we construct an indicator function based on the attack detection results to decrease the influence of attack profiles on the recommendation, and incorporate it into the matrix factorization technology to design the corresponding robust recommendation algorithm. Experiment results indicate that the proposed algorithm is superior to the existing methods in both recommendation accuracy and robustness.
基金supported by the National Hi-Tech Research and Development Program of China("863"Project)(Grant No.2011AA040202)the National Natural Science Foundation of China(Grant No.40976114)
文摘Discriminating internal layers by radio echo sounding is important in analyzing the thickness and ice deposits in the Antarctic ice sheet.The signal processing method of synthesis aperture radar(SAR)has been widely used for improving the signal to noise ratio(SNR)and discriminating internal layers by radio echo sounding data of ice sheets.This method is not efficient when we use edge detection operators to obtain accurate information of the layers,especially the ice-bed interface.This paper presents a new image processing method via a combined robust principal component analysis-total variation(RPCA-TV)approach for discriminating internal layers of ice sheets by radio echo sounding data.The RPCA-based method is adopted to project the high-dimensional observations to low-dimensional subspace structure to accelerate the operation of the TV-based method,which is used to discriminate the internal layers.The efficiency of the presented method has been tested on simulation data and the dataset of the Institute of Electronics,Chinese Academy of Sciences,collected during CHINARE 28.The results show that the new method is more efficient than the previous method in discriminating internal layers of ice sheets by radio echo sounding data.
基金the Doctoral Program of Higher Education of China(No.20120032110034)
文摘Robust principal component analysis(PCA) is widely used in many applications, such as image processing, data mining and bioinformatics. The existing methods for solving the robust PCA are mostly based on nuclear norm minimization. Those methods simultaneously minimize all the singular values, and thus the rank cannot be well approximated in practice. We extend the idea of truncated nuclear norm regularization(TNNR) to the robust PCA and consider truncated nuclear norm minimization(TNNM) instead of nuclear norm minimization(NNM). This method only minimizes the smallest N-r singular values to preserve the low-rank components, where N is the number of singular values and r is the matrix rank. Moreover, we propose an effective way to determine r via the shrinkage operator. Then we develop an effective iterative algorithm based on the alternating direction method to solve this optimization problem. Experimental results demonstrate the efficiency and accuracy of the TNNM method. Moreover, this method is much more robust in terms of the rank of the reconstructed matrix and the sparsity of the error.
基金supported by Hong Kong Research Grants Council General Research Fund (Grant No. 14205314)National Natural Science Foundation of China (Grant No. 11371192)
文摘Gauge duality theory was originated by Preund (1987), and was recently further investigated by Friedlander et al. (2014). When solving some matrix optimization problems via gauge dual, one is usually able to avoid full matrix decompositions such as singular value and/or eigenvalue decompositions. In such an approach, a gauge dual problem is solved in the first stage, and then an optimal solution to the primal problem can be recovered from the dual optimal solution obtained in the first stage. Recently, this theory has been applied to a class of semidefinite programming (SDP) problems with promising numerical results by Friedlander and Mac^to (2016). We establish some theoretical results on applying the gauge duality theory to robust principal component analysis (PCA) and general SDP. For each problem, we present its gauge dual problem, characterize the optimality conditions for the primal-dual gauge pair, and validate a way to recover a primal optimal solution from a dual one. These results are extensions of Friedlander and Macedo (2016) from nuclear norm regularization to robust PCA and from a special class of SDP which requires the coefficient matrix in the linear objective to be positive definite to SDP problems without this restriction. Our results provide further understanding in the potential advantages and disadvantages of the gauge duality theory.
文摘运动目标传统检测方法只考虑图像的亮度或纹理等某一种特性,受特异值影响较大,对噪声比较敏感,鲁棒性也不够好,而且背景恢复精度不高。针对以上局限性,提出一种融合结构相似度(structural similarity,SSIM)全参考模型和鲁棒主成分分析(robust principal component analysis,RPCA)的运动目标检测方法。此方法综合考虑图像的亮度、对比度和结构三种特性,不采用传统的背景减除法,而是把图像像素点的结构相似度作为度量来实现运动对象与背景的分离。实验结果表明,此方法准确率可达0.95,且F度量较传统运动目标检测算法平均提升0.15,总体上比传统方法更具优势。