Abstract: The estimation of covariance matrices is important in many fields, such as statistics. In real applications, data are frequently affected by high dimensionality and noise. However, most relevant studies are based on complete data. This paper studies the optimal estimation of high-dimensional covariance matrices based on missing and noisy samples under the norm. First, the model with sub-Gaussian additive noise is presented. The generalized sample covariance is then modified to define a hard thresholding estimator, and the minimax upper bound is derived. After that, the minimax lower bound is derived, and it is concluded that the estimator presented in this article is rate-optimal. Finally, numerical simulation analysis is performed. The results show that for missing samples with sub-Gaussian noise, if the true covariance matrix is sparse, the hard thresholding estimator outperforms the traditional estimation method.
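The idea of thresholding a covariance estimate built from incomplete data can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the estimator analyzed in the paper: missing entries are assumed to be marked with NaN, every pair of variables is assumed to be observed together at least once, the function names `pairwise_covariance` and `hard_threshold` are hypothetical, and the threshold `lam` is left to the caller. Keeping the diagonal untouched is a common convention adopted here, not something the abstract states.

```python
import numpy as np

def pairwise_covariance(X):
    """Covariance estimate from data where NaN marks a missing entry.

    Entry (j, k) averages over the rows in which both columns are observed.
    Assumes every pair of columns is jointly observed at least once.
    """
    X = np.asarray(X, dtype=float)
    observed = ~np.isnan(X)
    n_obs = observed.sum(axis=0)                        # observations per column
    means = np.where(observed, X, 0.0).sum(axis=0) / n_obs
    centered = np.where(observed, X - means, 0.0)       # missing entries contribute 0
    obs = observed.astype(float)
    pair_counts = obs.T @ obs                           # joint observation counts
    return (centered.T @ centered) / pair_counts

def hard_threshold(S, lam):
    """Set off-diagonal entries with magnitude below lam to zero."""
    S = np.asarray(S, dtype=float)
    T = np.where(np.abs(S) >= lam, S, 0.0)
    T[np.diag_indices_from(T)] = np.diag(S)             # keep the variances
    return T
```

A typical call would be `hard_threshold(pairwise_covariance(X), lam)`. When the true covariance matrix is sparse, the small entries removed by the threshold are mostly estimation noise, which is the intuition behind the comparison reported in the abstract.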
Funding: This work was supported in part by the National Key Research and Development Program of China (2018YFB0904200) and the Eponymous Complement S&T Program of State Grid Corporation of China (SGLNDKOOKJJS1800266).
Abstract: Because multiple energy sources are tightly coupled, the interaction between different energy supply systems makes it difficult to identify weak nodes in integrated energy systems (IES). Based on an analysis of the data generated by the actual operation of IES, this paper proposes a weak node identification method based on random matrix theory (RMT). First, a unified power flow model for IES is established. Second, RMT and the characteristics of weak nodes are introduced, and a random matrix is constructed from historical and real-time data without considering the detailed physical model of the system. Third, the two limiting spectral distribution functions (Marchenko-Pastur law and ring law) are used to qualitatively analyze the system's operating status, linear eigenvalue statistics such as the mean spectral radius (MSR) are calculated, and a weak node identification model based on entropy theory is established. Finally, a simulation of IES verifies the effectiveness of the proposed method, which provides a new approach for identifying weak nodes in IES.
Funding: Supported in part by the National Natural Science Foundation of China (61772493, 91646114), in part by the Chongqing research program of technology innovation and application (cstc2017rgzn-zdyfX0020), and in part by the Pioneer Hundred Talents Program of Chinese Academy of Sciences.
Abstract: Latent factor (LF) models are highly effective in extracting useful knowledge from High-Dimensional and Sparse (HiDS) matrices, which are commonly seen in various industrial applications. An LF model usually adopts iterative optimizers, which may need many iterations to reach a local optimum, resulting in considerable time cost. Hence, determining how to accelerate the training process for LF models has become a significant issue. To address this, this work proposes a randomized latent factor (RLF) model. It incorporates the principle of randomized learning techniques from neural networks into the LF analysis of HiDS matrices, thereby greatly alleviating the computational burden. It also extends a standard learning process for randomized neural networks in the context of LF analysis so that the resulting model represents an HiDS matrix correctly. Experimental results on three HiDS matrices from industrial applications demonstrate that, compared with state-of-the-art LF models, RLF achieves significantly higher computational efficiency and comparable prediction accuracy for missing data. It provides an important alternative approach to LF analysis of HiDS matrices, which is especially desirable for industrial applications demanding highly efficient models.
Abstract: We introduce and develop a novel approach to outlier detection based on an adaptation of random subspace learning. Our proposed method handles both high-dimensional low-sample-size and traditional low-dimensional high-sample-size datasets. Essentially, we avoid the computational bottleneck of techniques like Minimum Covariance Determinant (MCD) by computing the needed determinants and associated measures in much lower dimensional subspaces. Both the theoretical and the computational development of our approach reveal that it is computationally more efficient than the regularized methods in the high-dimensional low-sample-size setting, and that it often competes favorably with existing methods as far as the percentage of correct outlier detection is concerned.
Funding: Supported in part by the National Natural Science Foundation of China (61702475, 61772493, 61902370, 62002337), in part by the Natural Science Foundation of Chongqing, China (cstc2019jcyj-msxmX0578, cstc2019jcyjjqX0013), in part by the Chinese Academy of Sciences "Light of West China" Program, in part by the Pioneer Hundred Talents Program of Chinese Academy of Sciences, and by the Technology Innovation and Application Development Project of Chongqing, China (cstc2019jscx-fxydX0027).
Abstract: High-dimensional and sparse (HiDS) matrices commonly arise in various industrial applications, e.g., recommender systems (RSs), social networks, and wireless sensor networks. Since they contain rich information, representing them accurately is of great significance. A latent factor (LF) model is one of the most popular and successful ways to address this issue. Current LF models mostly adopt an L2-norm-oriented loss to represent an HiDS matrix, i.e., they sum the errors between observed data and predicted ones with the L2-norm. Yet the L2-norm is sensitive to outlier data, and outlier data usually exist in such matrices. For example, an HiDS matrix from RSs commonly contains many outlier ratings due to heedless or malicious users. To address this issue, this work proposes a smooth L1-norm-oriented latent factor (SL-LF) model. Its main idea is to adopt the smooth L1-norm rather than the L2-norm to form its loss, giving it both strong robustness and high accuracy in predicting the missing data of an HiDS matrix. Experimental results on eight HiDS matrices generated by industrial applications verify that the proposed SL-LF model is not only robust to outlier data but also has significantly higher prediction accuracy than state-of-the-art models when they are used to predict the missing data of HiDS matrices.
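The sensitivity of an L2-oriented loss to outliers, and the way a smooth L1 loss dampens it, can be illustrated with a short Python sketch. The definition below is the common Huber-style smooth L1 (quadratic for small errors, linear for large ones); the exact form used by the SL-LF model is not given in the abstract, so the formula, the `delta` parameter, and the function names are assumptions made for illustration only.

```python
import numpy as np

def squared_loss(err):
    """Per-entry L2-oriented loss: 0.5 * e^2."""
    return 0.5 * np.asarray(err, dtype=float) ** 2

def smooth_l1_loss(err, delta=1.0):
    """Per-entry smooth L1 loss (assumed Huber-style definition).

    Quadratic for |e| < delta, linear beyond it, and continuous at |e| = delta.
    """
    a = np.abs(np.asarray(err, dtype=float))
    return np.where(a < delta, 0.5 * a ** 2 / delta, a - 0.5 * delta)

# Prediction errors on observed entries; the last one mimics an outlier rating.
errors = np.array([0.5, 3.0, -10.0])
print(squared_loss(errors))    # the outlier dominates the total loss
print(smooth_l1_loss(errors))  # the outlier's contribution grows only linearly
```

Because the penalty on a large error grows linearly rather than quadratically, a single outlier rating pulls the fitted latent factors far less than it would under the squared loss.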
Funding: Supported by the National Natural Science Foundation of China (No. 51275348) and the College Students Innovation and Entrepreneurship Training Program of Tianjin University (No. 201210056339).
Abstract: In this paper, a unified matrix recovery model was proposed for diverse corrupted matrices. Owing to the separable structure of the proposed model, the convex optimization problem can be solved efficiently by adopting an inexact augmented Lagrange multiplier (IALM) method. Additionally, a random projection accelerated technique (IALM+RP) was adopted to improve the success rate. Preliminary numerical comparisons indicated that for the standard robust principal component analysis (PCA) problem, IALM+RP was two to six times faster than IALM with an insignificant reduction in accuracy, and that for the outlier pursuit (OP) problem, IALM+RP was at least 6.9 times faster, and up to 8.3 times faster when the size of the matrix was 2000 × 2000.
Funding: This project is supported by the National Natural Science Foundation of China (No. 59805001).
Abstract: A new algorithm for the numerical characteristics of structural random response, named the matrix algebra algorithm of structure analysis, is presented. Using the algorithm, the numerical characteristics of structural random response can easily be obtained by directly solving linear matrix equations rather than the differential equations of structural motion. Moreover, to solve the corresponding linear matrix equations, a fast numerical integration algorithm is presented. Based on the results, dynamic design and life-span estimation can then be carried out. In addition, the new algorithm can solve the response of structures with non-proportional damping.
Funding: Supported by the National Natural Science Foundation of China (No. 60972039), the Natural Science Foundation of Jiangsu Province (No. BK2007729), and the Natural Science Funding of Jiangsu Province (No. 06KJA51001).
Abstract: Random Matrix Theory (RMT) is a valuable tool for describing the asymptotic behavior of multiple systems, especially for large matrices. In this paper, using asymptotic random matrix theory, a new cooperative Multiple-Input Multiple-Output (MIMO) scheme for spectrum sensing is proposed, which shows how the asymptotic freeness property of random matrices and the properties of the Wishart distribution can be used to assist spectrum sensing for Cognitive Radios (CRs). Simulations over Rayleigh fading and AWGN channels demonstrate that the proposed scheme has better detection performance than energy detection techniques, even with a small sample of observations.
Funding: Supported in part by the "Pioneer" and "Leading Goose" R&D Program of Zhejiang (Grant No. 2022C03174), the National Natural Science Foundation of China (No. 92067103), the Key Research and Development Program of Shaanxi (No. 2021ZDLGY06-02), the Natural Science Foundation of Shaanxi Province (No. 2019ZDLGY12-02), the Shaanxi Innovation Team Project (No. 2018TD007), the Xi'an Science and Technology Innovation Plan (No. 201809168CX9JC10), and the National 111 Program of China (B16037).
Abstract: Cloud computing provides powerful processing capabilities for large-scale intelligent Internet of Things (IoT) terminals. However, massive real-time data processing requirements challenge the existing cloud computing model. The edge server is closer to the data source, and end-edge-cloud collaboration offloads cloud computing tasks to the edge environment, which addresses the shortcomings of the cloud in resource storage, computing performance, and energy consumption. IoT terminals and sensors pose security and privacy challenges because of their resource constraints and exponential growth. As a key technology of IoT, the Radio-Frequency Identification (RFID) authentication protocol greatly strengthens privacy protection and improves IoT security. However, improving security inevitably increases system overhead, which is a major burden for low-cost RFID tags. Existing RFID authentication protocols struggle to balance overhead and security. This paper designs an ultra-lightweight encryption function and proposes an RFID authentication scheme based on this function for the end-edge-cloud collaborative environment. A BAN logic proof and the protocol verification tool AVISPA formally verify the protocol's security. We use VIVADO to implement the encryption function and evaluate the tag's overhead on the FPGA platform. Performance evaluation indicates that the proposed protocol balances low computing costs and high security requirements.
Abstract: We propose and apply a new algorithm of principal component analysis that is suitable for large, highly random time series data, such as a set of stock prices in a stock market. This algorithm utilizes the fact that the major part of the time series is random, and compares the eigenvalue spectrum of the cross-correlation matrix of a large set of random time series to the spectrum derived from random matrix theory (RMT) in the limit of large dimension (the number of independent time series) and sufficiently long time series. We test this algorithm on real tick data of American stocks for different years between 1994 and 2002 and show that the extracted principal components indeed reflect the change of leading stock sectors during this period.
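The comparison between an empirical eigenvalue spectrum and the RMT prediction can be sketched as follows. This is a minimal illustration, not the algorithm of the paper: it assumes the random reference spectrum is the Marchenko-Pastur law for standardized, uncorrelated series (with edges at (1 ± sqrt(N/T))^2 for N series of length T, N ≤ T), and it simply keeps the components whose eigenvalues exceed the upper edge. The function names are hypothetical.

```python
import numpy as np

def marchenko_pastur_edges(n_series, n_obs):
    """Lower and upper edges of the Marchenko-Pastur spectrum (unit variance)."""
    q = n_series / n_obs
    return (1.0 - np.sqrt(q)) ** 2, (1.0 + np.sqrt(q)) ** 2

def components_above_noise(returns):
    """Principal components whose eigenvalues exceed the random-matrix bound.

    `returns` has shape (n_obs, n_series), one column per time series.
    Returns the retained eigenvalues (largest first) and their eigenvectors.
    """
    returns = np.asarray(returns, dtype=float)
    n_obs, n_series = returns.shape
    corr = np.corrcoef(returns, rowvar=False)      # cross-correlation matrix
    eigvals, eigvecs = np.linalg.eigh(corr)        # eigenvalues in ascending order
    _, upper = marchenko_pastur_edges(n_series, n_obs)
    keep = eigvals > upper                         # outside the random bulk
    return eigvals[keep][::-1], eigvecs[:, keep][:, ::-1]
```

Eigenvalues inside the Marchenko-Pastur bulk are consistent with pure noise, so only the components returned by `components_above_noise` would be interpreted, for example as market-wide or sector-level modes.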
Funding: National Natural Science Foundation of China (Nos. 61201161, 61271335) and Postdoctoral Science Foundation of Jiangsu Province of China (No. 1301002B).
Abstract: Spectrum sensing in a wideband regime for cognitive radio networks (CRNs) faces considerable technical challenges due to the constraints on analog-to-digital converters (ADCs). To solve this problem, an eigenvalue-based compressive wideband spectrum sensing (ECWSS) scheme using random matrix theory (RMT) was proposed in this paper. The ECWSS directly utilized the compressive measurements based on compressive sampling (CS) theory to perform wideband spectrum sensing without requiring signal recovery, which could greatly reduce computational complexity and the data acquisition burden. In the ECWSS, to alleviate the communication overhead of the secondary user (SU), the sensors around the SU carried out compressive sampling at the sub-Nyquist rate instead of the SU. Furthermore, the exact probability density function of the extreme eigenvalues was used to set the threshold. Theoretical analyses and simulation results show that, compared with existing eigenvalue-based sensing schemes, the ECWSS has much lower computational complexity and cost with no significant degradation in detection performance.
Abstract: We applied Random Matrix Theory (RMT) to examine the validity of the NPT treatment in HSP. We investigated the pathology by examining the sEMG signal recorded for about eight minutes. We performed standard electromyographic investigations and also applied the RMT method of analysis. We investigated the sEMG signals before and after the NPT treatment. The application of a method as robust as RMT shows that the NPT treatment was able to induce a net improvement of the disease with respect to the pathological status before NPT.
Abstract: In this paper, a method of power quality disturbance classification based on random matrix theory (RMT) is proposed. The method uses the power quality disturbance signal to construct a random matrix. By analyzing the variation of the mean spectral radius (MSR) of the random matrix, the type and time of occurrence of the power quality disturbance are identified. RMT is used to analyze voltage sag, swell, and interruption disturbance signals in order to determine the occurrence time and duration of the disturbance and the depth of the voltage sag or swell. Examples show that the method has strong anti-noise ability.
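The mean spectral radius itself is simple to compute: it is the average modulus of the eigenvalues of a square matrix. The Python sketch below shows only that statistic and a naive way to track it over sliding square windows of a data matrix. It is a simplified illustration under stated assumptions: the function names are hypothetical, and the normalization steps that published RMT pipelines apply before taking eigenvalues are not reproduced here.

```python
import numpy as np

def mean_spectral_radius(M):
    """Average modulus of the eigenvalues of a square matrix."""
    eigvals = np.linalg.eigvals(np.asarray(M, dtype=float))
    return float(np.mean(np.abs(eigvals)))

def msr_over_windows(data):
    """MSR of each N x N sliding block of an N x T data matrix (T >= N).

    Row i holds the samples of measurement i. A sudden change in the returned
    sequence marks a window whose spectrum differs from the preceding ones.
    """
    data = np.asarray(data, dtype=float)
    n_rows, n_cols = data.shape
    return [
        mean_spectral_radius(data[:, start:start + n_rows])
        for start in range(n_cols - n_rows + 1)
    ]
```

Plotting the output of `msr_over_windows` against the window start index gives a curve whose jumps can be compared with the onset and end of a disturbance.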