This paper proposes a Graph regularized Lpsmooth non-negative matrix factorization(GSNMF) method by incorporating graph regularization and L_p smoothing constraint, which considers the intrinsic geometric information ...This paper proposes a Graph regularized Lpsmooth non-negative matrix factorization(GSNMF) method by incorporating graph regularization and L_p smoothing constraint, which considers the intrinsic geometric information of a data set and produces smooth and stable solutions. The main contributions are as follows: first, graph regularization is added into NMF to discover the hidden semantics and simultaneously respect the intrinsic geometric structure information of a data set. Second,the Lpsmoothing constraint is incorporated into NMF to combine the merits of isotropic(L_2-norm) and anisotropic(L_1-norm)diffusion smoothing, and produces a smooth and more accurate solution to the optimization problem. Finally, the update rules and proof of convergence of GSNMF are given. Experiments on several data sets show that the proposed method outperforms related state-of-the-art methods.展开更多
Aiming at the Four-Dimensional Variation source term inversion algorithm proposed earlier,the observation error regularization factor is introduced to improve the prediction accuracy of the diffusion model,and an impr...Aiming at the Four-Dimensional Variation source term inversion algorithm proposed earlier,the observation error regularization factor is introduced to improve the prediction accuracy of the diffusion model,and an improved Four-Dimensional Variation source term inversion algorithm with observation error regularization(OER-4DVAR STI model)is formed.Firstly,by constructing the inversion process and basic model of OER-4DVAR STI model,its basic principle and logical structure are studied.Secondly,the observation error regularization factor estimation method based on Bayesian optimization is proposed,and the error factor is separated and optimized by two parameters:error statistical time and deviation degree.Finally,the scientific,feasible and advanced nature of the OER-4DVAR STI model are verified by numerical simulation and tracer test data.The experimental results show that OER-4DVAR STI model can better reverse calculate the hazard source term information under the conditions of high atmospheric stability and flat underlying surface.Compared with the previous inversion algorithm,the source intensity estimation accuracy of OER-4DVAR STI model is improved by about 46.97%,and the source location estimation accuracy is improved by about 26.72%.展开更多
Soil compressibility parameters are important indicators in the geotechnical field and are affected by various factors such as natural conditions and human interference.When the sample size is too large,conventional m...Soil compressibility parameters are important indicators in the geotechnical field and are affected by various factors such as natural conditions and human interference.When the sample size is too large,conventional methods require massive human and financial resources.In order to reasonably simulate the compressibility parameters of the sample,this paper firstly adopts the correlation analysis to select seven influencing factors.Each of the factors has a high correlation with compressibility parameters.Meanwhile,the proportion of the weights of the seven factors in the Bayesian neural network is analyzed based on Garson theory.Secondly,an output model of the compressibility parameters of BR-BP silty clay is established based on Bayesian regularized BP neural network.Finally,the model is used to simulate the measured compressibility parameters.The output results are compared with the measured values and the output results of the traditional LM-BP neural network.The results show that the model is more stable and has stronger nonlinear fitting ability.The output of the model is basically consistent with the actual value.Compared with the traditional LMBP neural network model,its data sensitivity is enhanced,and the accuracy of the output result is significantly improved,the average value of the relative error of the compression coefficient is reduced from 15.54%to 6.15%,and the average value of the relative error of the compression modulus is reduced from 6.07%to 4.62%.The results provide a new technical method for obtaining the compressibility parameters of silty clay in this area,showing good theoretical significance and practical value.展开更多
Bayesian empirical likelihood is a semiparametric method that combines parametric priors and nonparametric likelihoods, that is, replacing the parametric likelihood function in Bayes theorem with a nonparametric empir...Bayesian empirical likelihood is a semiparametric method that combines parametric priors and nonparametric likelihoods, that is, replacing the parametric likelihood function in Bayes theorem with a nonparametric empirical likelihood function, which can be used without assuming the distribution of the data. It can effectively avoid the problems caused by the wrong setting of the model. In the variable selection based on Bayesian empirical likelihood, the penalty term is introduced into the model in the form of parameter prior. In this paper, we propose a novel variable selection method, L<sub>1/2</sub> regularization based on Bayesian empirical likelihood. The L<sub>1/2</sub> penalty is introduced into the model through a scale mixture of uniform representation of generalized Gaussian prior, and the posterior distribution is then sampled using MCMC method. Simulations demonstrate that the proposed method can have better predictive ability when the error violates the zero-mean normality assumption of the standard parameter model, and can perform variable selection.展开更多
Bayesian regularized BP neural network(BRBPNN) technique was applied in the chlorophyll-α prediction of Nanzui water area in Dongting Lake. Through BP network interpolation method, the input and output samples of t...Bayesian regularized BP neural network(BRBPNN) technique was applied in the chlorophyll-α prediction of Nanzui water area in Dongting Lake. Through BP network interpolation method, the input and output samples of the network were obtained. After the selection of input variables using stepwise/multiple linear regression method in SPSS i1.0 software, the BRBPNN model was established between chlorophyll-α and environmental parameters, biological parameters. The achieved optimal network structure was 3-11-1 with the correlation coefficients and the mean square errors for the training set and the test set as 0.999 and 0.000?8426, 0.981 and 0.0216 respectively. The sum of square weights between each input neuron and the hidden layer of optimal BRBPNN models of different structures indicated that the effect of individual input parameter on chlorophyll- α declined in the order of alga amount 〉 secchi disc depth(SD) 〉 electrical conductivity (EC). Additionally, it also demonstrated that the contributions of these three factors were the maximal for the change of chlorophyll-α concentration, total phosphorus(TP) and total nitrogen(TN) were the minimal. All the results showed that BRBPNN model was capable of automated regularization parameter selection and thus it may ensure the excellent generation ability and robustness. Thus, this study laid the foundation for the application of BRBPNN model in the analysis of aquatic ecological data(chlorophyll-α prediction) and the explanation about the effective eutrophication treatment measures for Nanzui water area in Dongting Lake.展开更多
In recent years, variable selection based on penalty likelihood methods has aroused great concern. Based on the Gibbs sampling algorithm of asymmetric Laplace distribution, this paper considers the quantile regression...In recent years, variable selection based on penalty likelihood methods has aroused great concern. Based on the Gibbs sampling algorithm of asymmetric Laplace distribution, this paper considers the quantile regression with adaptive Lasso and Lasso penalty from a Bayesian point of view. Under the non-Bayesian and Bayesian framework, several regularization quantile regression methods are systematically compared for error terms with different distributions and heteroscedasticity. Under the error term of asymmetric Laplace distribution, statistical simulation results show that the Bayesian regularized quantile regression is superior to other distributions in all quantiles. And based on the asymmetric Laplace distribution, the Bayesian regularized quantile regression approach performs better than the non-Bayesian approach in parameter estimation and prediction. Through real data analyses, we also confirm the above conclusions.展开更多
In this paper,we will discuss smoothness of weak solutions for the system of second order differential equations eith non-negative characteristies.First of all,we establish boundary,and interior estimates and then we ...In this paper,we will discuss smoothness of weak solutions for the system of second order differential equations eith non-negative characteristies.First of all,we establish boundary,and interior estimates and then we prove that solutions of regularization problem satisfy Lipschitz condition.展开更多
Theoretical results related to properties of a regularized recursive algorithm for estimation of a high dimensional vector of parameters are presented and proved. The recursive character of the procedure is proposed t...Theoretical results related to properties of a regularized recursive algorithm for estimation of a high dimensional vector of parameters are presented and proved. The recursive character of the procedure is proposed to overcome the difficulties with high dimension of the observation vector in computation of a statistical regularized estimator. As to deal with high dimension of the vector of unknown parameters, the regularization is introduced by specifying a priori non-negative covariance structure for the vector of estimated parameters. Numerical example with Monte-Carlo simulation for a low-dimensional system as well as the state/parameter estimation in a very high dimensional oceanic model is presented to demonstrate the efficiency of the proposed approach.展开更多
Rank determination issue is one of the most significant issues in non-negative matrix factorization (NMF) research. However, rank determination problem has not received so much emphasis as sparseness regularization pr...Rank determination issue is one of the most significant issues in non-negative matrix factorization (NMF) research. However, rank determination problem has not received so much emphasis as sparseness regularization problem. Usually, the rank of base matrix needs to be assumed. In this paper, we propose an unsupervised multi-level non-negative matrix factorization model to extract the hidden data structure and seek the rank of base matrix. From machine learning point of view, the learning result depends on its prior knowledge. In our unsupervised multi-level model, we construct a three-level data structure for non-negative matrix factorization algorithm. Such a construction could apply more prior knowledge to the algorithm and obtain a better approximation of real data structure. The final bases selection is achieved through L2-norm optimization. We implement our experiment via binary datasets. The results demonstrate that our approach is able to retrieve the hidden structure of data, thus determine the correct rank of base matrix.展开更多
Conventional artificial neural networks used to solve electrical resistivity imaging (ERI) inversion problem suffer from overfitting and local minima. To solve these problems, we propose to use a pruning Bayesian ne...Conventional artificial neural networks used to solve electrical resistivity imaging (ERI) inversion problem suffer from overfitting and local minima. To solve these problems, we propose to use a pruning Bayesian neural network (PBNN) nonlinear inversion method and a sample design method based on the K-medoids clustering algorithm. In the sample design method, the training samples of the neural network are designed according to the prior information provided by the K-medoids clustering results; thus, the training process of the neural network is well guided. The proposed PBNN, based on Bayesian regularization, is used to select the hidden layer structure by assessing the effect of each hidden neuron to the inversion results. Then, the hyperparameter αk, which is based on the generalized mean, is chosen to guide the pruning process according to the prior distribution of the training samples under the small-sample condition. The proposed algorithm is more efficient than other common adaptive regularization methods in geophysics. The inversion of synthetic data and field data suggests that the proposed method suppresses the noise in the neural network training stage and enhances the generalization. The inversion results with the proposed method are better than those of the BPNN, RBFNN, and RRBFNN inversion methods as well as the conventional least squares inversion.展开更多
In this paper,the application of an algorithm for precipitation retrieval based on Himawari-8 (H8) satellite infrared data is studied.Based on GPM precipitation data and H8 Infrared spectrum channel brightness tempera...In this paper,the application of an algorithm for precipitation retrieval based on Himawari-8 (H8) satellite infrared data is studied.Based on GPM precipitation data and H8 Infrared spectrum channel brightness temperature data,corresponding "precipitation field dictionary" and "channel brightness temperature dictionary" are formed.The retrieval of precipitation field based on brightness temperature data is studied through the classification rule of k-nearest neighbor domain (KNN) and regularization constraint.Firstly,the corresponding "dictionary" is constructed according to the training sample database of the matched GPM precipitation data and H8 brightness temperature data.Secondly,according to the fact that precipitation characteristics in small organizations in different storm environments are often repeated,KNN is used to identify the spectral brightness temperature signal of "precipitation" and "non-precipitation" based on "the dictionary".Finally,the precipitation field retrieval is carried out in the precipitation signal "subspace" based on the regular term constraint method.In the process of retrieval,the contribution rate of brightness temperature retrieval of different channels was determined by Bayesian model averaging (BMA) model.The preliminary experimental results based on the "quantitative" evaluation indexes show that the precipitation of H8 retrieval has a good correlation with the GPM truth value,with a small error and similar structure.展开更多
A Bayesian decision method is considered,which is applied to analysingthe reform problem of economic system in our country.When the number of eco-nomic departments satisfies some certain cunditions,the optinial length...A Bayesian decision method is considered,which is applied to analysingthe reform problem of economic system in our country.When the number of eco-nomic departments satisfies some certain cunditions,the optinial lengths and optimalallocations are found in this paper.展开更多
Aiming at the low recognition accuracy of non-negative matrix factorization(NMF)in practical application,an improved spare graph NMF(New-SGNMF)is proposed in this paper.New-SGNMF makes full use of the inherent geometr...Aiming at the low recognition accuracy of non-negative matrix factorization(NMF)in practical application,an improved spare graph NMF(New-SGNMF)is proposed in this paper.New-SGNMF makes full use of the inherent geometric structure of image data to optimize the basis matrix in two steps.A threshold value s was first set to judge the threshold value of the decomposed base matrix to filter the redundant information in the data.Using L2 norm,sparse constraints were then implemented on the basis matrix,and integrated into the objective function to obtain the objective function of New-SGNMF.In addition,the derivation process of the algorithm and the convergence analysis of the algorithm were given.The experimental results on COIL20,PIE-pose09 and YaleB database show that compared with K-means,PCA,NMF and other algorithms,the proposed algorithm has higher accuracy and normalized mutual information.展开更多
基金supported by the National Natural Science Foundation of China(61702251,61363049,11571011)the State Scholarship Fund of China Scholarship Council(CSC)(201708360040)+3 种基金the Natural Science Foundation of Jiangxi Province(20161BAB212033)the Natural Science Basic Research Plan in Shaanxi Province of China(2018JM6030)the Doctor Scientific Research Starting Foundation of Northwest University(338050050)Youth Academic Talent Support Program of Northwest University
文摘This paper proposes a Graph regularized Lpsmooth non-negative matrix factorization(GSNMF) method by incorporating graph regularization and L_p smoothing constraint, which considers the intrinsic geometric information of a data set and produces smooth and stable solutions. The main contributions are as follows: first, graph regularization is added into NMF to discover the hidden semantics and simultaneously respect the intrinsic geometric structure information of a data set. Second,the Lpsmoothing constraint is incorporated into NMF to combine the merits of isotropic(L_2-norm) and anisotropic(L_1-norm)diffusion smoothing, and produces a smooth and more accurate solution to the optimization problem. Finally, the update rules and proof of convergence of GSNMF are given. Experiments on several data sets show that the proposed method outperforms related state-of-the-art methods.
基金Ministry of Science and Technology of the People’s Republic of China for its support and guidance(Grant No.2018YFC0214100)。
文摘Aiming at the Four-Dimensional Variation source term inversion algorithm proposed earlier,the observation error regularization factor is introduced to improve the prediction accuracy of the diffusion model,and an improved Four-Dimensional Variation source term inversion algorithm with observation error regularization(OER-4DVAR STI model)is formed.Firstly,by constructing the inversion process and basic model of OER-4DVAR STI model,its basic principle and logical structure are studied.Secondly,the observation error regularization factor estimation method based on Bayesian optimization is proposed,and the error factor is separated and optimized by two parameters:error statistical time and deviation degree.Finally,the scientific,feasible and advanced nature of the OER-4DVAR STI model are verified by numerical simulation and tracer test data.The experimental results show that OER-4DVAR STI model can better reverse calculate the hazard source term information under the conditions of high atmospheric stability and flat underlying surface.Compared with the previous inversion algorithm,the source intensity estimation accuracy of OER-4DVAR STI model is improved by about 46.97%,and the source location estimation accuracy is improved by about 26.72%.
基金This project is sponsored by the Basic scientific research business funding project of Institute of Seismic Prediction,CEA(2018 IESLZ06)the Natural Science Foundation of China(51778590)Earthquake Science and Technology Spark Project(XH20057)。
文摘Soil compressibility parameters are important indicators in the geotechnical field and are affected by various factors such as natural conditions and human interference.When the sample size is too large,conventional methods require massive human and financial resources.In order to reasonably simulate the compressibility parameters of the sample,this paper firstly adopts the correlation analysis to select seven influencing factors.Each of the factors has a high correlation with compressibility parameters.Meanwhile,the proportion of the weights of the seven factors in the Bayesian neural network is analyzed based on Garson theory.Secondly,an output model of the compressibility parameters of BR-BP silty clay is established based on Bayesian regularized BP neural network.Finally,the model is used to simulate the measured compressibility parameters.The output results are compared with the measured values and the output results of the traditional LM-BP neural network.The results show that the model is more stable and has stronger nonlinear fitting ability.The output of the model is basically consistent with the actual value.Compared with the traditional LMBP neural network model,its data sensitivity is enhanced,and the accuracy of the output result is significantly improved,the average value of the relative error of the compression coefficient is reduced from 15.54%to 6.15%,and the average value of the relative error of the compression modulus is reduced from 6.07%to 4.62%.The results provide a new technical method for obtaining the compressibility parameters of silty clay in this area,showing good theoretical significance and practical value.
文摘Bayesian empirical likelihood is a semiparametric method that combines parametric priors and nonparametric likelihoods, that is, replacing the parametric likelihood function in Bayes theorem with a nonparametric empirical likelihood function, which can be used without assuming the distribution of the data. It can effectively avoid the problems caused by the wrong setting of the model. In the variable selection based on Bayesian empirical likelihood, the penalty term is introduced into the model in the form of parameter prior. In this paper, we propose a novel variable selection method, L<sub>1/2</sub> regularization based on Bayesian empirical likelihood. The L<sub>1/2</sub> penalty is introduced into the model through a scale mixture of uniform representation of generalized Gaussian prior, and the posterior distribution is then sampled using MCMC method. Simulations demonstrate that the proposed method can have better predictive ability when the error violates the zero-mean normality assumption of the standard parameter model, and can perform variable selection.
文摘Bayesian regularized BP neural network(BRBPNN) technique was applied in the chlorophyll-α prediction of Nanzui water area in Dongting Lake. Through BP network interpolation method, the input and output samples of the network were obtained. After the selection of input variables using stepwise/multiple linear regression method in SPSS i1.0 software, the BRBPNN model was established between chlorophyll-α and environmental parameters, biological parameters. The achieved optimal network structure was 3-11-1 with the correlation coefficients and the mean square errors for the training set and the test set as 0.999 and 0.000?8426, 0.981 and 0.0216 respectively. The sum of square weights between each input neuron and the hidden layer of optimal BRBPNN models of different structures indicated that the effect of individual input parameter on chlorophyll- α declined in the order of alga amount 〉 secchi disc depth(SD) 〉 electrical conductivity (EC). Additionally, it also demonstrated that the contributions of these three factors were the maximal for the change of chlorophyll-α concentration, total phosphorus(TP) and total nitrogen(TN) were the minimal. All the results showed that BRBPNN model was capable of automated regularization parameter selection and thus it may ensure the excellent generation ability and robustness. Thus, this study laid the foundation for the application of BRBPNN model in the analysis of aquatic ecological data(chlorophyll-α prediction) and the explanation about the effective eutrophication treatment measures for Nanzui water area in Dongting Lake.
文摘In recent years, variable selection based on penalty likelihood methods has aroused great concern. Based on the Gibbs sampling algorithm of asymmetric Laplace distribution, this paper considers the quantile regression with adaptive Lasso and Lasso penalty from a Bayesian point of view. Under the non-Bayesian and Bayesian framework, several regularization quantile regression methods are systematically compared for error terms with different distributions and heteroscedasticity. Under the error term of asymmetric Laplace distribution, statistical simulation results show that the Bayesian regularized quantile regression is superior to other distributions in all quantiles. And based on the asymmetric Laplace distribution, the Bayesian regularized quantile regression approach performs better than the non-Bayesian approach in parameter estimation and prediction. Through real data analyses, we also confirm the above conclusions.
文摘In this paper,we will discuss smoothness of weak solutions for the system of second order differential equations eith non-negative characteristies.First of all,we establish boundary,and interior estimates and then we prove that solutions of regularization problem satisfy Lipschitz condition.
文摘Theoretical results related to properties of a regularized recursive algorithm for estimation of a high dimensional vector of parameters are presented and proved. The recursive character of the procedure is proposed to overcome the difficulties with high dimension of the observation vector in computation of a statistical regularized estimator. As to deal with high dimension of the vector of unknown parameters, the regularization is introduced by specifying a priori non-negative covariance structure for the vector of estimated parameters. Numerical example with Monte-Carlo simulation for a low-dimensional system as well as the state/parameter estimation in a very high dimensional oceanic model is presented to demonstrate the efficiency of the proposed approach.
文摘Rank determination issue is one of the most significant issues in non-negative matrix factorization (NMF) research. However, rank determination problem has not received so much emphasis as sparseness regularization problem. Usually, the rank of base matrix needs to be assumed. In this paper, we propose an unsupervised multi-level non-negative matrix factorization model to extract the hidden data structure and seek the rank of base matrix. From machine learning point of view, the learning result depends on its prior knowledge. In our unsupervised multi-level model, we construct a three-level data structure for non-negative matrix factorization algorithm. Such a construction could apply more prior knowledge to the algorithm and obtain a better approximation of real data structure. The final bases selection is achieved through L2-norm optimization. We implement our experiment via binary datasets. The results demonstrate that our approach is able to retrieve the hidden structure of data, thus determine the correct rank of base matrix.
基金supported by the National Natural Science Foundation of China(Grant No.41374118)the Research Fund for the Higher Education Doctoral Program of China(Grant No.20120162110015)+3 种基金the China Postdoctoral Science Foundation(Grant No.2015M580700)the Hunan Provincial Natural Science Foundation,the China(Grant No.2016JJ3086)the Hunan Provincial Science and Technology Program,China(Grant No.2015JC3067)the Scientific Research Fund of Hunan Provincial Education Department,China(Grant No.15B138)
文摘Conventional artificial neural networks used to solve electrical resistivity imaging (ERI) inversion problem suffer from overfitting and local minima. To solve these problems, we propose to use a pruning Bayesian neural network (PBNN) nonlinear inversion method and a sample design method based on the K-medoids clustering algorithm. In the sample design method, the training samples of the neural network are designed according to the prior information provided by the K-medoids clustering results; thus, the training process of the neural network is well guided. The proposed PBNN, based on Bayesian regularization, is used to select the hidden layer structure by assessing the effect of each hidden neuron to the inversion results. Then, the hyperparameter αk, which is based on the generalized mean, is chosen to guide the pruning process according to the prior distribution of the training samples under the small-sample condition. The proposed algorithm is more efficient than other common adaptive regularization methods in geophysics. The inversion of synthetic data and field data suggests that the proposed method suppresses the noise in the neural network training stage and enhances the generalization. The inversion results with the proposed method are better than those of the BPNN, RBFNN, and RRBFNN inversion methods as well as the conventional least squares inversion.
基金Supported by National Natural Science Foundation of China(41805080)Natural Science Foundation of Anhui Province,China(1708085QD89)+1 种基金Key Research and Development Program Projects of Anhui Province,China(201904a07020099)Open Foundation Project Shenyang Institute of Atmospheric Environment,China Meteorological Administration(2016SYIAE14)
文摘In this paper,the application of an algorithm for precipitation retrieval based on Himawari-8 (H8) satellite infrared data is studied.Based on GPM precipitation data and H8 Infrared spectrum channel brightness temperature data,corresponding "precipitation field dictionary" and "channel brightness temperature dictionary" are formed.The retrieval of precipitation field based on brightness temperature data is studied through the classification rule of k-nearest neighbor domain (KNN) and regularization constraint.Firstly,the corresponding "dictionary" is constructed according to the training sample database of the matched GPM precipitation data and H8 brightness temperature data.Secondly,according to the fact that precipitation characteristics in small organizations in different storm environments are often repeated,KNN is used to identify the spectral brightness temperature signal of "precipitation" and "non-precipitation" based on "the dictionary".Finally,the precipitation field retrieval is carried out in the precipitation signal "subspace" based on the regular term constraint method.In the process of retrieval,the contribution rate of brightness temperature retrieval of different channels was determined by Bayesian model averaging (BMA) model.The preliminary experimental results based on the "quantitative" evaluation indexes show that the precipitation of H8 retrieval has a good correlation with the GPM truth value,with a small error and similar structure.
文摘A Bayesian decision method is considered,which is applied to analysingthe reform problem of economic system in our country.When the number of eco-nomic departments satisfies some certain cunditions,the optinial lengths and optimalallocations are found in this paper.
基金This work was supported by the National Natural Science Foundation of China(Grant No.61501005)the Anhui Natural Science Foundation(Grant No.1608085 MF 147)+2 种基金the Natural Science Foundation of Anhui Universities(Grant No.KJ2016A057)the Industry Collaborative Innovation Fund of Anhui Polytechnic University and Jiujiang District(Grant No.2021cyxtb4)the Science Research Project of Anhui Polytechnic University(Grant No.Xjky2020120).
文摘Aiming at the low recognition accuracy of non-negative matrix factorization(NMF)in practical application,an improved spare graph NMF(New-SGNMF)is proposed in this paper.New-SGNMF makes full use of the inherent geometric structure of image data to optimize the basis matrix in two steps.A threshold value s was first set to judge the threshold value of the decomposed base matrix to filter the redundant information in the data.Using L2 norm,sparse constraints were then implemented on the basis matrix,and integrated into the objective function to obtain the objective function of New-SGNMF.In addition,the derivation process of the algorithm and the convergence analysis of the algorithm were given.The experimental results on COIL20,PIE-pose09 and YaleB database show that compared with K-means,PCA,NMF and other algorithms,the proposed algorithm has higher accuracy and normalized mutual information.