Learning from imbalanced data is one of the greatest challenging problems in binary classification,and this problem has gained more importance in recent years.When the class distribution is imbalanced,classical machin...Learning from imbalanced data is one of the greatest challenging problems in binary classification,and this problem has gained more importance in recent years.When the class distribution is imbalanced,classical machine learning algorithms tend to move strongly towards the majority class and disregard the minority.Therefore,the accuracy may be high,but the model cannot recognize data instances in the minority class to classify them,leading to many misclassifications.Different methods have been proposed in the literature to handle the imbalance problem,but most are complicated and tend to simulate unnecessary noise.In this paper,we propose a simple oversampling method based on Multivariate Gaussian distribution and K-means clustering,called GK-Means.The new method aims to avoid generating noise and control imbalances between and within classes.Various experiments have been carried out with six classifiers and four oversampling methods.Experimental results on different imbalanced datasets show that the proposed GK-Means outperforms other oversampling methods and improves classification performance as measured by F1-score and Accuracy.展开更多
Structures of monotone systems and cold standby systems with exponen-tial life distributions and dependent components are studied. It is shown that a mono-tone system composed of components with multivariate HNBUE lif...Structures of monotone systems and cold standby systems with exponen-tial life distributions and dependent components are studied. It is shown that a mono-tone system composed of components with multivariate HNBUE life distributions isessentially a series system composed of components with multivariate exponential lifedistributions. Also, it is proved that for cold standby systems composed of componentswith multivariate NBU life distributions, all but oue of the components are degenerateat zero while the remaining one is exponential. In addition, several equivalent char-acterizations of multivariate exponential distribution are provided in the multivariateHNBUE life distribution class which include many existing results as special cases.展开更多
This paper introduces a sliding-window mean removal high pass filter by which background clutter of infrared multispectral image is obtained. The method of selecting the optimum size of the sliding-window is based on ...This paper introduces a sliding-window mean removal high pass filter by which background clutter of infrared multispectral image is obtained. The method of selecting the optimum size of the sliding-window is based on the skewness-kurtosis test. In the end, a multivariate Gaussian distribution mathematical expression of background clutter image is given.展开更多
Displacement is vital in the evaluations of tunnel excavation processes,as well as in determining the postexcavation stability of surrounding rock masses.The prediction of tunnel displacement is a complex problem beca...Displacement is vital in the evaluations of tunnel excavation processes,as well as in determining the postexcavation stability of surrounding rock masses.The prediction of tunnel displacement is a complex problem because of the uncertainties of rock mass properties.Meanwhile,the variation and the correlation relationship of geotechnical material properties have been gradually recognized by researchers in recent years.In this paper,a novel probabilistic method is proposed to estimate the uncertainties of rock mass properties and tunnel displacement,which integrated multivariate distribution function and a relevance vector machine(RVM).The multivariate distribution function is used to establish the probability model of related random variables.RVM is coupled with the numerical simulation methods to construct the nonlinear relationship between tunnel displacements and rock mass parameters,which avoided a large number of numerical simulations.Also,the residual rock mass parameters are taken into account to reflect the brittleness of deeply buried rock mass.Then,based on the proposed method,the uncertainty of displacement in a deep tunnel of CJPL-II laboratory are analyzed and compared with the in-situ measurements.It is found that the predicted tunnel displacements by the RVM model closely match with the measured ones.The correlations of parameters have significant impacts on the uncertainty results.The uncertainty of tunnel displacement decreases while the reliability of the tunnel increases with the increases of the negative correlations among rock mass parameters.When compared to the deterministic method,the proposed approach is more rational and scientific,and also conformed to rock engineering practices.展开更多
The reliability of a vertical breakwater is calculated using direct integration methods based on joint density functions.The horizontal and uplifting wave forces on the vertical breakwater can be well fitted by the lo...The reliability of a vertical breakwater is calculated using direct integration methods based on joint density functions.The horizontal and uplifting wave forces on the vertical breakwater can be well fitted by the lognormal and the Gumbel distributions,respectively.The joint distribution of the horizontal and uplifting wave forces is analyzed using different probabilistic distributions,including the bivariate logistic Gumbel distribution,the bivariate lognormal distribution,and three bivariate Archimedean copulas functions constructed with different marginal distributions simultaneously.We use the fully nested copulas to construct multivariate distributions taking into account related variables.Different goodness fitting tests are carried out to determine the best bivariate copula model for wave forces on a vertical breakwater.We show that a bivariate model constructed by Frank copula gives the best reliability analysis,using marginal distributions of Gumbel and lognormal to account for uplifting pressure and horizontal wave force on a vertical breakwater,respectively.The results show that failure probability of the vertical breakwater calculated by multivariate density function is comparable to those by the Joint Committee on Structural Safety methods.As copulas are suitable for constructing a bivariate or multivariate joint distribution,they have great potential in reliability analysis for other coastal structures.展开更多
The paper is concerned with the basic properties of multivariate extreme value distribution (in the Logistic model). We obtain the characteristic function and recurrence formula of the density function. The explicit a...The paper is concerned with the basic properties of multivariate extreme value distribution (in the Logistic model). We obtain the characteristic function and recurrence formula of the density function. The explicit algebraic formula for Fisher information matrix is indicated. A simple and accurate procedure for generating random vector from multivariate extreme value distribution is presented.展开更多
Hurricanes Katrina and Rita resulted in the largest number of platforms destroyed and damaged in the history of Gulf of Mexico operations. With the trend of global warming, sea level rising and the frequency and inten...Hurricanes Katrina and Rita resulted in the largest number of platforms destroyed and damaged in the history of Gulf of Mexico operations. With the trend of global warming, sea level rising and the frequency and intensity of typhoon increase. How to determine a reasonable deck elevation against the largest hurricane waves has become a key issue in offshore platforms design and construction for the unification of economy and safety. In this paper, the multivariate compound extreme value distribution (MCEVD) model is used to predict the deck elevation with different combination of tide, surge height, and crest height. Compared with practice recommended by American Petroleum Institute (API), the prediction by MCEVD has probabilistic meaning and universality.展开更多
Accurate head poses are useful for many face-related tasks such as face recognition, gaze estimation,and emotion analysis. Most existing methods estimate head poses that are included in the training data(i.e.,previous...Accurate head poses are useful for many face-related tasks such as face recognition, gaze estimation,and emotion analysis. Most existing methods estimate head poses that are included in the training data(i.e.,previously seen head poses). To predict head poses that are not seen in the training data, some regression-based methods have been proposed. However, they focus on estimating continuous head pose angles, and thus do not systematically evaluate the performance on predicting unseen head poses. In this paper, we use a dense multivariate label distribution(MLD) to represent the pose angle of a face image. By incorporating both seen and unseen pose angles into MLD, the head pose predictor can estimate unseen head poses with an accuracy comparable to that of estimating seen head poses. On the Pointing'04 database, the mean absolute errors of results for yaw and pitch are 4.01?and 2.13?, respectively. In addition, experiments on the CAS-PEAL and CMU Multi-PIE databases show that the proposed dense MLD-based head pose estimation method can obtain the state-of-the-art performance when compared to some existing methods.展开更多
In this paper, the Bayes estimator and the parametric empirical Bayes estimator(PEBE) of mean vector in multivariate normal distribution are obtained. The superiority of the PEBE over the minimum variance unbiased est...In this paper, the Bayes estimator and the parametric empirical Bayes estimator(PEBE) of mean vector in multivariate normal distribution are obtained. The superiority of the PEBE over the minimum variance unbiased estimator(MVUE) and a revised James-Stein estimators(RJSE) are investigated respectively under mean square error(MSE) criterion. Extensive simulations are conducted to show that performance of the PEBE is optimal among these three estimators under the MSE criterion.展开更多
Based on the De.Morgan laws and Boolean simplification, a recursive decomposition method is introduced in this paper to identify the main exclusive safe paths and failed paths of a network. The reliability or the reli...Based on the De.Morgan laws and Boolean simplification, a recursive decomposition method is introduced in this paper to identify the main exclusive safe paths and failed paths of a network. The reliability or the reliability bound of a network can be conveniently expressed as the summation of the joint probabilities of these paths. Under the multivariate normal distribution assumption, a conditioned reliability index method is developed to evaluate joint probabilities of various exclusive safe paths and failed paths, and, finally, the seismic reliability or the reliability bound of an electric power system. Examples given in the paper show that the method is very simple and provides accurate results in the seismic reliability analysis.展开更多
In this study, an advanced probabilistic neural network (APNN) method is proposed to reflect the global probability density function (PDF) by summing up the heterogeneous local PDF which is automatically determine...In this study, an advanced probabilistic neural network (APNN) method is proposed to reflect the global probability density function (PDF) by summing up the heterogeneous local PDF which is automatically determined in the individual standard deviation of variables. The APNN is applied to predict the stability number of armor blocks of breakwaters using the experimental data of' van der Meet, and the estimated results of the APNN are compared with those of an empirical formula and a previous artificial neural network (ANN) model. The APNN shows better results in predicting the stability number of armor bilks of breakwater and it provided the promising probabilistic viewpoints by using the individual standard deviation in a variable.展开更多
Sampling from a truncated multivariate normal distribution (TMVND) constitutes the core computational module in fitting many statistical and econometric models. We propose two efficient methods, an iterative data au...Sampling from a truncated multivariate normal distribution (TMVND) constitutes the core computational module in fitting many statistical and econometric models. We propose two efficient methods, an iterative data augmentation (DA) algorithm and a non-iterative inverse Bayes formulae (IBF) sampler, to simulate TMVND and generalize them to multivariate normal distributions with linear inequality constraints. By creating a Bayesian incomplete-data structure, the posterior step of the DA Mgorithm directly generates random vector draws as opposed to single element draws, resulting obvious computational advantage and easy coding with common statistical software packages such as S-PLUS, MATLAB and GAUSS. Furthermore, the DA provides a ready structure for implementing a fast EM algorithm to identify the mode of TMVND, which has many potential applications in statistical inference of constrained parameter problems. In addition, utilizing this mode as an intermediate result, the IBF sampling provides a novel alternative to Gibbs sampling and elimi- nares problems with convergence and possible slow convergence due to the high correlation between components of a TMVND. The DA algorithm is applied to a linear regression model with constrained parameters and is illustrated with a published data set. Numerical comparisons show that the proposed DA algorithm and IBF sampler are more efficient than the Gibbs sampler and the accept-reject algorithm.展开更多
With the increasing complexity of industrial processes, the high-dimensional industrial data exhibit a strong nonlinearity, bringing considerable challenges to the fault diagnosis of industrial processes. To efficient...With the increasing complexity of industrial processes, the high-dimensional industrial data exhibit a strong nonlinearity, bringing considerable challenges to the fault diagnosis of industrial processes. To efficiently extract deep meaningful features that are crucial for fault diagnosis, a sparse Gaussian feature extractor(SGFE) is designed to learn a nonlinear mapping that projects the raw data into the feature space with the fault label dimension. The feature space is described by the one-hot encoding of the fault category label as an orthogonal basis. In this way, the deep sparse Gaussian features related to fault categories can be gradually learned from the raw data by SGFE. In the feature space,the sparse Gaussian(SG) loss function is designed to constrain the distribution of features to multiple sparse multivariate Gaussian distributions. The sparse Gaussian features are linearly separable in the feature space, which is conducive to improving the accuracy of the downstream fault classification task. The feasibility and practical utility of the proposed SGFE are verified by the handwritten digits MNIST benchmark and Tennessee-Eastman(TE) benchmark process,respectively.展开更多
The purpose of this paper is to present a general universal formula for <span style="font-family:Verdana;"><i></i></span><i><span><span><i><span style="...The purpose of this paper is to present a general universal formula for <span style="font-family:Verdana;"><i></i></span><i><span><span><i><span style="font-family:Verdana;">k</span></i><span style="font-family:Verdana;"></span></span></span></i><span><span><span style="font-family:Verdana;">-variate survival functions for arbitrary </span><span style="font-family:Verdana;"><i></i></span></span></span><i><span><span><i><span style="font-family:Verdana;">k</span></i><span style="font-family:Verdana;"></span></span></span></i><span style="font-family:Verdana;"><span style="font-family:Verdana;"><span style="font-family:Verdana;"> = 2, 3, </span></span></span><span><span><span style="font-family:;" "=""><span style="font-family:Verdana;">...</span><span style="font-family:Verdana;">, given all the univariate marginal survival functions. This universal form of </span></span><span style="font-family:Verdana;"><i></i></span></span></span><i><span><span><i><span style="font-family:Verdana;">k</span></i><span style="font-family:Verdana;"></span></span></span></i><span><span><span style="font-family:Verdana;">-variate probability distributions was obtained by means of “dependence functions” named “joiners” in the text. These joiners determine all the involved stochastic dependencies between the underlying random variables. However, in order that the presented formula (the form) represents a legitimate survival function, some necessary and sufficient conditions for the joiners had to be found. Basically, finding those conditions is the main task of this paper. This task was successfully performed for the case </span><span style="font-family:Verdana;"><i></i></span></span></span><i><span><span><i><span style="font-family:Verdana;">k</span></i><span style="font-family:Verdana;"></span></span></span></i><span><span><span style="font-family:Verdana;"> = 2 and the main results for the case </span><span style="font-family:Verdana;"><i></i></span></span></span><i><span><span><i><span style="font-family:Verdana;">k</span></i><span style="font-family:Verdana;"></span></span></span></i><span><span><span style="font-family:Verdana;"> = 3 were formulated as Theorem 1 and Theorem 2 in Section 4. Nevertheless, the hypothetical conditions valid for the general </span><span style="font-family:Verdana;"><i></i></span></span></span><i><span><span><i><span style="font-family:Verdana;">k</span></i><span style="font-family:Verdana;"></span></span></span></i><span><span><span style="font-family:Verdana;"> ≥ 4 case were also formulated in Section 3 as the (very convincing) Hypothesis. As for the sufficient conditions for both the </span><span style="font-family:Verdana;"><i></i></span></span></span><i><span><span><i><span style="font-family:Verdana;">k</span></i><span style="font-family:Verdana;"></span></span></span></i><span><span><span style="font-family:;" "=""><span style="font-family:Verdana;"> = 3 and</span><i> </i></span><span style="font-family:Verdana;"><i></i></span></span></span><i><span><span><i><span style="font-family:Verdana;">k</span></i><span style="font-family:Verdana;"></span></span></span></i><span><span><span style="font-family:Verdana;"> ≥ 4 cases, the full generality was not achieved since two restrictions were imposed. Firstly, we limited ourselves to the, defined in the text, “continuous cases” (when the corresponding joint density exists and is continuous), and secondly we consider positive stochastic dependencies only. Nevertheless, the class of the </span><span style="font-family:Verdana;"><i></i></span></span></span><i><span><span><i><span style="font-family:Verdana;">k</span></i><span style="font-family:Verdana;"></span></span></span></i><span><span><span style="font-family:Verdana;">-variate distributions which can be constructed is very wide. The presented method of construction by means of joiners can be considered competitive to the </span><span style="font-family:Verdana;"><strong></strong></span></span></span><strong><span><span><b><span style="font-family:Verdana;">copula</span></b><span style="font-family:Verdana;"></span></span></span></strong><span style="font-family:Verdana;"><span style="font-family:Verdana;"><span style="font-family:Verdana;"> methodology. As it is suggested in the paper the possibility of building a common theory of both copulae and joiners is quite possible, and the joiners may play the role of tools within the theory of copulae, and vice versa copulae may, for example, be used for finding proper joiners. Another independent feature of the joiners methodology is the possibility of constructing many new stochastic processes including stationary and Markovian.</span></span></span>展开更多
The probability analysis of ground deformation is becoming a trend to estimate and control the risk brought by shield tunnelling.The gap parameter is regarded as an effective tool to estimate the ground loss of tunnel...The probability analysis of ground deformation is becoming a trend to estimate and control the risk brought by shield tunnelling.The gap parameter is regarded as an effective tool to estimate the ground loss of tunnelling in soft soil.More specifically,x,which is a gap parameter component defined as the over(or insufficient)excavation due to the change in the posture of the shield machine,may contribute more to the uncertainty of the ground loss.However,the existing uncertainty characterization methods for x have several limitations and cannot explain the uncertain correlations between the relevant parameters.Along these lines,to better characterize the uncertainty of x,the multivariate probability distribution was developed in this work and a dynamic prediction was proposed for it.To attain this goal,1523 rings of the field data coming from the shield tunnel between Longqing Road and Baiyun Road in Kunming Metro Line 5 were utilized and 44 parameters including the construction,stratigraphic,and posture parameters were collected to form the database.According to the variance filter method,the mutual information method,and the value of the correlation coefficients,the original 44 parameters were reduced to 10 main parameters,which were unit weight,the stoke of the jacks(A,B,C,and D groups),the pressure of the pushing jacks(A,C groups),the chamber pressure,the rotation speed,and the total force.The multivariate probability distribution was constructed based on the Johnson system of distributions.Moreover,the distribution was satisfactorily verified in explaining the pairwise correlation between x and other parameters through 2 million simulation cases.At last,the distribution was used as a prior distribution to update the marginal distribution of x with any group of the relevant parameters known.The performance of the dynamic prediction was further validated by the field data of 3 shield tunnel cases.展开更多
Two classes of multivariate DMRL distributions and a class of multivariate NBUE distributions are introduced in this paper by using conditional stochastic order.That is, a random vector belongs to a multivariate DMRL ...Two classes of multivariate DMRL distributions and a class of multivariate NBUE distributions are introduced in this paper by using conditional stochastic order.That is, a random vector belongs to a multivariate DMRL class of life distributions if its residual life(defined as a conditional random vector)is decreasing in time under convex or linear order.Some conservation properties of these classes are studied.展开更多
Truncated elliptical distributions occur naturally in theoretical and applied statistics and are essential for the study of other classes of multivariate distributions.Two members of this class are the multivariate tr...Truncated elliptical distributions occur naturally in theoretical and applied statistics and are essential for the study of other classes of multivariate distributions.Two members of this class are the multivariate truncated normal and multivariate truncated t distributions.We derive statistical properties of the truncated elliptical distributions.Applications of our results establish new properties of the multivariate truncated slash and multivariate truncated power exponential distributions.展开更多
This paper investigates and discusses the use of information divergence,through the widely used Kullback–Leibler(KL)divergence,under the multivariate(generalized)γ-order normal distribution(γ-GND).The behavior of t...This paper investigates and discusses the use of information divergence,through the widely used Kullback–Leibler(KL)divergence,under the multivariate(generalized)γ-order normal distribution(γ-GND).The behavior of the KL divergence,as far as its symmetricity is concerned,is studied by calculating the divergence of γ-GND over the Student’s multivariate t-distribution and vice versa.Certain special cases are also given and discussed.Furthermore,three symmetrized forms of the KL divergence,i.e.,the Jeffreys distance,the geometric-KL as well as the harmonic-KL distances,are computed between two members of the γ-GND family,while the corresponding differences between those information distances are also discussed.展开更多
As an integral part of tolerance design in the context of design for six sigma, determining optimal product specifications has become the focus of increased activity, as manufacturing industries strive to increase pro...As an integral part of tolerance design in the context of design for six sigma, determining optimal product specifications has become the focus of increased activity, as manufacturing industries strive to increase productivity and improve the quality of their products. Although a number of research papers have been reported in the research community, there is room for improvement. Most existing research papers consider determining optimal specification limits for a single quality characteristic. In this paper, we develop the modeling and optimization procedures for optimum circular and spherical configurations by considering multiple quality characteristics. The concepts of multivariate quality loss function and truncated distribution are incorporated. This has never been adequately addressed, nor has been appropriately applied in industry. A numerical example is shown and comparison studies are made.展开更多
The probability hypothesis density(PHD) filter has been recognized as a promising technique for tracking an unknown number of targets. The performance of the PHD filter, however, is sensitive to the available knowledg...The probability hypothesis density(PHD) filter has been recognized as a promising technique for tracking an unknown number of targets. The performance of the PHD filter, however, is sensitive to the available knowledge on model parameters such as the measurement noise variance and those associated with the changes in the maneuvering target trajectories. If these parameters are unknown in advance, the tracking performance may degrade greatly. To address this aspect, this paper proposes to incorporate the adaptive parameter estimation(APE) method in the PHD filter so that the model parameters, which may be static and/or time-varying, can be estimated jointly with target states. The resulting APE-PHD algorithm is implemented using the particle filter(PF), which leads to the PF-APE-PHD filter. Simulations show that the newly proposed algorithm can correctly identify the unknown measurement noise variances, and it is capable of tracking multiple maneuvering targets with abrupt changing parameters in a more robust manner, compared to the multi-model approaches.展开更多
文摘Learning from imbalanced data is one of the greatest challenging problems in binary classification,and this problem has gained more importance in recent years.When the class distribution is imbalanced,classical machine learning algorithms tend to move strongly towards the majority class and disregard the minority.Therefore,the accuracy may be high,but the model cannot recognize data instances in the minority class to classify them,leading to many misclassifications.Different methods have been proposed in the literature to handle the imbalance problem,but most are complicated and tend to simulate unnecessary noise.In this paper,we propose a simple oversampling method based on Multivariate Gaussian distribution and K-means clustering,called GK-Means.The new method aims to avoid generating noise and control imbalances between and within classes.Various experiments have been carried out with six classifiers and four oversampling methods.Experimental results on different imbalanced datasets show that the proposed GK-Means outperforms other oversampling methods and improves classification performance as measured by F1-score and Accuracy.
基金This work is supported by the Natural Science Foundation of the Jiangsu Provincial Education Commission.
文摘Structures of monotone systems and cold standby systems with exponen-tial life distributions and dependent components are studied. It is shown that a mono-tone system composed of components with multivariate HNBUE life distributions isessentially a series system composed of components with multivariate exponential lifedistributions. Also, it is proved that for cold standby systems composed of componentswith multivariate NBU life distributions, all but oue of the components are degenerateat zero while the remaining one is exponential. In addition, several equivalent char-acterizations of multivariate exponential distribution are provided in the multivariateHNBUE life distribution class which include many existing results as special cases.
文摘This paper introduces a sliding-window mean removal high pass filter by which background clutter of infrared multispectral image is obtained. The method of selecting the optimum size of the sliding-window is based on the skewness-kurtosis test. In the end, a multivariate Gaussian distribution mathematical expression of background clutter image is given.
基金by the National Natural Science Foundation of China(Grant Nos.U1765206,51621006 and 41877256)Innovation Research Group Project of Natural Science Foundation of Hubei Province(ZRQT2020000114).
文摘Displacement is vital in the evaluations of tunnel excavation processes,as well as in determining the postexcavation stability of surrounding rock masses.The prediction of tunnel displacement is a complex problem because of the uncertainties of rock mass properties.Meanwhile,the variation and the correlation relationship of geotechnical material properties have been gradually recognized by researchers in recent years.In this paper,a novel probabilistic method is proposed to estimate the uncertainties of rock mass properties and tunnel displacement,which integrated multivariate distribution function and a relevance vector machine(RVM).The multivariate distribution function is used to establish the probability model of related random variables.RVM is coupled with the numerical simulation methods to construct the nonlinear relationship between tunnel displacements and rock mass parameters,which avoided a large number of numerical simulations.Also,the residual rock mass parameters are taken into account to reflect the brittleness of deeply buried rock mass.Then,based on the proposed method,the uncertainty of displacement in a deep tunnel of CJPL-II laboratory are analyzed and compared with the in-situ measurements.It is found that the predicted tunnel displacements by the RVM model closely match with the measured ones.The correlations of parameters have significant impacts on the uncertainty results.The uncertainty of tunnel displacement decreases while the reliability of the tunnel increases with the increases of the negative correlations among rock mass parameters.When compared to the deterministic method,the proposed approach is more rational and scientific,and also conformed to rock engineering practices.
基金supported by the National Natural Science Foundation of China (51279186,51479183)the 111 Project (B14028)The first author thanks the Chinese Scholarship Council for funding his research in University of Washington
文摘The reliability of a vertical breakwater is calculated using direct integration methods based on joint density functions.The horizontal and uplifting wave forces on the vertical breakwater can be well fitted by the lognormal and the Gumbel distributions,respectively.The joint distribution of the horizontal and uplifting wave forces is analyzed using different probabilistic distributions,including the bivariate logistic Gumbel distribution,the bivariate lognormal distribution,and three bivariate Archimedean copulas functions constructed with different marginal distributions simultaneously.We use the fully nested copulas to construct multivariate distributions taking into account related variables.Different goodness fitting tests are carried out to determine the best bivariate copula model for wave forces on a vertical breakwater.We show that a bivariate model constructed by Frank copula gives the best reliability analysis,using marginal distributions of Gumbel and lognormal to account for uplifting pressure and horizontal wave force on a vertical breakwater,respectively.The results show that failure probability of the vertical breakwater calculated by multivariate density function is comparable to those by the Joint Committee on Structural Safety methods.As copulas are suitable for constructing a bivariate or multivariate joint distribution,they have great potential in reliability analysis for other coastal structures.
文摘The paper is concerned with the basic properties of multivariate extreme value distribution (in the Logistic model). We obtain the characteristic function and recurrence formula of the density function. The explicit algebraic formula for Fisher information matrix is indicated. A simple and accurate procedure for generating random vector from multivariate extreme value distribution is presented.
基金supported bythe National Natural Science Foundation of China (Grant No.51010009)
文摘Hurricanes Katrina and Rita resulted in the largest number of platforms destroyed and damaged in the history of Gulf of Mexico operations. With the trend of global warming, sea level rising and the frequency and intensity of typhoon increase. How to determine a reasonable deck elevation against the largest hurricane waves has become a key issue in offshore platforms design and construction for the unification of economy and safety. In this paper, the multivariate compound extreme value distribution (MCEVD) model is used to predict the deck elevation with different combination of tide, surge height, and crest height. Compared with practice recommended by American Petroleum Institute (API), the prediction by MCEVD has probabilistic meaning and universality.
基金supported by the National Key Scientific Instrument and Equipment Development Project of China(No.2013YQ49087903)the National Natural Science Foundation of China(No.61202160)
文摘Accurate head poses are useful for many face-related tasks such as face recognition, gaze estimation,and emotion analysis. Most existing methods estimate head poses that are included in the training data(i.e.,previously seen head poses). To predict head poses that are not seen in the training data, some regression-based methods have been proposed. However, they focus on estimating continuous head pose angles, and thus do not systematically evaluate the performance on predicting unseen head poses. In this paper, we use a dense multivariate label distribution(MLD) to represent the pose angle of a face image. By incorporating both seen and unseen pose angles into MLD, the head pose predictor can estimate unseen head poses with an accuracy comparable to that of estimating seen head poses. On the Pointing'04 database, the mean absolute errors of results for yaw and pitch are 4.01?and 2.13?, respectively. In addition, experiments on the CAS-PEAL and CMU Multi-PIE databases show that the proposed dense MLD-based head pose estimation method can obtain the state-of-the-art performance when compared to some existing methods.
基金supported by National Natural Science Foundation of China(Grant Nos.11201452 and 11271346)the Specialized Research Fund for the Doctoral Program of Higher Education of China(Grant No.20123402120017)the Fundamental Research Funds for the Central Universities(Grant No.WK0010000052)
文摘In this paper, the Bayes estimator and the parametric empirical Bayes estimator(PEBE) of mean vector in multivariate normal distribution are obtained. The superiority of the PEBE over the minimum variance unbiased estimator(MVUE) and a revised James-Stein estimators(RJSE) are investigated respectively under mean square error(MSE) criterion. Extensive simulations are conducted to show that performance of the PEBE is optimal among these three estimators under the MSE criterion.
基金National Outstanding Youth Science Foundation of China Under Grant No.598251005
文摘Based on the De.Morgan laws and Boolean simplification, a recursive decomposition method is introduced in this paper to identify the main exclusive safe paths and failed paths of a network. The reliability or the reliability bound of a network can be conveniently expressed as the summation of the joint probabilities of these paths. Under the multivariate normal distribution assumption, a conditioned reliability index method is developed to evaluate joint probabilities of various exclusive safe paths and failed paths, and, finally, the seismic reliability or the reliability bound of an electric power system. Examples given in the paper show that the method is very simple and provides accurate results in the seismic reliability analysis.
基金This work was supported by grant PM484400 PM41500 from"High-Tech Port Research Program"founded by Ministry of Maritime Affairs and Fisheries of Korean Government.
文摘In this study, an advanced probabilistic neural network (APNN) method is proposed to reflect the global probability density function (PDF) by summing up the heterogeneous local PDF which is automatically determined in the individual standard deviation of variables. The APNN is applied to predict the stability number of armor blocks of breakwaters using the experimental data of' van der Meet, and the estimated results of the APNN are compared with those of an empirical formula and a previous artificial neural network (ANN) model. The APNN shows better results in predicting the stability number of armor bilks of breakwater and it provided the promising probabilistic viewpoints by using the individual standard deviation in a variable.
基金Supported by the National Social Science Foundation of China (No. 09BTJ012)Scientific Research Fund ofHunan Provincial Education Department (No. 09c390)+1 种基金supported in part by a HKUSeed Funding Program for Basic Research (Project No. 2009-1115-9042)a grant from Hong Kong ResearchGrant Council-General Research Fund (Project No. HKU779210M)
文摘Sampling from a truncated multivariate normal distribution (TMVND) constitutes the core computational module in fitting many statistical and econometric models. We propose two efficient methods, an iterative data augmentation (DA) algorithm and a non-iterative inverse Bayes formulae (IBF) sampler, to simulate TMVND and generalize them to multivariate normal distributions with linear inequality constraints. By creating a Bayesian incomplete-data structure, the posterior step of the DA Mgorithm directly generates random vector draws as opposed to single element draws, resulting obvious computational advantage and easy coding with common statistical software packages such as S-PLUS, MATLAB and GAUSS. Furthermore, the DA provides a ready structure for implementing a fast EM algorithm to identify the mode of TMVND, which has many potential applications in statistical inference of constrained parameter problems. In addition, utilizing this mode as an intermediate result, the IBF sampling provides a novel alternative to Gibbs sampling and elimi- nares problems with convergence and possible slow convergence due to the high correlation between components of a TMVND. The DA algorithm is applied to a linear regression model with constrained parameters and is illustrated with a published data set. Numerical comparisons show that the proposed DA algorithm and IBF sampler are more efficient than the Gibbs sampler and the accept-reject algorithm.
基金Projects(62125306, 62133003) supported by the National Natural Science Foundation of ChinaProject(TPL2019C03) supported by the Open Fund of Science and Technology on Thermal Energy and Power Laboratory,ChinaProject supported by the Fundamental Research Funds for the Central Universities(Zhejiang University NGICS Platform),China。
文摘With the increasing complexity of industrial processes, the high-dimensional industrial data exhibit a strong nonlinearity, bringing considerable challenges to the fault diagnosis of industrial processes. To efficiently extract deep meaningful features that are crucial for fault diagnosis, a sparse Gaussian feature extractor(SGFE) is designed to learn a nonlinear mapping that projects the raw data into the feature space with the fault label dimension. The feature space is described by the one-hot encoding of the fault category label as an orthogonal basis. In this way, the deep sparse Gaussian features related to fault categories can be gradually learned from the raw data by SGFE. In the feature space,the sparse Gaussian(SG) loss function is designed to constrain the distribution of features to multiple sparse multivariate Gaussian distributions. The sparse Gaussian features are linearly separable in the feature space, which is conducive to improving the accuracy of the downstream fault classification task. The feasibility and practical utility of the proposed SGFE are verified by the handwritten digits MNIST benchmark and Tennessee-Eastman(TE) benchmark process,respectively.
文摘The purpose of this paper is to present a general universal formula for <span style="font-family:Verdana;"><i></i></span><i><span><span><i><span style="font-family:Verdana;">k</span></i><span style="font-family:Verdana;"></span></span></span></i><span><span><span style="font-family:Verdana;">-variate survival functions for arbitrary </span><span style="font-family:Verdana;"><i></i></span></span></span><i><span><span><i><span style="font-family:Verdana;">k</span></i><span style="font-family:Verdana;"></span></span></span></i><span style="font-family:Verdana;"><span style="font-family:Verdana;"><span style="font-family:Verdana;"> = 2, 3, </span></span></span><span><span><span style="font-family:;" "=""><span style="font-family:Verdana;">...</span><span style="font-family:Verdana;">, given all the univariate marginal survival functions. This universal form of </span></span><span style="font-family:Verdana;"><i></i></span></span></span><i><span><span><i><span style="font-family:Verdana;">k</span></i><span style="font-family:Verdana;"></span></span></span></i><span><span><span style="font-family:Verdana;">-variate probability distributions was obtained by means of “dependence functions” named “joiners” in the text. These joiners determine all the involved stochastic dependencies between the underlying random variables. However, in order that the presented formula (the form) represents a legitimate survival function, some necessary and sufficient conditions for the joiners had to be found. Basically, finding those conditions is the main task of this paper. This task was successfully performed for the case </span><span style="font-family:Verdana;"><i></i></span></span></span><i><span><span><i><span style="font-family:Verdana;">k</span></i><span style="font-family:Verdana;"></span></span></span></i><span><span><span style="font-family:Verdana;"> = 2 and the main results for the case </span><span style="font-family:Verdana;"><i></i></span></span></span><i><span><span><i><span style="font-family:Verdana;">k</span></i><span style="font-family:Verdana;"></span></span></span></i><span><span><span style="font-family:Verdana;"> = 3 were formulated as Theorem 1 and Theorem 2 in Section 4. Nevertheless, the hypothetical conditions valid for the general </span><span style="font-family:Verdana;"><i></i></span></span></span><i><span><span><i><span style="font-family:Verdana;">k</span></i><span style="font-family:Verdana;"></span></span></span></i><span><span><span style="font-family:Verdana;"> ≥ 4 case were also formulated in Section 3 as the (very convincing) Hypothesis. As for the sufficient conditions for both the </span><span style="font-family:Verdana;"><i></i></span></span></span><i><span><span><i><span style="font-family:Verdana;">k</span></i><span style="font-family:Verdana;"></span></span></span></i><span><span><span style="font-family:;" "=""><span style="font-family:Verdana;"> = 3 and</span><i> </i></span><span style="font-family:Verdana;"><i></i></span></span></span><i><span><span><i><span style="font-family:Verdana;">k</span></i><span style="font-family:Verdana;"></span></span></span></i><span><span><span style="font-family:Verdana;"> ≥ 4 cases, the full generality was not achieved since two restrictions were imposed. Firstly, we limited ourselves to the, defined in the text, “continuous cases” (when the corresponding joint density exists and is continuous), and secondly we consider positive stochastic dependencies only. Nevertheless, the class of the </span><span style="font-family:Verdana;"><i></i></span></span></span><i><span><span><i><span style="font-family:Verdana;">k</span></i><span style="font-family:Verdana;"></span></span></span></i><span><span><span style="font-family:Verdana;">-variate distributions which can be constructed is very wide. The presented method of construction by means of joiners can be considered competitive to the </span><span style="font-family:Verdana;"><strong></strong></span></span></span><strong><span><span><b><span style="font-family:Verdana;">copula</span></b><span style="font-family:Verdana;"></span></span></span></strong><span style="font-family:Verdana;"><span style="font-family:Verdana;"><span style="font-family:Verdana;"> methodology. As it is suggested in the paper the possibility of building a common theory of both copulae and joiners is quite possible, and the joiners may play the role of tools within the theory of copulae, and vice versa copulae may, for example, be used for finding proper joiners. Another independent feature of the joiners methodology is the possibility of constructing many new stochastic processes including stationary and Markovian.</span></span></span>
基金support provided by the National Natural Science Foundation of China(Grant Nos.52078236 and 52122806)Guangzhou Metro Group Co.,Ltd(JT204-100111-23001)Chongqing Urban Investment Infrastructure Construction Co(Grant No.CQCT-JS-SC-GC-2022-0081).
文摘The probability analysis of ground deformation is becoming a trend to estimate and control the risk brought by shield tunnelling.The gap parameter is regarded as an effective tool to estimate the ground loss of tunnelling in soft soil.More specifically,x,which is a gap parameter component defined as the over(or insufficient)excavation due to the change in the posture of the shield machine,may contribute more to the uncertainty of the ground loss.However,the existing uncertainty characterization methods for x have several limitations and cannot explain the uncertain correlations between the relevant parameters.Along these lines,to better characterize the uncertainty of x,the multivariate probability distribution was developed in this work and a dynamic prediction was proposed for it.To attain this goal,1523 rings of the field data coming from the shield tunnel between Longqing Road and Baiyun Road in Kunming Metro Line 5 were utilized and 44 parameters including the construction,stratigraphic,and posture parameters were collected to form the database.According to the variance filter method,the mutual information method,and the value of the correlation coefficients,the original 44 parameters were reduced to 10 main parameters,which were unit weight,the stoke of the jacks(A,B,C,and D groups),the pressure of the pushing jacks(A,C groups),the chamber pressure,the rotation speed,and the total force.The multivariate probability distribution was constructed based on the Johnson system of distributions.Moreover,the distribution was satisfactorily verified in explaining the pairwise correlation between x and other parameters through 2 million simulation cases.At last,the distribution was used as a prior distribution to update the marginal distribution of x with any group of the relevant parameters known.The performance of the dynamic prediction was further validated by the field data of 3 shield tunnel cases.
文摘Two classes of multivariate DMRL distributions and a class of multivariate NBUE distributions are introduced in this paper by using conditional stochastic order.That is, a random vector belongs to a multivariate DMRL class of life distributions if its residual life(defined as a conditional random vector)is decreasing in time under convex or linear order.Some conservation properties of these classes are studied.
基金Conselho Nacional de Desenvolvi-mento Cientifico e Tecnologico-CNPq(Grant No.305963-2018-0).
文摘Truncated elliptical distributions occur naturally in theoretical and applied statistics and are essential for the study of other classes of multivariate distributions.Two members of this class are the multivariate truncated normal and multivariate truncated t distributions.We derive statistical properties of the truncated elliptical distributions.Applications of our results establish new properties of the multivariate truncated slash and multivariate truncated power exponential distributions.
文摘This paper investigates and discusses the use of information divergence,through the widely used Kullback–Leibler(KL)divergence,under the multivariate(generalized)γ-order normal distribution(γ-GND).The behavior of the KL divergence,as far as its symmetricity is concerned,is studied by calculating the divergence of γ-GND over the Student’s multivariate t-distribution and vice versa.Certain special cases are also given and discussed.Furthermore,three symmetrized forms of the KL divergence,i.e.,the Jeffreys distance,the geometric-KL as well as the harmonic-KL distances,are computed between two members of the γ-GND family,while the corresponding differences between those information distances are also discussed.
文摘As an integral part of tolerance design in the context of design for six sigma, determining optimal product specifications has become the focus of increased activity, as manufacturing industries strive to increase productivity and improve the quality of their products. Although a number of research papers have been reported in the research community, there is room for improvement. Most existing research papers consider determining optimal specification limits for a single quality characteristic. In this paper, we develop the modeling and optimization procedures for optimum circular and spherical configurations by considering multiple quality characteristics. The concepts of multivariate quality loss function and truncated distribution are incorporated. This has never been adequately addressed, nor has been appropriately applied in industry. A numerical example is shown and comparison studies are made.
基金supported by the National Natural Science Foundation of China (Nos. 61305017, 61304264)the Natural Science Foundation of Jiangsu Province (No. BK20130154)
文摘The probability hypothesis density(PHD) filter has been recognized as a promising technique for tracking an unknown number of targets. The performance of the PHD filter, however, is sensitive to the available knowledge on model parameters such as the measurement noise variance and those associated with the changes in the maneuvering target trajectories. If these parameters are unknown in advance, the tracking performance may degrade greatly. To address this aspect, this paper proposes to incorporate the adaptive parameter estimation(APE) method in the PHD filter so that the model parameters, which may be static and/or time-varying, can be estimated jointly with target states. The resulting APE-PHD algorithm is implemented using the particle filter(PF), which leads to the PF-APE-PHD filter. Simulations show that the newly proposed algorithm can correctly identify the unknown measurement noise variances, and it is capable of tracking multiple maneuvering targets with abrupt changing parameters in a more robust manner, compared to the multi-model approaches.