An improved Gaussian mixture model (GMM)- based clustering method is proposed for the difficult case where the true distribution of data is against the assumed GMM. First, an improved model selection criterion, the ...An improved Gaussian mixture model (GMM)- based clustering method is proposed for the difficult case where the true distribution of data is against the assumed GMM. First, an improved model selection criterion, the completed likelihood minimum message length criterion, is derived. It can measure both the goodness-of-fit of the candidate GMM to the data and the goodness-of-partition of the data. Secondly, by utilizing the proposed criterion as the clustering objective function, an improved expectation- maximization (EM) algorithm is developed, which can avoid poor local optimal solutions compared to the standard EM algorithm for estimating the model parameters. The experimental results demonstrate that the proposed method can rectify the over-fitting tendency of representative GMM-based clustering approaches and can robustly provide more accurate clustering results.展开更多
In view of the difficulty in determining remaining useful life of plant new variety right in economic analysis, Weibull Survival Analysis Method and Gaussian Model to were used to study how to accurately estimate the ...In view of the difficulty in determining remaining useful life of plant new variety right in economic analysis, Weibull Survival Analysis Method and Gaussian Model to were used to study how to accurately estimate the remaining useful life of plant new variety right. The results showed that the average life of the granted rice varieties was 10.013 years. With the increase of the age of plant variety rights, the probability of the same residual life Ttreaching x was smaller and smaller, the reliability lower and lower, while the probability of the variety rights becoming invalid became greater. The remaining useful life of a specific granted rice variety was closely related to the demonstration promotion age when the granted rice variety reached its maximum area and the appearance of alternative varieties, and when the demonstration promotion age of the granted rice variety reaching the one with the maximum area, the promotion area would be decreased once a new alternative variety appeared, correspondingly with the shortening of the remaining useful life of the variety. Therefore, Weibull Survival Analysis Method and Gaussian Model could describe the remaining useful life's time trend, as well as determine the remaining useful life of a concrete plant variety right, clarify the entire life time of varieties rights, and make the economic analysis of plant new varieties rights more accurate and reasonable.展开更多
Fault diagnosis plays an important role in complicated industrial process.It is a challenging task to detect,identify and locate faults quickly and accurately for large-scale process system.To solve the problem,a nove...Fault diagnosis plays an important role in complicated industrial process.It is a challenging task to detect,identify and locate faults quickly and accurately for large-scale process system.To solve the problem,a novel Multi Boost-based integrated ENN(extension neural network) fault diagnosis method is proposed.Fault data of complicated chemical process have some difficult-to-handle characteristics,such as high-dimension,non-linear and non-Gaussian distribution,so we use margin discriminant projection(MDP) algorithm to reduce dimensions and extract main features.Then,the affinity propagation(AP) clustering method is used to select core data and boundary data as training samples to reduce memory consumption and shorten learning time.Afterwards,an integrated ENN classifier based on Multi Boost strategy is constructed to identify fault types.The artificial data sets are tested to verify the effectiveness of the proposed method and make a detailed sensitivity analysis for the key parameters.Finally,a real industrial system—Tennessee Eastman(TE) process is employed to evaluate the performance of the proposed method.And the results show that the proposed method is efficient and capable to diagnose various types of faults in complicated chemical process.展开更多
Frequency domain wave equation forward modeling is a problem of solving large scale linear sparse systems which is often subject to the limits of computational efficiency and memory storage. Conventional Gaussian elim...Frequency domain wave equation forward modeling is a problem of solving large scale linear sparse systems which is often subject to the limits of computational efficiency and memory storage. Conventional Gaussian elimination cannot resolve the parallel computation of huge data. Therefore, we use the Gaussian elimination with static pivoting (GESP) method for sparse matrix decomposition and multi-source finite-difference modeling. The GESP method does not only improve the computational efficiency but also benefit the distributed parallel computation of matrix decomposition within a single frequency point. We test the proposed method using the classic Marmousi model. Both the single-frequency wave field and time domain seismic section show that the proposed method improves the simulation accuracy and computational efficiency and saves and makes full use of memory. This method can lay the basis for waveform inversion.展开更多
Complex industrial processes often have multiple operating modes and present time-varying behavior. The data in one mode may follow specific Gaussian or non-Gaussian distributions. In this paper, a numerically efficie...Complex industrial processes often have multiple operating modes and present time-varying behavior. The data in one mode may follow specific Gaussian or non-Gaussian distributions. In this paper, a numerically efficient movingwindow local outlier probability algorithm is proposed, lies key feature is the capability to handle complex data distributions and incursive operating condition changes including slow dynamic variations and instant mode shifts. First, a two-step adaption approach is introduced and some designed updating rules are applied to keep the monitoring model up-to-date. Then, a semi-supervised monitoring strategy is developed with an updating switch rule to deal with mode changes. Based on local probability models, the algorithm has a superior ability in detecting faulty conditions and fast adapting to slow variations and new operating modes. Finally, the utility of the proposed method is demonstrated with a numerical example and a non-isothermal continuous stirred tank reactor.展开更多
The wave crest is an important factor for the design of both fixed and floating marine structures.Wave crest height is a dominant parameter in assessing the likelihood of wave-in-deck impact and resultant severe damag...The wave crest is an important factor for the design of both fixed and floating marine structures.Wave crest height is a dominant parameter in assessing the likelihood of wave-in-deck impact and resultant severe damage.Many empirical and theoretical distribution functions for wave crest heights have been proposed,but there is a lack of agreement between them.It is of significance to develop a better new nonlinear wave crest height distribution model.The progress in the research of wave crest heights is reviewed in this paper.Based on Stokes' wave theory,an approximate nonlinear wave crest-height distribution formula with simple parameters is derived.Two sets of measured data are presented and compared with various theoretical distributions of wave crests obtained from nonlinear wave models and analysis of the comparison is given in detail.The new crest-height distribution model agrees well with observations.Also,the new theoretical distribution is more accurate than the other methods cited in this paper and has a greater range of applications.展开更多
Low pressure chemical vapor deposition(LPCVD) is one of the most important processes during semiconductor manufacturing.However,the spatial distribution of internal temperature and extremely few samples makes it hard ...Low pressure chemical vapor deposition(LPCVD) is one of the most important processes during semiconductor manufacturing.However,the spatial distribution of internal temperature and extremely few samples makes it hard to build a good-quality model of this batch process.Besides,due to the properties of this process,the reliability of the model must be taken into consideration when optimizing the MVs.In this work,an optimal design strategy based on the self-learning Gaussian process model(GPM) is proposed to control this kind of spatial batch process.The GPM is utilized as the internal model to predict the thicknesses of thin films on all spatial-distributed wafers using the limited data.Unlike the conventional model based design,the uncertainties of predictions provided by GPM are taken into consideration to guide the optimal design of manipulated variables so that the designing can be more prudent Besides,the GPM is also actively enhanced using as little data as possible based on the predictive uncertainties.The effectiveness of the proposed strategy is successfully demonstrated in an LPCVD process.展开更多
Vougiouklis & Vougiouklis have proposed the replacement of Likert scales, usually used in questionnaires, with a bar. With this proposal a discrete situation is replaced by a fuzzy one. There are identified certain a...Vougiouklis & Vougiouklis have proposed the replacement of Likert scales, usually used in questionnaires, with a bar. With this proposal a discrete situation is replaced by a fuzzy one. There are identified certain advantages concerning the use of the bar as compared to that of a scale during both the stages of filling-in as well as processing a questionnaire. The main advantage associated with business research requirements is the fact that it is expected to be much quicker to fill in and much easier to explain to participants. The bar provides the potential for different types of processing Likert scales cannot. Therefore the researchers are allowed to ascertain that the given answers follow not only the already suggested and used Gauss distribution but also a parabola distribution as it will be suggested in present paper, and which is expected to give the researcher the opportunity to "correct" this tendency. Therefore, a possibility of choosing amongst a number of alternatives is offered, by utilizing fuzzy logic in the same way as it has already been done in industry and combining mathematical models with multivalued operations.展开更多
For the two-parameter inverse Gaussian distribution denoted by IG(μ,A), the authors employ a linear Bayes procedure to estimate the parameters μ and A. The superiority of the proposed linear Bayes estimator (LBE...For the two-parameter inverse Gaussian distribution denoted by IG(μ,A), the authors employ a linear Bayes procedure to estimate the parameters μ and A. The superiority of the proposed linear Bayes estimator (LBE) over both the classical UMVUE and the maximum likelihood estimator (MLE) is established in terms of the mean squared error matrix (MSEM) criterion. Compared with the usual Bayes estimator, which is obtained by an MCMC method, the proposed LBE is simple and easy to use. Some numerical results are presented to verify that the LBE performs well.展开更多
Based on the angular spectrum method and the circular Gaussian distribution(CGD) model of scattering media,we numerically simulate light focusing through strongly scattering media.A high contrast focus in the target a...Based on the angular spectrum method and the circular Gaussian distribution(CGD) model of scattering media,we numerically simulate light focusing through strongly scattering media.A high contrast focus in the target area is produced by using feedback optimization algorithm with binary amplitude modulation.It is possible to form the focusing with one focus or multiple foci at arbitrary areas.The influence of the number of square segments of spatial light modulation on the enhancement factor of intensity is discussed.Simulation results are found to be in good agreement with theoretical analysis for light refocusing.展开更多
Turbulent dynamical systems involve dynamics with both a large dimensional phase space and a large number of positive Lyapunov exponents. Such systems are ubiqui- tous in applications in contemporary science and engin...Turbulent dynamical systems involve dynamics with both a large dimensional phase space and a large number of positive Lyapunov exponents. Such systems are ubiqui- tous in applications in contemporary science and engineering where the statistical ensemble prediction and the real time filtering/state estimation are needed despite the underlying complexity of the system. Statistically exactly solvable test models have a crucial role to provide firm mathematical underpinning or new algorithms for vastly more complex scien- tific phenomena. Here, a class of statistically exactly solvable non-Gaussian test models is introduced, where a generalized Feynman-Ka~ formulation reduces the exact behavior of conditional statistical moments to the solution to inhomogeneous Fokker-Planck equations modified by linear lower order coupling and source terms. This procedure is applied to a test model with hidden instabilities and is combined with information theory to address two important issues in the contemporary statistical prediction of turbulent dynamical systems: the coarse-grained ensemble prediction in a perfect model and the improving long range forecasting in imperfect models. The models discussed here should be use- ful for many other applications and algorithms for the real time prediction and the state estimation.展开更多
The systematic development of reduced low-dimensional stochastic climate models from observations or comprehensive high dimensional climate models is an important topic for atmospheric low-frequency variability,climat...The systematic development of reduced low-dimensional stochastic climate models from observations or comprehensive high dimensional climate models is an important topic for atmospheric low-frequency variability,climate sensitivity,and improved extended range forecasting.Recently,techniques from applied mathematics have been utilized to systematically derive normal forms for reduced stochastic climate models for low-frequency variables.It was shown that dyad and multiplicative triad interactions combine with the climatological linear operator interactions to produce a normal form with both strong nonlinear cubic dissipation and Correlated Additive and Multiplicative(CAM) stochastic noise.The probability distribution functions(PDFs) of low frequency climate variables exhibit small but significant departure from Gaussianity but have asymptotic tails which decay at most like a Gaussian.Here,rigorous upper bounds with Gaussian decay are proved for the invariant measure of general normal form stochastic models.Asymptotic Gaussian lower bounds are also established under suitable hypotheses.展开更多
We study a simplified version of the Sachdev-Ye-Kitaev(SYK) model with real interactions by exact diagonalization. Instead of satisfying a continuous Gaussian distribution, the interaction strengths are assumed to be ...We study a simplified version of the Sachdev-Ye-Kitaev(SYK) model with real interactions by exact diagonalization. Instead of satisfying a continuous Gaussian distribution, the interaction strengths are assumed to be chosen from discrete values with a finite separation. A quantum phase transition from a chaotic state to an integrable state is observed by increasing the discrete separation. Below the critical value, the discrete model can well reproduce various physical quantities of the original SYK model,including the volume law of the ground-state entanglement, level distribution, thermodynamic entropy,and out-of-time-order correlation(OTOC) functions. For systems of size up to N=20, we find that the transition point increases with system size, indicating that a relatively weak randomness of interaction can stabilize the chaotic phase. Our findings significantly relax the stringent conditions for the realization of SYK model, and can reduce the complexity of various experimental proposals down to realistic ranges.展开更多
基金The National Natural Science Foundation of China(No.61105048,60972165)the Doctoral Fund of Ministry of Education of China(No.20110092120034)+2 种基金the Natural Science Foundation of Jiangsu Province(No.BK2010240)the Technology Foundation for Selected Overseas Chinese Scholar,Ministry of Human Resources and Social Security of China(No.6722000008)the Open Fund of Jiangsu Province Key Laboratory for Remote Measuring and Control(No.YCCK201005)
文摘An improved Gaussian mixture model (GMM)- based clustering method is proposed for the difficult case where the true distribution of data is against the assumed GMM. First, an improved model selection criterion, the completed likelihood minimum message length criterion, is derived. It can measure both the goodness-of-fit of the candidate GMM to the data and the goodness-of-partition of the data. Secondly, by utilizing the proposed criterion as the clustering objective function, an improved expectation- maximization (EM) algorithm is developed, which can avoid poor local optimal solutions compared to the standard EM algorithm for estimating the model parameters. The experimental results demonstrate that the proposed method can rectify the over-fitting tendency of representative GMM-based clustering approaches and can robustly provide more accurate clustering results.
基金Supported by the National Natural Science Foundation of China(71273264)the Fundamental Research Funds for the Central Welfare Scientific Research Institutes of China(2015-14)~~
文摘In view of the difficulty in determining remaining useful life of plant new variety right in economic analysis, Weibull Survival Analysis Method and Gaussian Model to were used to study how to accurately estimate the remaining useful life of plant new variety right. The results showed that the average life of the granted rice varieties was 10.013 years. With the increase of the age of plant variety rights, the probability of the same residual life Ttreaching x was smaller and smaller, the reliability lower and lower, while the probability of the variety rights becoming invalid became greater. The remaining useful life of a specific granted rice variety was closely related to the demonstration promotion age when the granted rice variety reached its maximum area and the appearance of alternative varieties, and when the demonstration promotion age of the granted rice variety reaching the one with the maximum area, the promotion area would be decreased once a new alternative variety appeared, correspondingly with the shortening of the remaining useful life of the variety. Therefore, Weibull Survival Analysis Method and Gaussian Model could describe the remaining useful life's time trend, as well as determine the remaining useful life of a concrete plant variety right, clarify the entire life time of varieties rights, and make the economic analysis of plant new varieties rights more accurate and reasonable.
基金Project (61203021) supported by the National Natural Science Foundation of ChinaProject (2011216011) supported by the Key Science and Technology Program of Liaoning Province,China+1 种基金Project (2013020024) supported by the Natural Science Foundation of Liaoning Province,ChinaProject (LJQ2015061) supported by the Program for Liaoning Excellent Talents in Universities,China
文摘Fault diagnosis plays an important role in complicated industrial process.It is a challenging task to detect,identify and locate faults quickly and accurately for large-scale process system.To solve the problem,a novel Multi Boost-based integrated ENN(extension neural network) fault diagnosis method is proposed.Fault data of complicated chemical process have some difficult-to-handle characteristics,such as high-dimension,non-linear and non-Gaussian distribution,so we use margin discriminant projection(MDP) algorithm to reduce dimensions and extract main features.Then,the affinity propagation(AP) clustering method is used to select core data and boundary data as training samples to reduce memory consumption and shorten learning time.Afterwards,an integrated ENN classifier based on Multi Boost strategy is constructed to identify fault types.The artificial data sets are tested to verify the effectiveness of the proposed method and make a detailed sensitivity analysis for the key parameters.Finally,a real industrial system—Tennessee Eastman(TE) process is employed to evaluate the performance of the proposed method.And the results show that the proposed method is efficient and capable to diagnose various types of faults in complicated chemical process.
基金supported by China State Key Science and Technology Project on Marine Carbonate Reservoir Characterization (No. 2008ZX05004-006)
文摘Frequency domain wave equation forward modeling is a problem of solving large scale linear sparse systems which is often subject to the limits of computational efficiency and memory storage. Conventional Gaussian elimination cannot resolve the parallel computation of huge data. Therefore, we use the Gaussian elimination with static pivoting (GESP) method for sparse matrix decomposition and multi-source finite-difference modeling. The GESP method does not only improve the computational efficiency but also benefit the distributed parallel computation of matrix decomposition within a single frequency point. We test the proposed method using the classic Marmousi model. Both the single-frequency wave field and time domain seismic section show that the proposed method improves the simulation accuracy and computational efficiency and saves and makes full use of memory. This method can lay the basis for waveform inversion.
基金Supported by the National Natural Science Foundation of China(61374140)Shanghai Postdoctoral Sustentation Fund(12R21412600)+1 种基金the Fundamental Research Funds for the Central Universities(WH1214039)Shanghai Pujiang Program(12PJ1402200)
文摘Complex industrial processes often have multiple operating modes and present time-varying behavior. The data in one mode may follow specific Gaussian or non-Gaussian distributions. In this paper, a numerically efficient movingwindow local outlier probability algorithm is proposed, lies key feature is the capability to handle complex data distributions and incursive operating condition changes including slow dynamic variations and instant mode shifts. First, a two-step adaption approach is introduced and some designed updating rules are applied to keep the monitoring model up-to-date. Then, a semi-supervised monitoring strategy is developed with an updating switch rule to deal with mode changes. Based on local probability models, the algorithm has a superior ability in detecting faulty conditions and fast adapting to slow variations and new operating modes. Finally, the utility of the proposed method is demonstrated with a numerical example and a non-isothermal continuous stirred tank reactor.
基金supported by the Research Fund for the Doctoral Program of Higher Education of China (Grant No.20060423009)the Key Technological Research and Development Program of Shandong Province (Grant No.2008GGB01099)
文摘The wave crest is an important factor for the design of both fixed and floating marine structures.Wave crest height is a dominant parameter in assessing the likelihood of wave-in-deck impact and resultant severe damage.Many empirical and theoretical distribution functions for wave crest heights have been proposed,but there is a lack of agreement between them.It is of significance to develop a better new nonlinear wave crest height distribution model.The progress in the research of wave crest heights is reviewed in this paper.Based on Stokes' wave theory,an approximate nonlinear wave crest-height distribution formula with simple parameters is derived.Two sets of measured data are presented and compared with various theoretical distributions of wave crests obtained from nonlinear wave models and analysis of the comparison is given in detail.The new crest-height distribution model agrees well with observations.Also,the new theoretical distribution is more accurate than the other methods cited in this paper and has a greater range of applications.
基金Supported by the National High Technology Research and Development Program of China(2014AA041803)the National Natural Science Foundation of China(61320106009)
文摘Low pressure chemical vapor deposition(LPCVD) is one of the most important processes during semiconductor manufacturing.However,the spatial distribution of internal temperature and extremely few samples makes it hard to build a good-quality model of this batch process.Besides,due to the properties of this process,the reliability of the model must be taken into consideration when optimizing the MVs.In this work,an optimal design strategy based on the self-learning Gaussian process model(GPM) is proposed to control this kind of spatial batch process.The GPM is utilized as the internal model to predict the thicknesses of thin films on all spatial-distributed wafers using the limited data.Unlike the conventional model based design,the uncertainties of predictions provided by GPM are taken into consideration to guide the optimal design of manipulated variables so that the designing can be more prudent Besides,the GPM is also actively enhanced using as little data as possible based on the predictive uncertainties.The effectiveness of the proposed strategy is successfully demonstrated in an LPCVD process.
文摘Vougiouklis & Vougiouklis have proposed the replacement of Likert scales, usually used in questionnaires, with a bar. With this proposal a discrete situation is replaced by a fuzzy one. There are identified certain advantages concerning the use of the bar as compared to that of a scale during both the stages of filling-in as well as processing a questionnaire. The main advantage associated with business research requirements is the fact that it is expected to be much quicker to fill in and much easier to explain to participants. The bar provides the potential for different types of processing Likert scales cannot. Therefore the researchers are allowed to ascertain that the given answers follow not only the already suggested and used Gauss distribution but also a parabola distribution as it will be suggested in present paper, and which is expected to give the researcher the opportunity to "correct" this tendency. Therefore, a possibility of choosing amongst a number of alternatives is offered, by utilizing fuzzy logic in the same way as it has already been done in industry and combining mathematical models with multivalued operations.
基金supported by National Natural Science Foundation of China under Grant No.11371051
文摘For the two-parameter inverse Gaussian distribution denoted by IG(μ,A), the authors employ a linear Bayes procedure to estimate the parameters μ and A. The superiority of the proposed linear Bayes estimator (LBE) over both the classical UMVUE and the maximum likelihood estimator (MLE) is established in terms of the mean squared error matrix (MSEM) criterion. Compared with the usual Bayes estimator, which is obtained by an MCMC method, the proposed LBE is simple and easy to use. Some numerical results are presented to verify that the LBE performs well.
基金supported by the National Natural Science Foundation of China(Nos.61178015 and 11304104)
文摘Based on the angular spectrum method and the circular Gaussian distribution(CGD) model of scattering media,we numerically simulate light focusing through strongly scattering media.A high contrast focus in the target area is produced by using feedback optimization algorithm with binary amplitude modulation.It is possible to form the focusing with one focus or multiple foci at arbitrary areas.The influence of the number of square segments of spatial light modulation on the enhancement factor of intensity is discussed.Simulation results are found to be in good agreement with theoretical analysis for light refocusing.
基金Project supported by the Office of Naval Research (ONR) Grants (No. ONR DRI N00014-10-1-0554)the DOD-MURI award "Physics Constrained Stochastic-Statistical Models for Extended Range Environmental Prediction"
文摘Turbulent dynamical systems involve dynamics with both a large dimensional phase space and a large number of positive Lyapunov exponents. Such systems are ubiqui- tous in applications in contemporary science and engineering where the statistical ensemble prediction and the real time filtering/state estimation are needed despite the underlying complexity of the system. Statistically exactly solvable test models have a crucial role to provide firm mathematical underpinning or new algorithms for vastly more complex scien- tific phenomena. Here, a class of statistically exactly solvable non-Gaussian test models is introduced, where a generalized Feynman-Ka~ formulation reduces the exact behavior of conditional statistical moments to the solution to inhomogeneous Fokker-Planck equations modified by linear lower order coupling and source terms. This procedure is applied to a test model with hidden instabilities and is combined with information theory to address two important issues in the contemporary statistical prediction of turbulent dynamical systems: the coarse-grained ensemble prediction in a perfect model and the improving long range forecasting in imperfect models. The models discussed here should be use- ful for many other applications and algorithms for the real time prediction and the state estimation.
基金Project supported by the National Science Foundation Grant(No.DMS-0456713)the Office of Naval Research Grant(No.N0014-05-1-1064)
文摘The systematic development of reduced low-dimensional stochastic climate models from observations or comprehensive high dimensional climate models is an important topic for atmospheric low-frequency variability,climate sensitivity,and improved extended range forecasting.Recently,techniques from applied mathematics have been utilized to systematically derive normal forms for reduced stochastic climate models for low-frequency variables.It was shown that dyad and multiplicative triad interactions combine with the climatological linear operator interactions to produce a normal form with both strong nonlinear cubic dissipation and Correlated Additive and Multiplicative(CAM) stochastic noise.The probability distribution functions(PDFs) of low frequency climate variables exhibit small but significant departure from Gaussianity but have asymptotic tails which decay at most like a Gaussian.Here,rigorous upper bounds with Gaussian decay are proved for the invariant measure of general normal form stochastic models.Asymptotic Gaussian lower bounds are also established under suitable hypotheses.
基金This work was supported by the National Natural Science Foundation of China(11434011,11522436,11774425,11704029)the National Key R&D Program of China(2018YFA0306501)+1 种基金the Beijing Natural Science Foundation(Z180013)the Research Funds of Renmin University of China(16XNLQ03 and 18XNLQ15)。
文摘We study a simplified version of the Sachdev-Ye-Kitaev(SYK) model with real interactions by exact diagonalization. Instead of satisfying a continuous Gaussian distribution, the interaction strengths are assumed to be chosen from discrete values with a finite separation. A quantum phase transition from a chaotic state to an integrable state is observed by increasing the discrete separation. Below the critical value, the discrete model can well reproduce various physical quantities of the original SYK model,including the volume law of the ground-state entanglement, level distribution, thermodynamic entropy,and out-of-time-order correlation(OTOC) functions. For systems of size up to N=20, we find that the transition point increases with system size, indicating that a relatively weak randomness of interaction can stabilize the chaotic phase. Our findings significantly relax the stringent conditions for the realization of SYK model, and can reduce the complexity of various experimental proposals down to realistic ranges.