The large blast furnace is essential equipment in iron and steel manufacturing. Because the operation process is complex and the variables fluctuate frequently, conventional monitoring methods often raise false alarms. To address this problem, an ensemble of greedy dynamic principal component analysis-Gaussian mixture model (EGDPCA-GMM) is proposed in this paper. First, PCA-GMM is introduced to deal with the collinearity and the non-Gaussian distribution of blast furnace data. Second, to capture the dynamics of the data, a greedy algorithm is used to determine the extended variables and their corresponding time lags, so as to avoid introducing unnecessary noise. Then a bagging ensemble is combined with the greedy extension to eliminate the randomness introduced by the greedy algorithm and to further reduce the false alarm rate (FAR) of the monitoring results. Finally, the algorithm is applied to the blast furnace of a large iron and steel group in South China to verify its performance. Compared with the basic algorithms, the proposed method achieves the lowest FAR while keeping the missed alarm rate (MAR) stable.
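The dynamic extension mentioned in this abstract (appending time-lagged copies of selected variables before PCA) can be illustrated with a minimal sketch. The function name `extend_with_lags` and the `lags` dictionary are hypothetical; the abstract does not say how the greedy step scores candidate lags, so the selection is taken here as a given input rather than computed.

```python
def extend_with_lags(rows, lags):
    """Append lagged copies of selected variables to each sample.

    rows: time-ordered samples, each a list of floats.
    lags: dict mapping a column index to the time lags to append
          (assumed to come from some selection step, e.g. a greedy search).
    """
    max_lag = max((lag for ls in lags.values() for lag in ls), default=0)
    extended = []
    # The first max_lag samples have no complete history, so they are dropped.
    for t in range(max_lag, len(rows)):
        sample = list(rows[t])
        for col in sorted(lags):
            for lag in lags[col]:
                sample.append(rows[t - lag][col])
        extended.append(sample)
    return extended


data = [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0], [4.0, 40.0]]
# Extend only variable 0, with lags 1 and 2; variable 1 is left unextended.
augmented = extend_with_lags(data, {0: [1, 2]})
```

Extending only selected variables keeps the augmented matrix narrow, which matches the stated aim of avoiding unnecessary noise.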
The contribution of this work is twofold. (1) A multimodality prediction method for chaotic time series based on the Gaussian process mixture (GPM) model is proposed, which employs a divide-and-conquer strategy. It automatically divides the chaotic time series into multiple modalities with different extrinsic patterns and intrinsic characteristics, and thus can fit the chaotic time series more precisely. (2) An effective sparse hard-cut expectation maximization (SHC-EM) learning algorithm for the GPM model is proposed to improve the prediction performance. SHC-EM replaces a large learning sample set with fewer pseudo inputs and accelerates model learning based on these pseudo inputs. Experiments on Lorenz and Chua time series demonstrate that the proposed method yields not only accurate multimodality prediction but also the prediction confidence interval. SHC-EM outperforms traditional variational learning in terms of both prediction accuracy and speed. In addition, SHC-EM is more robust and less susceptible to noise than variational learning.
A dynamic soft sensor based on a single Gaussian process regression (GPR) model has been developed for fermentation processes. However, for multiphase/multimode fermentation processes, the limitations of single regression models may result in large prediction errors and a complex soft sensor. Therefore, a dynamic soft sensor based on Gaussian mixture regression (GMR) is proposed to overcome these problems. Two structure parameters, the number of Gaussian components and the order of the model, are crucial to the soft sensor model. To achieve a simple and effective soft sensor, an iterative strategy is proposed to optimize the two structure parameters simultaneously. For comparison, the proposed dynamic GMR soft sensor and the existing dynamic GPR soft sensor were both used to estimate biomass concentration in a penicillin simulation process and an industrial erythromycin fermentation process. Results show that the proposed dynamic GMR soft sensor has higher prediction accuracy and is more suitable for dynamic multiphase/multimode fermentation processes.
Delineation of the lung parenchyma in thoracic Computed Tomography (CT) is an important processing step for most pulmonary image analysis tasks, such as lung volume extraction, lung nodule detection and pulmonary vessel segmentation. An automatic method for accurate delineation of lung parenchyma in thoracic CT images is presented in this paper. The proposed method involves a segmentation phase followed by a lung boundary correction technique. The tissues in thoracic CT can be represented by a number of Gaussians. We propose a histogram-based Adaptive Multilevel Thresholding (AMT) for estimating the total number of Gaussians and their initial parameters. The parameters of the Gaussian components are updated by the Expectation Maximization (EM) algorithm. The lung parenchyma segmented by the Gaussian Mixture Model (GMM) undergoes Adaptive Morphological Filtering (AMF) to reduce the boundary errors. The proposed method has been tested on 70 diseased and 119 normal lung images from 28 cases obtained from the Lung Image Database Consortium (LIDC). The performance of the proposed system has been validated.
Techniques for oceanographic observation have made great progress in both space-time coverage and quality, so that the observation data now show some characteristics of big data. We explore the essence of global ocean dynamics by constructing a complex network based on sea surface temperature. The global ocean is divided into discrete regions that represent the nodes of the network. To understand the dynamic behavior of the ocean, we introduce Gaussian mixture models to describe the nodes as limit-cycle oscillators. The interacting dynamical oscillators form the complex network that simulates the ocean as a stochastic system. Gaussian probability matching is suggested to measure the behavioral similarity of regions. The statistical characteristics of the network are analyzed in terms of degree distribution, clustering coefficient and betweenness. Experimental results show a pronounced sensitivity of the network characteristics to climatic anomalies in the oceanic circulation. In particular, the betweenness reveals the main pathways for the transfer of thermal energy of the El Niño–Southern Oscillation. Our work provides new insights into the physical processes of ocean dynamics, as well as climate changes and ocean anomalies.
To make the quantitative results of nuclear magnetic resonance (NMR) transverse relaxation (T2) spectra reflect the type and pore structure of a reservoir more directly, an unsupervised clustering method based on the Gaussian mixture model (GMM) was developed to obtain quantitative pore structure information from NMR T2 spectra. First, principal component analysis was conducted on the T2 spectra to reduce the dimensionality of the data and the dependence among the original variables. Second, the dimension-reduced data were fitted using the GMM probability density function, and the model parameters and the optimal number of clusters were obtained according to the expectation-maximization algorithm and the change of the Akaike information criterion. Finally, the T2 spectrum features and pore structure types of the different clustering groups were analyzed and compared with the T2 geometric mean and the T2 arithmetic mean. The effectiveness of the algorithm has been verified by numerical simulation and field NMR logging data. The research shows that the clustering results of the GMM method correlate well with the shape and distribution of the T2 spectrum, the pore structure, and petroleum productivity, providing a new means for quantitative identification of pore structure, reservoir grading, and oil and gas productivity evaluation.
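The second step of this abstract (fit a GMM by expectation-maximization, then use the Akaike information criterion to choose the number of clusters) can be sketched in one dimension with the standard library alone. This is an illustration under stated assumptions, not the authors' implementation: the synthetic data, the chunk-based initialization, and the variance floor are all choices made here, and the abstract refers to "the change of" the AIC rather than the plain minimum taken below.

```python
import math
import random


def normal_pdf(x, mean, var):
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def mixture_pdf(x, weights, means, variances):
    return sum(w * normal_pdf(x, m, v) for w, m, v in zip(weights, means, variances))


def fit_gmm_1d(xs, k, iters=100, var_floor=1e-6):
    """Fit a k-component 1-D Gaussian mixture by EM; return (params, log-likelihood)."""
    n = len(xs)
    ordered = sorted(xs)
    # Deterministic start (an assumption): one component per chunk of sorted data.
    chunks = [ordered[i * n // k:(i + 1) * n // k] for i in range(k)]
    means = [sum(c) / len(c) for c in chunks]
    grand = sum(xs) / n
    variances = [max(sum((x - grand) ** 2 for x in xs) / n, var_floor)] * k
    weights = [1.0 / k] * k
    for _ in range(iters):
        # E-step: posterior probability of each component for each point.
        resp = []
        for x in xs:
            joint = [w * normal_pdf(x, m, v) for w, m, v in zip(weights, means, variances)]
            total = sum(joint) or 1e-300
            resp.append([j / total for j in joint])
        # M-step: re-estimate weight, mean and variance of each component.
        for j in range(k):
            nj = sum(r[j] for r in resp) or 1e-300
            weights[j] = nj / n
            means[j] = sum(r[j] * x for r, x in zip(resp, xs)) / nj
            spread = sum(r[j] * (x - means[j]) ** 2 for r, x in zip(resp, xs)) / nj
            variances[j] = max(spread, var_floor)
    loglik = sum(math.log(mixture_pdf(x, weights, means, variances) or 1e-300) for x in xs)
    return (weights, means, variances), loglik


def aic(loglik, k):
    # A 1-D mixture with k components has 3k - 1 free parameters.
    return 2 * (3 * k - 1) - 2 * loglik


random.seed(0)
# Synthetic stand-in for dimension-reduced data: two well-separated groups.
samples = [random.gauss(0.0, 1.0) for _ in range(100)]
samples += [random.gauss(10.0, 1.0) for _ in range(100)]
scores = {k: aic(fit_gmm_1d(samples, k)[1], k) for k in (1, 2, 3)}
best_k = min(scores, key=scores.get)
```

The AIC penalizes each extra component by its added parameters, so a larger `k` is preferred only when it raises the log-likelihood by more than that penalty.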
The currently prevalent techniques for machine performance degradation assessment estimate a machine's current condition by recognizing indications of failure features, which requires complete data collected under different conditions. However, failure data are always hard to acquire, which makes those techniques hard to apply. In this paper, a novel method that does not need failure history data is introduced. Wavelet packet decomposition (WPD) is used to extract features from raw signals, principal component analysis (PCA) is used to reduce the feature dimensions, and a Gaussian mixture model (GMM) is then applied to approximate the feature space distributions. A single-channel confidence value (SCV) is calculated from the overlap between the GMM of the monitored condition and that of the normal condition, and indicates the performance of a single channel. Furthermore, a multi-channel confidence value (MCV), which can be regarded as the overall performance index of multiple channels, is calculated via logistic regression (LR), which also completes the task of decision-level sensor fusion. Both SCV and MCV can serve as the basis on which proactive maintenance measures are taken, thus preventing machine breakdown. The method has been used to assess the performance of the turbine of a centrifugal compressor in a PetroChina factory, and the result shows that it can effectively complete this task. The proposed method has engineering significance for machine performance degradation assessment.
Accurate classification and prediction of future traffic conditions are essential for developing effective strategies for congestion mitigation on highway systems. Speed distribution is one of the traffic stream parameters that has been used to quantify traffic conditions. Previous studies have shown that a multi-modal probability distribution of speeds gives excellent results when congested and free-flow traffic conditions are evaluated simultaneously. However, most of these analytical studies do not incorporate the influencing factors when characterizing these conditions. This study evaluates the impact of traffic occupancy on the multi-state speed distribution using Bayesian Dirichlet Process Mixtures of Generalized Linear Models (DPM-GLM). Further, the study estimates the speed cut-point values of the traffic states, which separate them into homogeneous groups, using the Bayesian change-point detection (BCD) technique. The study used one year of archived 2015 traffic data collected on Florida's Interstate 295 freeway corridor. Information criteria results revealed three traffic states, which were identified as free flow, the transitional flow condition (congestion onset/offset), and the congested condition. The findings of the DPM-GLM indicated that in all estimated states, the traffic speed decreases when traffic occupancy increases. Comparison of the influence of traffic occupancy between traffic states showed that traffic occupancy has more impact on the free-flow and congested states than on the transitional flow condition. With respect to estimating the threshold speed value, the results of the BCD model revealed promising findings in characterizing levels of traffic congestion.
To meet the demand for online optimal operation, a novel soft sensor modeling approach based on Gaussian processes is proposed. The approach is moderately simple to implement and use without loss of performance. It is trained by optimizing the hyperparameters with the scaled conjugate gradient algorithm, using the squared exponential covariance function. Simulations on a real-world refinery example show the advantage of the soft sensor modeling approach. Meanwhile, the method opens new possibilities for the application of kernel methods to potential fields.
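As a rough illustration of Gaussian process prediction with a squared exponential covariance function, the sketch below computes the predictive mean and variance at one test input. The hyperparameters (`length`, `signal_var`, `noise_var`) are fixed by hand here as an assumption; the abstract instead optimizes them with scaled conjugate gradients, which is not reproduced. The helper names and the toy sine data are hypothetical.

```python
import math


def sq_exp(a, b, length=1.0, signal_var=1.0):
    """Squared exponential covariance between two scalar inputs."""
    return signal_var * math.exp(-0.5 * (a - b) ** 2 / length ** 2)


def solve(matrix, rhs):
    """Solve matrix @ x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - tail) / aug[i][i]
    return x


def gp_predict(train_x, train_y, test_x, noise_var=1e-6):
    """Return the GP predictive mean and variance at a single test input."""
    gram = [[sq_exp(a, b) + (noise_var if i == j else 0.0)
             for j, b in enumerate(train_x)]
            for i, a in enumerate(train_x)]
    k_star = [sq_exp(test_x, b) for b in train_x]
    alpha = solve(gram, train_y)   # (K + noise * I)^-1 y
    v = solve(gram, k_star)        # (K + noise * I)^-1 k_star
    mean = sum(k * a for k, a in zip(k_star, alpha))
    var = sq_exp(test_x, test_x) - sum(k * vi for k, vi in zip(k_star, v))
    return mean, var


xs = [0.0, 1.0, 2.0, 3.0]
ys = [math.sin(x) for x in xs]
mean_at_sample, var_at_sample = gp_predict(xs, ys, 1.0)
mean_far, var_far = gp_predict(xs, ys, 10.0)
```

Near a training input the predictive variance shrinks toward the noise level, while far from the data it returns to the prior variance; this uncertainty estimate is what distinguishes a Gaussian process soft sensor from a plain regression fit.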
In this paper, we propose a new soft multi-phase segmentation model in which the pixel intensities are assumed to be distributed as a Gaussian mixture. The model is formulated as a minimization problem through the use of the maximum likelihood estimator and phase-transition theory. The mixture coefficients, which are estimated using a spatially varying mean and variance procedure, are used for image segmentation. The experimental results indicate the effectiveness of the method.
In this paper, we introduce a method based on Gaussian mixture model (GMM) clustering and level sets to automatically detect intraretinal fluid in diabetic retinopathy (DR) from spectral domain optical coherence tomography (SD-OCT) images. First, each B-scan is segmented using GMM clustering. The original clustering results are refined using location and thickness information. Then, the spatial information among every five consecutive B-scans is used to search for potential fluid. Finally, the improved level-set method is used to obtain accurate boundaries. The high sensitivity and accuracy demonstrated here show its potential for the detection of fluid.
In recent years, image restoration has become a major subject, and finite mixture models have been widely used in image denoising because they are easy to model and yield highly interpretable results. The Gaussian mixture model (GMM) is the most common one. Existing image denoising methods usually assume that each component of a natural image follows the GMM. However, this assumption is not entirely reasonable. It is well known that most natural images are complex and that their distribution is not entirely Gaussian. As a result, there are still many problems that the GMM cannot solve. This paper tries to improve the finite mixture model by introducing the asymmetric Gaussian mixture model into it. Since the asymmetric Gaussian mixture model can represent asymmetric distributions on the basis of the Gaussian mixture model, it is more consistent with natural image data, so it denoises complex natural images better. We carried out image denoising experiments under different noise scales and types, and found that the asymmetric Gaussian mixture model achieves a better denoising effect and performance.
Intrusion detection is the process of investigating information about system activities or data to detect malicious behavior or unauthorized activity. Most intrusion detection systems (IDS) implement the K-means clustering technique because of its linear complexity and fast computation. Nonetheless, its naive use of the mean data value as the cluster center presents a major drawback. Two circular clusters with different radii may be centered at the same mean. The K-means algorithm cannot handle this condition because the mean values of the clusters are very similar. It also fails if the clusters are not spherical. To overcome this issue, a new hybrid model is proposed that integrates expectation-maximization (EM) clustering using a Gaussian mixture model (GMM) with a naïve Bayes classifier. In this model, the GMM gives more flexibility than K-means in terms of cluster covariance. It also uses probability functions and soft clustering, so a single data point can belong to multiple clusters. In a GMM, the cluster form is defined by two parameters: the mean and the standard deviation. With these two parameters, a cluster can take any kind of elliptical shape. EM-GMM is used to cluster data into the corresponding category based on data activity.
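The point about clusters that share a mean but differ in spread can be made concrete with a small one-dimensional sketch of soft assignment. The mixture parameters below are invented for illustration and are not taken from the paper; in practice they would be estimated by EM.

```python
import math


def gaussian_pdf(x, mean, std):
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def responsibilities(x, components):
    """Posterior probability of each (weight, mean, std) component given x."""
    joint = [w * gaussian_pdf(x, m, s) for w, m, s in components]
    total = sum(joint)
    return [j / total for j in joint]


# Two clusters centered at the same mean but with different spreads
# (hypothetical values chosen for illustration).
mixture = [(0.5, 0.0, 1.0), (0.5, 0.0, 5.0)]

near = responsibilities(0.5, mixture)  # point close to the shared center
far = responsibilities(6.0, mixture)   # point far from the shared center
```

K-means would see a single center at 0 and could not separate these two groups, whereas the mixture assigns the nearby point mostly to the narrow component and the distant point almost entirely to the wide one, while still giving each point a probability under both.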
Wireless sensor network (WSN) positioning performs well in indoor positioning, so it has received extensive attention in the positioning field. Non-line-of-sight (NLOS) propagation is a primary challenge in complex indoor environments. In this paper, a robust localization algorithm based on a Gaussian mixture model and polynomial fitting is proposed to solve the problem of NLOS error. First, fitting polynomials are used to predict the measured values. The residuals between the predicted and measured values are clustered by a Gaussian mixture model (GMM). The LOS probability and the NLOS probability are calculated according to the clustering centers. The measured values are filtered by a Kalman filter (KF), a variable parameter unscented Kalman filter (VPUKF) and a variable parameter particle filter (VPPF) in turn. The distance value processed by KF and VPUKF and the distance value processed by KF, VPUKF and VPPF are combined according to these probabilities. Finally, the maximum likelihood method is used to estimate the position coordinates. Simulation comparisons show that the proposed algorithm has better positioning accuracy than the algorithms compared in this paper, and that it is strongly robust in strong NLOS environments.
The cluster-based channel model is the mainstream approach in fifth generation mobile communications, so the accuracy of the clustering algorithm is important. The traditional Gaussian mixture model (GMM) does not consider power information, which is important for channel multipath clustering. In this paper, a normalized power weighted GMM (PGMM) is introduced to model the channel multipath components (MPCs). With MPC power as a weighting factor, the PGMM can fit the MPCs in accordance with cluster-based channel models. First, the expectation maximization (EM) algorithm is employed to optimize the PGMM parameters. Then, to further increase the searching ability of EM and to choose the optimal number of components without resorting to cross-validation, variational Bayesian (VB) inference is employed. Finally, 28 GHz indoor channel measurement data are used to demonstrate the effectiveness of the PGMM clustering algorithm.
Real engineering systems are inevitably affected by uncertain factors. Thus, Reliability-Based Multidisciplinary Design Optimization (RBMDO) has become a hotspot for recent research and application in complex engineering system design. The Second-Order and First-Order Mean-Value Saddlepoint Approximations (SOMVSA/FOMVSA) are two popular reliability analysis strategies that are widely used in RBMDO. However, the SOMVSA method can only be used efficiently when the input variables follow a Gaussian distribution, which significantly limits its application. In this study, the Gaussian Mixture Model-based Second-Order Mean-Value Saddlepoint Approximation (GMM-SOMVSA) is introduced to tackle this problem. It is integrated with the Collaborative Optimization (CO) method to solve RBMDO problems. Furthermore, the formula and procedure of RBMDO using GMM-SOMVSA-based CO (GMM-SOMVSA-CO) are proposed. Finally, an engineering example is given to show the application of the GMM-SOMVSA-CO method.
An improved approach to J-value segmentation (JSEG) is presented for unsupervised color image segmentation. Instead of a color quantization algorithm, an automatic classification method based on adaptive mean shift (AMS) clustering is used for nonparametric clustering of the image data set. The clustering results are used to construct a Gaussian mixture model (GMM) of the image data for the calculation of the soft J value. The region growing algorithm used in JSEG is then applied to segment the image based on the multiscale soft J-images. Experiments show that the synergy of JSEG and the soft classification based on AMS clustering and GMM successfully overcomes the limitations of JSEG and is more robust.
Reliable process monitoring is important for ensuring process safety and product quality. A production process is generally characterized by multiple operation modes, and monitoring these multimodal processes is challenging. Most multimodal monitoring methods rely on the assumption that the modes are independent of each other, which may not be appropriate for practical applications. This study proposes a transition-constrained Gaussian mixture model method for efficient multimodal process monitoring. This technique can reduce false and overly frequent mode transitions by considering the time series information in the mode identification of historical and online data. This enables the identified modes to reflect the stability of actual working conditions, improves mode identification accuracy, and enhances monitoring reliability in cases of mode overlap. Case studies on a numerical simulation example and a simulation of the penicillin fermentation process are provided to verify the effectiveness of the proposed approach in multimodal process monitoring with mode overlap.
Funding for the EGDPCA-GMM blast furnace monitoring study: supported by the National Natural Science Foundation of China (61903326, 61933015).
Funding for the Gaussian process mixture chaotic time series study: supported by the National Natural Science Foundation of China under Grant No 60972106, the China Postdoctoral Science Foundation under Grant No 2014M561053, the Humanity and Social Science Foundation of the Ministry of Education of China under Grant No 15YJA630108, and the Hebei Province Natural Science Foundation under Grant No E2016202341.
Funding for the dynamic GMR soft sensor study: supported by the Natural Science Foundation of Jiangsu Province of China (BK20130531), the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD[2011]6), and the Jiangsu Government Scholarship.
Funding for the ocean complex network study: supported by the National Natural Science Foundation of China (Grant Nos. U1706218, 61971388, and L1824025).
Funding for the NMR T2 spectrum clustering study: supported by the National Natural Science Foundation of China (42174142), the National Science and Technology Major Project (2017ZX05039-002), the Operation Fund of the China National Petroleum Corporation Logging Key Laboratory (2021DQ20210107-11), the Fundamental Research Funds for Central Universities (19CX02006A), and the Major Science and Technology Project of China National Petroleum Corporation (ZD2019-183-006).
Funding for the machine performance degradation assessment study: supported by the National Key Natural Science Foundation of China (Grant No. 50635010).
文摘The currently prevalent machine performance degradation assessment techniques involve estimating a machine's current condition based upon the recognition of indications of failure features,which entail complete data collected in different conditions.However,failure data are always hard to acquire,thus making those techniques hard to be applied.In this paper,a novel method which does not need failure history data is introduced.Wavelet packet decomposition(WPD) is used to extract features from raw signals,principal component analysis(PCA) is utilized to reduce feature dimensions,and Gaussian mixture model(GMM) is then applied to approximate the feature space distributions.Single-channel confidence value(SCV) is calculated by the overlap between GMM of the monitoring condition and that of the normal condition,which can indicate the performance of single-channel.Furthermore,multi-channel confidence value(MCV),which can be deemed as the overall performance index of multi-channel,is calculated via logistic regression(LR) and that the task of decision-level sensor fusion is also completed.Both SCV and MCV can serve as the basis on which proactive maintenance measures can be taken,thus preventing machine breakdown.The method has been adopted to assess the performance of the turbine of a centrifugal compressor in a factory of Petro-China,and the result shows that it can effectively complete this task.The proposed method has engineering significance for machine performance degradation assessment.
Abstract: Accurate classification and prediction of future traffic conditions are essential for developing effective congestion mitigation strategies on highway systems. Speed distribution is one of the traffic stream parameters that has been used to quantify traffic conditions. Previous studies have shown that a multi-modal probability distribution of speeds gives excellent results when congested and free-flow traffic conditions are evaluated simultaneously. However, most of these analytical studies do not incorporate influencing factors when characterizing these conditions. This study evaluates the impact of traffic occupancy on the multi-state speed distribution using Bayesian Dirichlet Process Mixtures of Generalized Linear Models (DPM-GLM). Further, the study estimates the speed cut-point values that separate traffic states into homogeneous groups using the Bayesian change-point detection (BCD) technique. The study used one year of archived 2015 traffic data collected on Florida’s Interstate 295 freeway corridor. Information criteria results revealed three traffic states, which were identified as free flow, transitional flow (congestion onset/offset), and congested flow. The DPM-GLM findings indicated that, in all estimated states, traffic speed decreases as traffic occupancy increases. A comparison across traffic states showed that traffic occupancy has more impact on the free-flow and congested states than on the transitional flow condition. With respect to estimating the threshold speed value, the results of the BCD model revealed promising findings in characterizing levels of traffic congestion.
Abstract: To meet the demand of online optimal running, a novel soft sensor modeling approach based on Gaussian processes is proposed. The approach is moderately simple to implement and use without loss of performance. It is trained by optimizing the hyperparameters with the scaled conjugate gradient algorithm, using the squared exponential covariance function. Experimental simulations on a real-world refinery example show the advantages of the soft sensor modeling approach. The method also opens new possibilities for applying kernel methods in other potential fields.
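The prediction step of a Gaussian process with a squared exponential covariance can be sketched as below. This is a minimal sketch under stated assumptions: inputs are one-dimensional, the hyperparameters (`length_scale`, `signal_var`, `noise_var`) are fixed by hand rather than optimised with scaled conjugate gradients as in the abstract, and all function names are hypothetical.

```python
import math


def sq_exp_kernel(a, b, length_scale=1.0, signal_var=1.0):
    """Squared exponential covariance between two scalar inputs."""
    return signal_var * math.exp(-0.5 * (a - b) ** 2 / length_scale ** 2)


def cholesky(A):
    """Lower-triangular Cholesky factor of a symmetric positive definite matrix."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                L[i][j] = math.sqrt(A[i][i] - s)
            else:
                L[i][j] = (A[i][j] - s) / L[j][j]
    return L


def solve_cholesky(L, b):
    """Solve (L L^T) x = b by forward then backward substitution."""
    n = len(b)
    y = [0.0] * n
    for i in range(n):
        y[i] = (b[i] - sum(L[i][k] * y[k] for k in range(i))) / L[i][i]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (y[i] - sum(L[k][i] * x[k] for k in range(i + 1, n))) / L[i][i]
    return x


def gp_predict(x_train, y_train, x_test, length_scale=1.0, signal_var=1.0, noise_var=1e-4):
    """Posterior mean and variance of the latent function at each test input."""
    n = len(x_train)
    K = [
        [
            sq_exp_kernel(x_train[i], x_train[j], length_scale, signal_var)
            + (noise_var if i == j else 0.0)
            for j in range(n)
        ]
        for i in range(n)
    ]
    L = cholesky(K)
    alpha = solve_cholesky(L, y_train)  # (K + noise * I)^-1 y
    means, variances = [], []
    for xs in x_test:
        k_star = [sq_exp_kernel(xs, xt, length_scale, signal_var) for xt in x_train]
        means.append(sum(k * a for k, a in zip(k_star, alpha)))
        v = solve_cholesky(L, k_star)
        variances.append(signal_var - sum(k * vi for k, vi in zip(k_star, v)))
    return means, variances
```

The predictive variance is what makes a GP attractive as a soft sensor: it grows back to `signal_var` for inputs far from the training data, which flags predictions that should not be trusted.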
Abstract: In this paper, we propose a new soft multi-phase segmentation model in which the pixel intensities are assumed to follow a Gaussian mixture distribution. The model is formulated as a minimization problem using the maximum likelihood estimator and phase-transition theory. The mixture coefficients, which are estimated using a spatially varying mean and variance procedure, are used for image segmentation. The experimental results indicate the effectiveness of the method.
Funding: This work was supported by the National Natural Science Foundation of China (NSFC) (Grant Nos. 61701192, 61471226 and 61671242); the Natural Science Foundation of Shandong Province, China (Youth Fund Project) under Grant No. ZR2017QF004; the Natural Science Foundation of Shandong Province (Grant Nos. JQ201516 and 2018GGXl 01018); the Taishan Scholar Project of Shandong Province (No. tsqn2016023); the Fundamental Research Funds for the Central University (30920140111004); and the China Postdoctoral Science Foundation (No. 2017M612178).
Abstract: We introduce a method based on Gaussian mixture model (GMM) clustering and level sets to automatically detect intraretinal fluid in diabetic retinopathy (DR) from spectral domain optical coherence tomography (SD-OCT) images. First, each B-scan is segmented using GMM clustering. The original clustering results are refined using location and thickness information. Then, the spatial information among every five consecutive B-scans is used to search for potential fluid. Finally, an improved level-set method is used to obtain accurate boundaries. The high sensitivity and accuracy demonstrated here show its potential for fluid detection.
Funding: This work was partly supported by the National Natural Science Foundation of China under Grant 61672293.
Abstract: In recent years, image restoration has become a major research subject, and finite mixture models have been widely used in image denoising because they are easy to build and give readily interpretable results. The Gaussian mixture model (GMM) is the most common one. Existing image denoising methods usually assume that each component of a natural image follows a GMM. However, this assumption is not entirely reasonable. Most natural images are complex, and their distribution is not entirely Gaussian. As a result, there are still many problems that the GMM cannot solve. This paper improves the finite mixture model by introducing the asymmetric Gaussian mixture model. Because the asymmetric Gaussian mixture model can represent asymmetric distributions while building on the Gaussian mixture model, it is more consistent with natural image data and denoises complex natural images better. We carried out image denoising experiments under different noise levels and noise types, and found that the asymmetric Gaussian mixture model gives a better denoising effect and performance.
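The abstract does not give the parameterisation of its asymmetric Gaussian. A common two-sided form uses one standard deviation to the left of the mode and another to the right, with a shared normalising constant so the density stays continuous and integrates to one. The sketch below assumes that form; the function names and the `(weight, mu, sigma_left, sigma_right)` tuple layout are hypothetical.

```python
import math


def asymmetric_gaussian_pdf(x, mu, sigma_left, sigma_right):
    """Two-sided Gaussian density: sigma_left applies for x < mu, sigma_right otherwise."""
    norm = 2.0 / (math.sqrt(2.0 * math.pi) * (sigma_left + sigma_right))
    sigma = sigma_left if x < mu else sigma_right
    return norm * math.exp(-0.5 * ((x - mu) / sigma) ** 2)


def asymmetric_mixture_pdf(x, components):
    """Mixture density for components given as (weight, mu, sigma_left, sigma_right)."""
    return sum(
        w * asymmetric_gaussian_pdf(x, mu, sl, sr) for (w, mu, sl, sr) in components
    )
```

When `sigma_left == sigma_right` the density reduces to the ordinary Gaussian, so under this assumed form the standard GMM is a special case of the asymmetric mixture.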
Abstract: Intrusion detection is the process of examining information about system activities or data to detect malicious behavior or unauthorized activity. Most intrusion detection systems (IDS) use the K-means clustering technique because of its linear complexity and fast computation. Nonetheless, its naive use of the mean value as the cluster center is a major drawback. Two circular clusters with different radii may be centered at the same mean, a case the K-means algorithm cannot handle because the cluster means are nearly identical. K-means also fails when the clusters are not spherical. To overcome this issue, a new hybrid model is proposed that integrates expectation-maximization (EM) clustering using a Gaussian mixture model (GMM) with a naive Bayes classifier. In this model, the GMM gives more flexibility than K-means in terms of cluster covariance. It also uses probability functions and soft clustering, so a single data point can belong to multiple clusters. In a GMM, the cluster shape is defined by two parameters: the mean and the standard deviation. With these two parameters, a cluster can take any elliptical shape. EM-GMM is used to cluster data by activity into the corresponding category.
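The case of two clusters that share a mean but differ in spread can be illustrated with the soft assignments a GMM produces. The sketch below computes posterior responsibilities for a one-dimensional mixture whose parameters are supplied by hand; it illustrates soft clustering in general, not the IDS model from the abstract, and the function names are hypothetical.

```python
import math


def gaussian_pdf(x, mean, std):
    """Density of a univariate normal distribution."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def responsibilities(x, weights, means, stds):
    """Posterior probability that x was generated by each mixture component."""
    joint = [w * gaussian_pdf(x, m, s) for w, m, s in zip(weights, means, stds)]
    total = sum(joint)
    return [j / total for j in joint]
```

With two components centered at 0 and standard deviations 1 and 5, a point near the center is assigned mostly to the narrow component and a distant point almost entirely to the wide one, whichever side of the center it lies on. K-means cannot make this distinction, because both centroids would sit at the same location.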
Funding: Supported by the National Natural Science Foundation of China under Grant Nos. 62273083 and 61973069, and by the Natural Science Foundation of Hebei Province under Grant No. F2020501012.
Abstract: Wireless sensor network (WSN) positioning performs well indoors and has therefore received extensive attention in the positioning field. Non-line-of-sight (NLOS) propagation is a primary challenge in complex indoor environments. In this paper, a robust localization algorithm based on a Gaussian mixture model and polynomial fitting is proposed to solve the problem of NLOS error. First, fitted polynomials are used to predict the measured values. The residuals between the predicted and measured values are clustered with a Gaussian mixture model (GMM). The LOS and NLOS probabilities are calculated from the cluster centers. The measured values are filtered by a Kalman filter (KF), a variable parameter unscented Kalman filter (VPUKF), and a variable parameter particle filter (VPPF) in turn. The distance value processed by the KF and VPUKF and the distance value processed by the KF, VPUKF, and VPPF are combined according to these probabilities. Finally, the maximum likelihood method is used to estimate the position coordinates. In simulation, the proposed algorithm achieves better positioning accuracy than the algorithms it is compared against, and it shows strong robustness in strong NLOS environments.
Funding: Supported by the National Science and Technology Major Program of the Ministry of Science and Technology (No. 2018ZX03001031); the Key Program of Beijing Municipal Natural Science Foundation (No. L172030); the Beijing Municipal Science & Technology Commission Project (No. Z171100005217001); the Key Project of State Key Lab of Networking and Switching Technology (NST20170205); and the National Key Technology Research and Development Program of the Ministry of Science and Technology of China (No. 2012BAF14B01).
Abstract: Cluster-based channel models are the mainstream approach for fifth generation mobile communications, so the accuracy of the clustering algorithm is important. The traditional Gaussian mixture model (GMM) does not consider power information, which is important for channel multipath clustering. In this paper, a normalized power weighted GMM (PGMM) is introduced to model the channel multipath components (MPCs). With MPC power as a weighting factor, the PGMM can fit the MPCs in accordance with cluster-based channel models. First, the expectation maximization (EM) algorithm is employed to optimize the PGMM parameters. Then, to further increase the search ability of EM and to choose the optimal number of components without resorting to cross-validation, variational Bayesian (VB) inference is employed. Finally, 28 GHz indoor channel measurement data are used to demonstrate the effectiveness of the PGMM clustering algorithm.
Funding: Support from the National Natural Science Foundation of China (Grant No. 52175130); the Sichuan Science and Technology Program (Grant No. 2021YFS0336); the China Postdoctoral Science Foundation (Grant No. 2021M700693); the 2021 Open Project of Failure Mechanics and Engineering Disaster Prevention, Key Lab of Sichuan Province (Grant No. FMEDP202104); the Fundamental Research Funds for the Central Universities (Grant No. ZYGX2019J035); the Sichuan Science and Technology Innovation Seedling Project Funding Project (Grant No. 2021112); and the Sichuan Special Equipment Inspection and Research Institute (YNJD-02-2020) is gratefully acknowledged.
Abstract: Real engineering systems are inevitably affected by uncertain factors. Reliability-Based Multidisciplinary Design Optimization (RBMDO) has therefore become a hotspot for recent research and application in complex engineering system design. The Second-Order and First-Order Mean-Value Saddlepoint Approximations (SOMVSA and FOMVSA) are two popular reliability analysis strategies that are widely used in RBMDO. However, the SOMVSA method can be used efficiently only when the input variables follow a Gaussian distribution, which significantly limits its application. In this study, the Gaussian Mixture Model-based Second-Order Mean-Value Saddlepoint Approximation (GMM-SOMVSA) is introduced to tackle this problem. It is integrated with the Collaborative Optimization (CO) method to solve RBMDO problems. Furthermore, the formula and procedure of RBMDO using GMM-SOMVSA-based CO (GMM-SOMVSA-CO) are proposed. Finally, an engineering example is given to show the application of the GMM-SOMVSA-CO method.
Abstract: An improved approach to J-value segmentation (JSEG) is presented for unsupervised color image segmentation. Instead of a color quantization algorithm, an automatic classification method based on adaptive mean shift (AMS) clustering is used for nonparametric clustering of the image data set. The clustering results are used to construct a Gaussian mixture model (GMM) of the image data for the calculation of the soft J value. The region growing algorithm used in JSEG is then applied to segment the image based on the multiscale soft J-images. Experiments show that combining JSEG with soft classification based on AMS clustering and GMM successfully overcomes the limitations of JSEG and is more robust.
Funding: Supported in part by the National Natural Science Foundation of China under Grants 61973119 and 61603138; in part by the Shanghai Rising-Star Program under Grant 20QA1402600; in part by the Open Funding from Shandong Key Laboratory of Big-data Driven Safety Control Technology for Complex Systems under Grant SKDN202001; and in part by the Programme of Introducing Talents of Discipline to Universities (the 111 Project) under Grant B17017.
Abstract: Reliable process monitoring is important for ensuring process safety and product quality. A production process is generally characterized by multiple operation modes, and monitoring these multimodal processes is challenging. Most multimodal monitoring methods rely on the assumption that the modes are independent of each other, which may not be appropriate in practical applications. This study proposes a transition-constrained Gaussian mixture model method for efficient multimodal process monitoring. The technique reduces false and overly frequent mode transitions by considering time series information in the mode identification of historical and online data. This enables the identified modes to reflect the stability of actual working conditions, improves mode identification accuracy, and enhances monitoring reliability in cases of mode overlap. Case studies on a numerical simulation example and a simulation of the penicillin fermentation process are provided to verify the effectiveness of the proposed approach in multimodal process monitoring with mode overlap.