Speed, flow, and occupancy are used together in a cluster analysis of traffic flow data based on Gaussian mixture models (GMM). Compared with other clustering and sorting techniques, the GMM is a structural model that suits various kinds of traffic flow parameters. Gap statistics and domain knowledge of traffic flow are used to determine a proper number of clusters, and the expectation-maximization (EM) algorithm is used to estimate the GMM parameters. The clustered traffic flow patterns are then analyzed statistically and used to design maximum likelihood classifiers that group real-time traffic flow data as new observations become available. Clustering analysis and pattern recognition can also be used to cluster and classify dynamic traffic flow patterns for freeway on-ramp and off-ramp weaving sections, as well as for other facilities that involve the concept of level of service, such as airports, parking lots, intersections, and interrupted-flow pedestrian facilities.
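As a rough illustration of the maximum likelihood classification step described in the traffic flow abstract, the sketch below assigns a new (speed, flow, occupancy) observation to the cluster whose Gaussian density is highest. The cluster means and variances, the diagonal-covariance simplification, and the names `log_gaussian_diag` and `classify` are assumptions made for this example; they are not taken from the paper.

```python
import math

def log_gaussian_diag(x, mean, var):
    """Log density of a Gaussian with diagonal covariance evaluated at x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def classify(x, clusters):
    """Return the index of the cluster with the highest log density at x."""
    scores = [log_gaussian_diag(x, c["mean"], c["var"]) for c in clusters]
    return max(range(len(scores)), key=scores.__getitem__)

# Hypothetical clusters over (speed km/h, flow veh/h/lane, occupancy %).
clusters = [
    {"mean": [100.0, 800.0, 8.0], "var": [64.0, 40000.0, 9.0]},     # free flow
    {"mean": [60.0, 1800.0, 25.0], "var": [100.0, 90000.0, 25.0]},  # dense
    {"mean": [20.0, 1000.0, 55.0], "var": [49.0, 62500.0, 100.0]},  # congested
]

print(classify([95.0, 900.0, 10.0], clusters))   # index of the free-flow cluster
print(classify([25.0, 1100.0, 50.0], clusters))  # index of the congested cluster
```

In practice the cluster parameters would come from the EM fit rather than being written by hand, and a full covariance matrix would capture the correlation between speed and occupancy.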
The classical Gaussian mixture model cannot detect a complete moving object and is sensitive to sudden illumination changes. To address these problems, an improved moving object detection algorithm based on the Gaussian mixture model and the three-frame difference method is proposed. When extracting the moving region, the improved three-frame difference method uses a dynamic segmentation threshold and edge detection; it is applied first to handle sudden illumination changes and discontinuities in the target edge. Then, a new adaptive strategy for selecting the number of Gaussian distributions is introduced to reduce processing time and improve detection accuracy. Finally, the HSV color space is used to remove shadow regions, and the whole moving object is detected. Experimental results show that the proposed algorithm detects moving objects effectively in various situations.
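The basic three-frame difference idea mentioned above can be sketched as follows: a pixel is marked as moving only when it differs from both the previous and the next frame. This is a minimal sketch of the standard method with a fixed threshold; the paper's improved variant uses a dynamic threshold and edge detection, which are not reproduced here. The function name and the toy frames are illustrative.

```python
def three_frame_difference(prev, curr, nxt, threshold):
    """Mark a pixel as moving when it differs from both neighbouring frames."""
    mask = []
    for row_p, row_c, row_n in zip(prev, curr, nxt):
        mask.append([
            int(abs(c - p) > threshold and abs(n - c) > threshold)
            for p, c, n in zip(row_p, row_c, row_n)
        ])
    return mask

# Toy 1x5 grayscale frames: a bright object moves one pixel right per frame.
f1 = [[10, 200, 10, 10, 10]]
f2 = [[10, 10, 200, 10, 10]]
f3 = [[10, 10, 10, 200, 10]]

mask = three_frame_difference(f1, f2, f3, threshold=50)
print(mask)  # only the object's position in the middle frame is flagged
```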
Intrusion detection is the process of examining information about system activities or data to detect malicious behavior or unauthorized activity. Most intrusion detection systems (IDS) use K-means clustering because of its linear complexity and fast computation. Its major drawback is the naive use of the mean value as the cluster center. Two circular clusters with different radii can be centered at the same mean, and K-means cannot separate them because their mean values are nearly identical. K-means also fails when the clusters are not spherical. To overcome this issue, a hybrid model is proposed that integrates expectation-maximization (EM) clustering using a Gaussian mixture model (GMM) with a naive Bayes classifier. In this model, the GMM gives more flexibility than K-means in terms of cluster covariance. It also uses probability functions and soft clustering, so a single data point can belong to multiple clusters. In a GMM, the cluster shape is defined by two parameters, the mean and the standard deviation, which allows a cluster to take any elliptical shape. EM-GMM is used to cluster data into the corresponding category based on data activity.
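The same-mean, different-radius case can be illustrated with the soft assignment (E-step) of a one-dimensional two-component mixture. This is a minimal sketch with hand-picked parameters, not the hybrid model from the paper; `normal_pdf` and `responsibilities` are names chosen for this example.

```python
import math

def normal_pdf(x, mean, std):
    """Density of a univariate Gaussian."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))

def responsibilities(x, weights, means, stds):
    """E-step for one point: posterior probability of each component."""
    joint = [w * normal_pdf(x, m, s) for w, m, s in zip(weights, means, stds)]
    total = sum(joint)
    return [j / total for j in joint]

# Two components share the mean 0.0 but differ in spread (assumed values).
weights, means, stds = [0.5, 0.5], [0.0, 0.0], [1.0, 5.0]

near = responsibilities(0.5, weights, means, stds)  # close to the shared centre
far = responsibilities(8.0, weights, means, stds)   # far from the shared centre
print(near, far)
```

A nearest-mean rule cannot tell the two points apart because both components have the same center, whereas the posterior probabilities favor the narrow component for the near point and the wide component for the far point.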
Gaussian mixture models are classical but still popular machine learning models. An appealing feature of Gaussian mixture models is their tractability: they can be learned efficiently and exactly from data, and they support efficient exact inference queries such as soft clustering of data points. Although seemingly simple, Gaussian mixture models can be hard to understand. There are at least four aspects to understanding them: the whole distribution, its individual parts (mixture components), the relationships between the parts, and the interplay of the whole and its parts. In a structured literature review of applications of Gaussian mixture models, we found a need to support all four aspects. To identify candidate visualizations that effectively serve these user needs, we structure the available design space along three representations of Gaussian mixture models: as functions, as sets of parameters, and as sampling processes. From the design space, we implemented three design concepts that visualize the overall distribution together with its components. Finally, we assessed the practical usefulness of the design concepts with respect to the different user needs in expert interviews and an insight-based user study.
The large blast furnace is essential equipment in iron and steel manufacturing. Because of the complex operation process and frequent fluctuations of variables, conventional monitoring methods often raise false alarms. To address this problem, an ensemble of greedy dynamic principal component analysis-Gaussian mixture model (EGDPCA-GMM) is proposed in this paper. First, PCA-GMM is introduced to deal with the collinearity and the non-Gaussian distribution of blast furnace data. Second, to capture the dynamics of the data, a greedy algorithm is used to determine the extended variables and their corresponding time lags, which avoids introducing unnecessary noise. Then a bagging ensemble is combined with the greedy extension to eliminate the randomness introduced by the greedy algorithm and to further reduce the false alarm rate (FAR) of the monitoring results. Finally, the algorithm is applied to the blast furnace of a large iron and steel group in South China to verify its performance. Compared with the basic algorithms, the proposed method achieves the lowest FAR while keeping the missed alarm rate (MAR) stable.
Purpose: The working environment of large-scale power grid monitoring system equipment is increasingly complex, and the probability of a fault or failure of the monitoring system is gradually increasing. This paper proposes a fault classification algorithm based on the Gaussian mixture model (GMM), which can automatically classify faults and eliminate fault sources in the monitoring system. Design/methodology/approach: The algorithm first defines the GMM and obtains the detection value of the fault classification through a method based on the causal Mason-Young-Tracy (MYT) decomposition under each normal distribution in the GMM. Then, the GMM weights are used to calculate a weighted classification value for fault detection and separation, and the fault classification results are obtained by comparing the actual control limits with the GMM classification result. Findings: The experiment on the defined non-thermostatic continuous stirred-tank reactor model shows that the proposed algorithm is superior to the traditional algorithm based on the causal MYT decomposition in fault detection and fault separation. Originality/value: The proposed algorithm addresses the problem of fault detection and fault separation in large-scale systems and provides support for troubleshooting and identifying fault sources.
A cascaded projection of the Gaussian mixture model algorithm is proposed. First, the marginal distribution of the Gaussian mixture model is computed for different feature dimensions, and a number of sub-classifiers are generated using the marginal distribution model. Each sub-classifier is based on a different feature set. A cascaded structure is adopted to fuse the sub-classifiers dynamically and achieve sample adaptation. Second, the effectiveness of the proposed algorithm is verified on electrocardiogram emotional signals and speech emotional signals. Emotional data covering fidgetiness, happiness, and sadness are collected in induction experiments. Finally, the emotion feature extraction method is discussed, including heart rate variability, the chaotic electrocardiogram feature, and the utterance-level static feature. The emotional feature reduction methods studied include principal component analysis, sequential forward selection, the Fisher discriminant ratio, and the maximal information coefficient. The experimental results show that the proposed classification algorithm effectively improves recognition accuracy in two different scenarios.
A new two-step framework is proposed for image segmentation. In the first step, the gray-value distribution of the given image is reshaped to have larger inter-class variance and smaller intra-class variance. In the second step, discriminant-based or clustering-based methods are applied to the reshaped distribution. The study focuses on typical clustering methods, namely the Gaussian mixture model (GMM) and its variant, to demonstrate the feasibility of the framework. Because the first step is independent of the second, it can be integrated into pixel-based and histogram-based methods to improve their segmentation quality. Experiments on artificial and real images show that the framework achieves effective and robust segmentation results.
To enhance the reliability of moving target detection, an adaptive moving target detection algorithm based on the Gaussian mixture model is proposed. The algorithm models the background of each pixel with Gaussian mixture distributions. The number of Gaussian distributions is not fixed but changes adaptively with the frequency of the pixel values. The pixels of the difference image are divided into two parts according to their values. The two parts are then segmented separately with an adaptive threshold to obtain the foreground image. A shadow elimination method based on morphological reconstruction is introduced to improve the segmentation of the foreground image. Experimental results show that the proposed algorithm builds the background model quickly and accurately and is more robust in different real scenes.
An improved Gaussian mixture model (GMM)-based clustering method is proposed for the difficult case where the true distribution of the data departs from the assumed GMM. First, an improved model selection criterion, the completed likelihood minimum message length criterion, is derived. It measures both the goodness-of-fit of the candidate GMM to the data and the goodness-of-partition of the data. Second, using the proposed criterion as the clustering objective function, an improved expectation-maximization (EM) algorithm is developed for estimating the model parameters; compared with the standard EM algorithm, it can avoid poor local optima. The experimental results demonstrate that the proposed method corrects the over-fitting tendency of representative GMM-based clustering approaches and robustly provides more accurate clustering results.
To enrich the diversity of artificial neurons, a type of quadratic neuron was proposed previously, in which the inner product of inputs and weights is replaced by a quadratic operation. In this paper, we demonstrate the superiority of such quadratic neurons over their conventional counterparts. For this purpose, we train quadratic neural networks using an adapted backpropagation algorithm and perform a systematic comparison between quadratic and conventional neural networks for classification of Gaussian mixture data, which is one of the most important machine learning tasks. Our results show that quadratic neural networks enjoy remarkably better efficacy and efficiency than conventional neural networks in this context and are potentially extendable to other relevant applications.
Real engineering systems are inevitably affected by uncertain factors. Reliability-Based Multidisciplinary Design Optimization (RBMDO) has therefore become a focus of recent research and application in complex engineering system design. The Second-Order and First-Order Mean-Value Saddlepoint Approximations (SOMVSA/FOMVSA) are two popular reliability analysis strategies that are widely used in RBMDO. However, the SOMVSA method can only be used efficiently when the input variables follow a Gaussian distribution, which significantly limits its application. In this study, the Gaussian Mixture Model-based Second-Order Mean-Value Saddlepoint Approximation (GMM-SOMVSA) is introduced to tackle this problem. It is integrated with the Collaborative Optimization (CO) method to solve RBMDO problems. Furthermore, the formula and procedure of RBMDO using GMM-SOMVSA-based CO (GMM-SOMVSA-CO) are proposed. Finally, an engineering example is given to show the application of the GMM-SOMVSA-CO method.
Reliable process monitoring is important for ensuring process safety and product quality. A production process is generally characterized by multiple operation modes, and monitoring these multimodal processes is challenging. Most multimodal monitoring methods assume that the modes are independent of each other, which may not be appropriate in practice. This study proposes a transition-constrained Gaussian mixture model method for efficient multimodal process monitoring. The technique reduces false and frequent mode transitions by considering time series information when identifying modes in historical and online data. As a result, the identified modes reflect the stability of actual working conditions, mode identification accuracy improves, and monitoring reliability is enhanced in cases of mode overlap. Case studies on a numerical simulation example and a simulation of the penicillin fermentation process verify the effectiveness of the proposed approach in multimodal process monitoring with mode overlap.
An improved approach to J-value segmentation (JSEG) is presented for unsupervised color image segmentation. Instead of a color quantization algorithm, an automatic classification method based on adaptive mean shift (AMS) clustering is used for nonparametric clustering of the image data set. The clustering results are used to construct a Gaussian mixture model (GMM) of the image data for calculating the soft J value. The region growing algorithm used in JSEG is then applied to segment the image based on the multiscale soft J-images. Experiments show that combining JSEG with soft classification based on AMS clustering and GMM successfully overcomes the limitations of JSEG and is more robust.
Currently prevalent machine performance degradation assessment techniques estimate a machine's current condition by recognizing failure features, which requires complete data collected under different conditions. However, failure data are hard to acquire, which makes those techniques hard to apply. In this paper, a novel method that does not need failure history data is introduced. Wavelet packet decomposition (WPD) is used to extract features from raw signals, principal component analysis (PCA) is used to reduce the feature dimensions, and a Gaussian mixture model (GMM) is then applied to approximate the feature space distributions. The single-channel confidence value (SCV) is calculated from the overlap between the GMM of the monitored condition and that of the normal condition, and it indicates the performance of a single channel. Furthermore, the multi-channel confidence value (MCV), which can be regarded as the overall performance index across channels, is calculated via logistic regression (LR), which also completes the task of decision-level sensor fusion. Both SCV and MCV can serve as a basis for proactive maintenance measures, thus preventing machine breakdown. The method has been used to assess the performance of the turbine of a centrifugal compressor in a Petro-China factory, and the result shows that it completes this task effectively. The proposed method has engineering significance for machine performance degradation assessment.
The key problem of the adaptive mixture background model is how to let the parameters adapt to the input data. To address this problem, a new method is proposed. First, the recursive equations are derived from the maximum likelihood rule. Second, the forgetting factor and the learning rate factor are redefined, and more general formulations are obtained by analyzing their practical functions. Last, the convergence of the proposed algorithm is proved according to stochastic approximation theory: the estimate converges to a local maximum of the data likelihood function. The experiments show that the proposed learning algorithm outperforms earlier methods in both convergence rate and accuracy.
Because the computational load of the joint probabilistic data association (JPDA) algorithm explodes as the number of targets increases, a multi-target tracking algorithm based on Gaussian mixture model (GMM) clustering is proposed. The algorithm clusters the measurements and constructs the association matrix between measurements and tracks from the posterior probabilities. Compared with the traditional data association algorithm, this algorithm has better tracking performance and lower computational complexity. Simulation results demonstrate the effectiveness of the proposed algorithm.
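One way to read the posterior-probability association matrix is sketched below: each track's predicted position is treated as the mean of a mixture component, and entry (i, j) is the posterior probability that measurement i belongs to track j. The isotropic covariance with a shared `sigma`, the uniform default weights, and the function name are assumptions made for this example, not details from the paper.

```python
import math

def association_matrix(measurements, track_means, sigma, weights=None):
    """Row i holds the posterior probability of each track given measurement i."""
    n_tracks = len(track_means)
    weights = weights or [1.0 / n_tracks] * n_tracks
    matrix = []
    for z in measurements:
        likes = []
        for w, mu in zip(weights, track_means):
            d2 = sum((zi - mi) ** 2 for zi, mi in zip(z, mu))
            # The Gaussian normalising constant cancels because sigma is shared.
            likes.append(w * math.exp(-0.5 * d2 / sigma ** 2))
        total = sum(likes)
        matrix.append([like / total for like in likes])
    return matrix

tracks = [(0.0, 0.0), (10.0, 0.0)]                      # predicted positions
measurements = [(0.5, 0.2), (9.6, -0.3), (5.0, 0.0)]
matrix = association_matrix(measurements, tracks, sigma=2.0)
for row in matrix:
    print([round(p, 3) for p in row])
```

The third measurement lies midway between the two tracks, so its row splits the probability evenly. A measurement far from every track would make all likelihoods underflow to zero, so a practical version would add gating or work in log space.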
To make the quantitative results of nuclear magnetic resonance (NMR) transverse relaxation (T2) spectra reflect the reservoir type and pore structure more directly, an unsupervised clustering method based on the Gaussian mixture model (GMM) was developed to obtain quantitative pore structure information from NMR T2 spectra. First, we conducted principal component analysis on the T2 spectra to reduce the dimension of the data and the dependence among the original variables. Second, the dimension-reduced data were fitted with the GMM probability density function, and the model parameters and the optimal number of clusters were obtained from the expectation-maximization algorithm and the change in the Akaike information criterion. Finally, the T2 spectrum features and pore structure types of the different clusters were analyzed and compared with the T2 geometric mean and the T2 arithmetic mean. The effectiveness of the algorithm was verified with numerical simulation and field NMR logging data. The research shows that the clustering results based on the GMM method correlate well with the shape and distribution of the T2 spectrum, the pore structure, and petroleum productivity, providing a new means for quantitative identification of pore structure, reservoir grading, and oil and gas productivity evaluation.
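Selecting the number of clusters with the Akaike information criterion can be sketched as follows, using AIC = 2p - 2 ln L, where p is the number of free parameters and L the maximized likelihood. The log-likelihood values below are invented for illustration, and the sketch simply picks the minimum AIC; the abstract refers to the change in AIC, which may correspond to a different stopping rule.

```python
def gmm_num_params(k, d):
    """Free parameters of a k-component, d-dimensional GMM with full covariances."""
    # (k - 1) mixing weights, k*d mean entries, k*d*(d+1)/2 covariance entries.
    return (k - 1) + k * d + k * d * (d + 1) // 2

def aic(log_likelihood, k, d):
    """Akaike information criterion for a fitted GMM."""
    return 2 * gmm_num_params(k, d) - 2.0 * log_likelihood

def select_k(log_likelihoods, d):
    """Pick the k with the lowest AIC; log_likelihoods maps k to max log-likelihood."""
    return min(log_likelihoods, key=lambda k: aic(log_likelihoods[k], k, d))

# Hypothetical maximised log-likelihoods from EM fits on 2-D PCA scores.
fits = {1: -520.0, 2: -455.0, 3: -430.0, 4: -427.0, 5: -425.5}
best_k = select_k(fits, d=2)
print(best_k, {k: aic(ll, k, 2) for k, ll in fits.items()})
```

With these invented values the likelihood gain beyond three components no longer offsets the penalty for the extra parameters.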
Snow cover is an important parameter in the fields of computer modeling, engineering technology, and energy development. Novel hardware and software compositions are growing extensively, creating efficient end-to-end workflows for smart cyber-physical systems (CPS). To provide accurate snow detection results for the CPS terminal, this paper proposes a snow cover detection algorithm based on the unsupervised Gaussian mixture model (GMM) for FY-4A satellite data. At present, most snow cover detection algorithms rely on optical spectrum characteristics, using the normalized difference snow index (NDSI) with thresholds in different wavebands. These algorithms require a large amount of manually labeled data for statistical analysis to obtain appropriate thresholds for the study area. Both high and low elevations in the study area must be considered, and it is difficult to extract all snow with a fixed threshold in mountainous and rugged terrain. This research avoids relying on manual analysis for different elevations by proposing an algorithm that integrates the threshold-based algorithm with the GMM. First, the threshold-based algorithm, with thresholds transferred from the analysis results of other satellites, is used to coarsely classify the surface objects. These results are then used to initialize the parameters of the GMM. Finally, the model parameters are updated by an expectation-maximization (EM) iteration, and the final results are output when the iteration ends. The results show that this algorithm adapts to mountainous terrain with different elevations and performs better than the threshold-based algorithm. Compared with orbit satellites' snow products, the accuracy of the algorithm used for FY-4A is improved by nearly 2%, and the snow detection rate is increased by nearly 6%. Moreover, compared with microwave sensors' snow products, the accuracy is increased by nearly 3%. The validation results show that the proposed algorithm adapts to complex terrain in mountainous areas and performs well under a transferred threshold without manually assigned labels.
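The coarse, threshold-based step that seeds the GMM can be sketched with the commonly used NDSI definition, (green - SWIR) / (green + SWIR). The threshold of 0.4, the reflectance values, and the function names are assumptions for illustration; the abstract does not state which thresholds or bands were transferred.

```python
def ndsi(green, swir):
    """Normalized difference snow index for one pixel; 0.0 when both bands are zero."""
    total = green + swir
    return (green - swir) / total if total else 0.0

def coarse_labels(pixels, threshold=0.4):
    """Label each (green, swir) reflectance pair: 1 = snow candidate, 0 = other."""
    return [int(ndsi(g, s) > threshold) for g, s in pixels]

# Hypothetical reflectance pairs for three pixels.
pixels = [(0.80, 0.10), (0.30, 0.25), (0.55, 0.20)]
labels = coarse_labels(pixels)
print(labels)
```

In the workflow described above, the pixels in each coarse class would supply the initial means, variances, and weights of the GMM components before the EM iterations refine them.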
When the classical dark channel prior method is used to process hazy images with large sky regions, color distortion appears in the sky region after dehazing. To solve this problem, an improved dark channel image dehazing method based on the Gaussian mixture model is proposed. First, the hazy image is modeled with a Gaussian mixture model, and the expectation-maximization (EM) algorithm is used to optimize the parameters so that the image can be divided into a sky region and a non-sky region. Second, the sky region is divided into light, medium, and heavy haze regions according to their dark channel values, and the transmission is estimated for each region separately. Third, the restored image is obtained by combining the result with the atmospheric scattering model. Finally, adaptive local tone mapping for high dynamic range images is used to adjust the brightness of the restored image. The experimental results show that the proposed method effectively eliminates the color distortion in the sky region, and the restored image is clearer and has a better visual effect.
Funding (traffic flow clustering study): The US National Science Foundation (No. CMMI-0408390, CMMI-0644552); the American Chemical Society Petroleum Research Foundation (No. PRF-44468-G9); the Research Fellowship for International Young Scientists (No. 51050110143); the Fok Ying-Tong Education Foundation (No. 114024); the Natural Science Foundation of Jiangsu Province (No. BK2009015); the Postdoctoral Science Foundation of Jiangsu Province (No. 0901005C).
Funding (Gaussian mixture model visualization study): Carl Zeiss Foundation, within the project Interactive Inference.
Funding (blast furnace monitoring study): Supported by the National Natural Science Foundation of China (61903326, 61933015).
文摘The large blast furnace is essential equipment in the process of iron and steel manufacturing. Due to the complex operation process and frequent fluctuations of variables, conventional monitoring methods often bring false alarms. To address the above problem, an ensemble of greedy dynamic principal component analysis-Gaussian mixture model(EGDPCA-GMM) is proposed in this paper. First, PCA-GMM is introduced to deal with the collinearity and the non-Gaussian distribution of blast furnace data.Second, in order to explain the dynamics of data, the greedy algorithm is used to determine the extended variables and their corresponding time lags, so as to avoid introducing unnecessary noise. Then the bagging ensemble is adopted to cooperate with greedy extension to eliminate the randomness brought by the greedy algorithm and further reduce the false alarm rate(FAR) of monitoring results. Finally, the algorithm is applied to the blast furnace of a large iron and steel group in South China to verify performance.Compared with the basic algorithms, the proposed method achieves lowest FAR, while keeping missed alarm rate(MAR) remain stable.
Abstract: Purpose: The working environment of large-scale power grid monitoring system equipment is increasingly complex, and the probability of fault or failure of the monitoring system is gradually increasing. This paper proposes a fault classification algorithm based on the Gaussian mixture model (GMM), which can automatically classify faults and eliminate fault sources in the monitoring system. Design/methodology/approach: The algorithm first defines the GMM and obtains the detection value of the fault classification through a method based on the causal Mason Young Tracy (MYT) decomposition under each normal distribution in the GMM. Then, the weight values of the GMM are used to calculate the weighted classification value of fault detection and separation, and the fault classification results are obtained by comparing the actual control limits with the classification result of the GMM. Findings: The experiment on the defined non-thermostatic continuous stirred-tank reactor model shows that the proposed algorithm is superior to the traditional algorithm based on the causal MYT decomposition in fault detection and fault separation. Originality/value: The proposed algorithm addresses the problem of fault detection and fault separation in large-scale systems and provides support for troubleshooting and identifying fault sources.
Funding: The National Natural Science Foundation of China (No. 61231002, 61273266, 51075068, 61271359); Doctoral Fund of Ministry of Education of China (No. 20110092130004).
Abstract: A cascaded projection of the Gaussian mixture model algorithm is proposed. First, the marginal distribution of the Gaussian mixture model is computed for different feature dimensions, and a number of sub-classifiers are generated using the marginal distribution model. Each sub-classifier is based on a different feature set. A cascaded structure is adopted to fuse the sub-classifiers dynamically to achieve sample adaptation ability. Second, the effectiveness of the proposed algorithm is verified on electrocardiogram emotional signals and speech emotional signals. Emotional data including fidgetiness, happiness and sadness are collected by induction experiments. Finally, the emotion feature extraction methods are discussed, including heart rate variability, the chaotic electrocardiogram feature and the utterance-level static feature. The emotional feature reduction methods are studied, including principal component analysis, sequential forward selection, the Fisher discriminant ratio and the maximal information coefficient. The experimental results show that the proposed classification algorithm can effectively improve recognition accuracy in two different scenarios.
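The marginalization step mentioned in this abstract relies on a standard property: the marginal of a GMM over a subset of features is again a GMM with the same weights, where each component keeps only the selected entries of its mean and the matching rows and columns of its covariance. The sketch below shows that property only; the function name and the example model are hypothetical, and how the paper builds and cascades its sub-classifiers is not reproduced here.

```python
def marginalize_gmm(weights, means, covariances, keep):
    """Marginal of a GMM over the feature indices listed in `keep`.

    Marginalizing a Gaussian drops the unused entries of its mean and the
    unused rows and columns of its covariance; mixture weights are unchanged.
    """
    sub_means = [[mean[i] for i in keep] for mean in means]
    sub_covs = [[[cov[i][j] for j in keep] for i in keep] for cov in covariances]
    return list(weights), sub_means, sub_covs

# Hypothetical model with 3 features and 2 components.
weights = [0.3, 0.7]
means = [[0.0, 1.0, 2.0], [5.0, 6.0, 7.0]]
covariances = [
    [[1.0, 0.1, 0.2], [0.1, 2.0, 0.3], [0.2, 0.3, 3.0]],
    [[4.0, 0.0, 0.5], [0.0, 5.0, 0.0], [0.5, 0.0, 6.0]],
]

# Marginal model over features 0 and 2, e.g. as input to one sub-classifier.
w, m, c = marginalize_gmm(weights, means, covariances, keep=[0, 2])
```

Because no refitting is needed, one full model can supply a marginal model for every feature subset of interest.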
Funding: Supported by the National Natural Science Foundation of China (60505004, 60773061).
Abstract: A new two-step framework is proposed for image segmentation. In the first step, the gray-value distribution of the given image is reshaped to have larger inter-class variance and smaller intra-class variance. In the second step, discriminant-based or clustering-based methods are performed on the reshaped distribution. The study focuses on typical clustering methods, the Gaussian mixture model (GMM) and its variant, to demonstrate the feasibility of the framework. Because the first step is independent of the second, it can be integrated into pixel-based and histogram-based methods to improve their segmentation quality. The experiments on artificial and real images show that the framework can achieve effective and robust segmentation results.
Funding: The National Natural Science Foundation of China (No. 61172135, 61101198); the Aeronautical Foundation of China (No. 20115152026).
Abstract: To enhance the reliability of moving target detection, an adaptive moving target detection algorithm based on the Gaussian mixture model is proposed. The algorithm employs Gaussian mixture distributions to model the background of each pixel. The number of Gaussian distributions is not fixed but changes adaptively with the pixel value frequency. The pixels of the difference image are divided into two parts according to their values. The two parts are then segmented separately by an adaptive threshold, and finally the foreground image is obtained. A shadow elimination method based on morphological reconstruction is introduced to improve the segmentation of the foreground image. Experimental results show that the proposed algorithm can quickly and accurately build the background model and that it is more robust in different real scenes.
Funding: The National Natural Science Foundation of China (No. 61105048, 60972165); the Doctoral Fund of Ministry of Education of China (No. 20110092120034); the Natural Science Foundation of Jiangsu Province (No. BK2010240); the Technology Foundation for Selected Overseas Chinese Scholar, Ministry of Human Resources and Social Security of China (No. 6722000008); the Open Fund of Jiangsu Province Key Laboratory for Remote Measuring and Control (No. YCCK201005).
Abstract: An improved Gaussian mixture model (GMM)-based clustering method is proposed for the difficult case where the true distribution of the data deviates from the assumed GMM. First, an improved model selection criterion, the completed likelihood minimum message length criterion, is derived. It can measure both the goodness-of-fit of the candidate GMM to the data and the goodness-of-partition of the data. Second, by using the proposed criterion as the clustering objective function, an improved expectation-maximization (EM) algorithm is developed, which can avoid the poor local optima that the standard EM algorithm may reach when estimating the model parameters. The experimental results demonstrate that the proposed method can rectify the over-fitting tendency of representative GMM-based clustering approaches and can robustly provide more accurate clustering results.
Funding: This work was supported in part by NIH, Nos. R01CA237267, R01HL151561, R21CA264772, and R01EB032716.
Abstract: To enrich the diversity of artificial neurons, a type of quadratic neuron was proposed previously, in which the inner product of inputs and weights is replaced by a quadratic operation. In this paper, we demonstrate the superiority of such quadratic neurons over conventional counterparts. For this purpose, we train quadratic neural networks using an adapted backpropagation algorithm and perform a systematic comparison between quadratic and conventional neural networks for classification of Gaussian mixture data, which is one of the most important machine learning tasks. Our results show that quadratic neural networks enjoy remarkably better efficacy and efficiency than conventional neural networks in this context, and that they are potentially extendable to other relevant applications.
Funding: Support from the National Natural Science Foundation of China (Grant No. 52175130), the Sichuan Science and Technology Program (Grant No. 2021YFS0336), the China Postdoctoral Science Foundation (Grant No. 2021M700693), the 2021 Open Project of Failure Mechanics and Engineering Disaster Prevention, Key Lab of Sichuan Province (Grant No. FMEDP202104), the Fundamental Research Funds for the Central Universities (Grant No. ZYGX2019J035), the Sichuan Science and Technology Innovation Seedling Project Funding Project (Grant No. 2021112), and the Sichuan Special Equipment Inspection and Research Institute (YNJD-02-2020) is gratefully acknowledged.
Abstract: Actual engineering systems are inevitably affected by uncertain factors. Thus, Reliability-Based Multidisciplinary Design Optimization (RBMDO) has become a hotspot for recent research and application in complex engineering system design. The Second-Order/First-Order Mean-Value Saddlepoint Approximation (SOMVSA/FOMVSA) methods are two popular reliability analysis strategies that are widely used in RBMDO. However, the SOMVSA method can only be used efficiently when the input variables follow a Gaussian distribution, which significantly limits its application. In this study, the Gaussian Mixture Model-based Second-Order Mean-Value Saddlepoint Approximation (GMM-SOMVSA) is introduced to tackle this problem. It is integrated with the Collaborative Optimization (CO) method to solve RBMDO problems. Furthermore, the formula and procedure of RBMDO using GMM-SOMVSA-Based CO (GMM-SOMVSA-CO) are proposed. Finally, an engineering example is given to show the application of the GMM-SOMVSA-CO method.
Funding: Supported in part by National Natural Science Foundation of China under Grants 61973119 and 61603138, in part by Shanghai Rising-Star Program under Grant 20QA1402600, in part by the Open Funding from Shandong Key Laboratory of Big-data Driven Safety Control Technology for Complex Systems under Grant SKDN202001, and in part by the Programme of Introducing Talents of Discipline to Universities (the 111 Project) under Grant B17017.
Abstract: Reliable process monitoring is important for ensuring process safety and product quality. A production process is generally characterized by multiple operation modes, and monitoring these multimodal processes is challenging. Most multimodal monitoring methods rely on the assumption that the modes are independent of each other, which may not be appropriate for practical application. This study proposes a transition-constrained Gaussian mixture model method for efficient multimodal process monitoring. This technique can reduce false and overly frequent mode transitions by considering the time series information in the mode identification of historical and online data. This enables the identified modes to reflect the stability of actual working conditions, improves mode identification accuracy, and enhances monitoring reliability in cases of mode overlap. Case studies on a numerical simulation example and a simulation of the penicillin fermentation process are provided to verify the effectiveness of the proposed approach in multimodal process monitoring with mode overlap.
Abstract: An improved approach to J-value segmentation (JSEG) is presented for unsupervised color image segmentation. Instead of a color quantization algorithm, an automatic classification method based on adaptive mean shift (AMS) clustering is used for nonparametric clustering of the image data set. The clustering results are used to construct a Gaussian mixture model (GMM) of the image data for the calculation of the soft J value. The region growing algorithm used in JSEG is then applied to segment the image based on the multiscale soft J-images. Experiments show that the synergy of JSEG and the soft classification based on AMS clustering and GMM successfully overcomes the limitations of JSEG and is more robust.
Funding: Supported by National Key Natural Science Foundation of China (Grant No. 50635010).
Abstract: The currently prevalent machine performance degradation assessment techniques estimate a machine's current condition by recognizing indications of failure features, which requires complete data collected under different conditions. However, failure data are always hard to acquire, which makes those techniques hard to apply. In this paper, a novel method that does not need failure history data is introduced. Wavelet packet decomposition (WPD) is used to extract features from raw signals, principal component analysis (PCA) is used to reduce feature dimensions, and a Gaussian mixture model (GMM) is then applied to approximate the feature space distributions. A single-channel confidence value (SCV) is calculated from the overlap between the GMM of the monitored condition and that of the normal condition, and it indicates the performance of a single channel. Furthermore, a multi-channel confidence value (MCV), which can be regarded as the overall performance index across channels, is calculated via logistic regression (LR), which also completes the task of decision-level sensor fusion. Both SCV and MCV can serve as the basis on which proactive maintenance measures can be taken, thus preventing machine breakdown. The method has been adopted to assess the performance of the turbine of a centrifugal compressor in a factory of Petro-China, and the result shows that it can effectively complete this task. The proposed method has engineering significance for machine performance degradation assessment.
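The abstract does not state how the overlap between the two GMMs is measured. One common choice, assumed here purely for illustration, is the normalized L2 inner product of the two densities, which has a closed form for Gaussians because the integral of N(x; m1, v1) N(x; m2, v2) over x equals N(m1; m2, v1 + v2). The one-dimensional sketch below uses hypothetical function names and example values.

```python
import math

def gauss_product_integral(m1, v1, m2, v2):
    """Closed form of the integral of N(x; m1, v1) * N(x; m2, v2) over x."""
    var = v1 + v2
    return math.exp(-0.5 * (m1 - m2) ** 2 / var) / math.sqrt(2.0 * math.pi * var)

def inner_product(gmm_a, gmm_b):
    """L2 inner product of two 1-D GMMs given as lists of (weight, mean, var)."""
    return sum(
        wa * wb * gauss_product_integral(ma, va, mb, vb)
        for wa, ma, va in gmm_a
        for wb, mb, vb in gmm_b
    )

def confidence_value(gmm_normal, gmm_current):
    """Normalized overlap in [0, 1]; 1 means the two densities coincide.

    Assumption: this normalization is one plausible overlap measure,
    not necessarily the one used in the paper.
    """
    cross = inner_product(gmm_normal, gmm_current)
    norm = math.sqrt(
        inner_product(gmm_normal, gmm_normal)
        * inner_product(gmm_current, gmm_current)
    )
    return cross / norm

# Illustrative feature distributions (hypothetical values).
normal_state = [(0.6, 0.0, 1.0), (0.4, 3.0, 0.5)]
slightly_shifted = [(0.6, 0.5, 1.0), (0.4, 3.5, 0.5)]
far_shifted = [(1.0, 20.0, 1.0)]

cv_same = confidence_value(normal_state, normal_state)
cv_near = confidence_value(normal_state, slightly_shifted)
cv_far = confidence_value(normal_state, far_shifted)
```

Under this assumed measure, the value falls from 1 toward 0 as the monitored feature distribution drifts away from the normal-condition model, so no failure data are needed to compute it.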
Funding: The Doctorate Foundation of the Engineering College, Air Force Engineering University.
Abstract: The key problem of the adaptive mixture background model is how the parameters can change adaptively according to the input data. To address this problem, a new method is proposed. First, the recursive equations are derived based on the maximum likelihood rule. Second, the forgetting factor and the learning rate factor are redefined, and more general formulations of them are obtained by analyzing their practical functions. Last, the convergence of the proposed algorithm is proved according to stochastic approximation theory, showing that the estimate converges to a local maximum of the data likelihood function. The experiments show that the proposed learning algorithm outperforms previous ones in both convergence rate and accuracy.
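The paper's own recursive equations are not given in the abstract. As a point of reference only, the sketch below shows the familiar exponential-forgetting update of a single Gaussian's mean and variance, in which a learning rate `rate` weights the new sample and `1 - rate` acts as the forgetting factor. The function name and the values used are hypothetical.

```python
def online_update(mean, var, x, rate):
    """One recursive update of a Gaussian's mean and variance.

    `rate` in (0, 1] is the learning rate; (1 - rate) is the forgetting
    factor applied to the previous estimate. This is a generic update,
    not the redefined factors proposed in the paper.
    """
    new_mean = (1.0 - rate) * mean + rate * x
    new_var = (1.0 - rate) * var + rate * (x - new_mean) ** 2
    return new_mean, new_var

# A pixel whose intensity settles at 10.0: the estimate tracks it over time.
mean, var = 0.0, 1.0
for _ in range(500):
    mean, var = online_update(mean, var, 10.0, rate=0.05)
```

A larger rate adapts faster to scene changes but forgets the background sooner, which is the trade-off that motivates redefining the two factors.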
Funding: The National Natural Science Foundation of China (61771367); the Science and Technology on Communication Networks Laboratory (HHS19641X003).
Abstract: Since the joint probabilistic data association (JPDA) algorithm suffers from a computational explosion as the number of targets increases, a multi-target tracking algorithm based on Gaussian mixture model (GMM) clustering is proposed. The algorithm clusters the measurements, and the association matrix between measurements and tracks is constructed from the posterior probabilities. Compared with the traditional data association algorithm, this algorithm has better tracking performance and lower computational complexity. Simulation results demonstrate the effectiveness of the proposed algorithm.
Funding: Supported by the National Natural Science Foundation of China (42174142); National Science and Technology Major Project (2017ZX05039-002); Operation Fund of China National Petroleum Corporation Logging Key Laboratory (2021DQ20210107-11); Fundamental Research Funds for Central Universities (19CX02006A); Major Science and Technology Project of China National Petroleum Corporation (ZD2019-183-006).
Abstract: To make the quantitative results of nuclear magnetic resonance (NMR) transverse relaxation (T2) spectra reflect the type and pore structure of a reservoir more directly, an unsupervised clustering method based on the Gaussian mixture model (GMM) was developed to obtain quantitative pore structure information from NMR T2 spectra. First, principal component analysis was conducted on the T2 spectra to reduce the data dimension and the dependence among the original variables. Second, the dimension-reduced data were fitted using the GMM probability density function, and the model parameters and the optimal number of clusters were obtained according to the expectation-maximization algorithm and the change of the Akaike information criterion. Finally, the T2 spectrum features and pore structure types of the different clusters were analyzed and compared with the T2 geometric mean and the T2 arithmetic mean. The effectiveness of the algorithm has been verified by numerical simulation and field NMR logging data. The research shows that the clustering results based on the GMM method correlate well with the shape and distribution of the T2 spectrum, pore structure, and petroleum productivity, providing a new means for quantitative identification of pore structure, reservoir grading, and oil and gas productivity evaluation.
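The second step, fitting a GMM by expectation-maximization and choosing the number of clusters by the Akaike information criterion (AIC), can be sketched in one dimension as follows. This is a minimal illustration under stated assumptions: the data are made up, the one-dimensional input stands in for the dimension-reduced spectra, and the initialization, the fixed iteration count and the candidate cluster numbers are arbitrary choices rather than the authors' settings.

```python
import math

def normal_pdf(x, mean, var):
    """Density of a univariate normal distribution N(mean, var) at x."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)

def fit_gmm_em(data, k, iterations=100, min_var=1e-6):
    """Fit a 1-D GMM with k components by EM; return parameters and log-likelihood."""
    lo, hi = min(data), max(data)
    # Deterministic start: means spread evenly over the data range.
    means = [lo + (i + 0.5) * (hi - lo) / k for i in range(k)]
    centre = sum(data) / len(data)
    spread = sum((x - centre) ** 2 for x in data) / len(data)
    variances = [max(spread, min_var)] * k
    weights = [1.0 / k] * k
    for _ in range(iterations):
        # E-step: responsibility of each component for each point.
        resp = []
        for x in data:
            joint = [w * normal_pdf(x, m, v)
                     for w, m, v in zip(weights, means, variances)]
            total = sum(joint)
            resp.append([j / total for j in joint])
        # M-step: re-estimate weights, means and variances.
        for j in range(k):
            nj = max(sum(r[j] for r in resp), 1e-12)
            weights[j] = nj / len(data)
            means[j] = sum(r[j] * x for r, x in zip(resp, data)) / nj
            var_j = sum(r[j] * (x - means[j]) ** 2 for r, x in zip(resp, data)) / nj
            variances[j] = max(var_j, min_var)  # guard against collapse
    log_lik = sum(
        math.log(sum(w * normal_pdf(x, m, v)
                     for w, m, v in zip(weights, means, variances)))
        for x in data
    )
    return weights, means, variances, log_lik

def aic(log_lik, k):
    """AIC = 2p - 2 log L, with p = 3k - 1 free parameters for a 1-D GMM."""
    return 2 * (3 * k - 1) - 2 * log_lik

# Made-up data with two well-separated groups.
data = [-0.5, -0.2, 0.0, 0.1, 0.4, 9.6, 9.9, 10.0, 10.2, 10.5]
scores = {k: aic(fit_gmm_em(data, k)[3], k) for k in (1, 2)}
best_k = min(scores, key=scores.get)
weights, means, variances, _ = fit_gmm_em(data, best_k)
```

The AIC penalizes each extra component by its added parameters, so a larger model is preferred only when it raises the log-likelihood enough to offset the penalty.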
Funding: This study was jointly supported by National Science Foundation of China (41661144039, 41875027 and 41871238).
Abstract: Snow cover is an important parameter in the fields of computer modeling, engineering technology and energy development. Novel hardware and software compositions are growing extensively, creating efficient end-to-end workflows for smart cyber-physical systems (CPS). To provide accurate snow detection results for the CPS terminal, this paper proposes a snow cover detection algorithm based on the unsupervised Gaussian mixture model (GMM) for FY-4A satellite data. At present, most snow cover detection algorithms mainly use the characteristics of the optical spectrum and are based on the normalized difference snow index (NDSI) with thresholds in different wavebands. These algorithms require a large amount of manually labeled data for statistical analysis to obtain appropriate thresholds for the study area. Consideration must be given to both the high and low elevations in the study area, and it is difficult to extract all snow with a fixed threshold in mountainous and rugged terrain. In this research, we avoid relying on manual analysis for different elevations. Therefore, an algorithm is proposed that integrates the threshold-based algorithm with the GMM. First, the threshold-based algorithm, with thresholds transferred from the analysis results of other satellites, is used to coarsely classify the surface objects. These results are then used to initialize the parameters of the GMM. Finally, the parameters of the model are updated by an expectation-maximization (EM) iteration algorithm, and the final results are output when the iteration terminates. The results show that this algorithm can adjust itself to mountainous terrain with different elevations and performs better than the threshold-based algorithm. Compared with orbit satellites' snow products, the accuracy of the algorithm used for FY-4A is improved by nearly 2%, and the snow detection rate is increased by nearly 6%. Moreover, compared with microwave sensors' snow products, the accuracy is increased by nearly 3%. The validation results show that the proposed algorithm can be adapted to a complex terrain environment in mountainous areas and performs well under a transferred threshold without manually assigned labels.
Funding: National Natural Science Foundation of China (Nos. 61841303, 61963023); Project of Humanities and Social Sciences of Ministry of Education in China (No. 19YJC760012).
Abstract: When the classical dark channel prior method is used to process hazy images with large sky regions, color distortion appears in the sky region after dehazing. To solve this problem, an improved dark channel image dehazing method based on the Gaussian mixture model is proposed. First, the Gaussian mixture model is used to model the hazy image, and the expectation-maximization (EM) algorithm is used to optimize the parameters, so that the hazy image can be divided into a sky region and a non-sky region. Second, the sky region is divided into a light haze region, a medium haze region and a heavy haze region according to the dark channel values, and the transmission is estimated for each region separately. Third, the restored image is obtained by combining the atmospheric scattering model. Finally, adaptive local tone mapping for high dynamic range images is used to adjust the brightness of the restored image. The experimental results show that the proposed method can effectively eliminate the color distortion in the sky region, and that the restored image is clearer and has a better visual effect.