An improved Gaussian mixture model (GMM)- based clustering method is proposed for the difficult case where the true distribution of data is against the assumed GMM. First, an improved model selection criterion, the ...An improved Gaussian mixture model (GMM)- based clustering method is proposed for the difficult case where the true distribution of data is against the assumed GMM. First, an improved model selection criterion, the completed likelihood minimum message length criterion, is derived. It can measure both the goodness-of-fit of the candidate GMM to the data and the goodness-of-partition of the data. Secondly, by utilizing the proposed criterion as the clustering objective function, an improved expectation- maximization (EM) algorithm is developed, which can avoid poor local optimal solutions compared to the standard EM algorithm for estimating the model parameters. The experimental results demonstrate that the proposed method can rectify the over-fitting tendency of representative GMM-based clustering approaches and can robustly provide more accurate clustering results.展开更多
Since in most blind source separation(BSS)algorithms the estimations of probability density function(pdf)of sources are fixed or can only switch between one sup-Gaussian and other sub-Gaussian model,they may not be ef...Since in most blind source separation(BSS)algorithms the estimations of probability density function(pdf)of sources are fixed or can only switch between one sup-Gaussian and other sub-Gaussian model,they may not be efficient to separate sources with different distributions.So to solve the problem of pdf mismatch and the separation of hybrid mixture in BSS,the generalized Gaussian model(GGM)is introduced to model the pdf of the sources since it can provide a general structure of univariate distributions.Its great advantage is that only one parameter needs to be determined in modeling the pdf of different sources,so it is less complex than Gaussian mixture model.By using maximum likelihood(ML)approach,the convergence of the proposed algorithm is improved.The computer simulations show that it is more efficient and valid than conventional methods with fixed pdf estimation.展开更多
A novel estimation algorithm is introduced to handle the popular undersea problem called torpedo tracking with angle-only measurements with a better approach compared to the existing filters. The new algorithm produce...A novel estimation algorithm is introduced to handle the popular undersea problem called torpedo tracking with angle-only measurements with a better approach compared to the existing filters. The new algorithm produces a better estimate from the outputs produced by the traditional nonlinear approaches with the assistance of simple noise minimizers like maximum likelihood filter or any other algorithm which belongs to their family. The introduced method is extended to the higher version in two ways. The first approach extracts a better estimate and covariance by enhancing the count of the intermediate filters, while the second approach accepts more inputs so as to attain improved performance without enhancement of the intermediate filter count. The ideal choice of the placement of towed array sensors to improve the performance of the proposed method further is suggested as the one where the line of sight and the towed array are perpendicular. The results could get even better by moving the ownship in the direction of reducing range. All the results are verified in the MATLAB environment.展开更多
The probability distribution analysis is per-formed for multi-timescale aerosol optical depth (AOD) using AErosol RObotic NETwork (AERONET) level 2.0 data.The maximum likelihood estimation is employed to determine the...The probability distribution analysis is per-formed for multi-timescale aerosol optical depth (AOD) using AErosol RObotic NETwork (AERONET) level 2.0 data.The maximum likelihood estimation is employed to determine the best-fit probability density function (PDF),and the statement that the fitting Weibull distribution will be light-tailed is proved true for these AOD samples.The best-fit PDF results for multi-site data show that the PDF of AOD samples with longer timescale in most sites tends to be stably represented by lognormal distribution,while Weibull distribution is a better fit for AOD samples with short timescales.The reason for this difference is ana-lyzed through tail characteristics of the two distributions,and an indicator for the selection between Weibull and lognormal distributions is suggested and validated.The result of this research is helpful for determining the most accurate AOD statistics for a given site and a given time-scale and for validating the retrieved AOD through its PDF.展开更多
The span of coordinate time series affects the determination of an optimal noise model. We analyzed position data recorded for 10 continuous Global Positioning System (GPS) sites from 1998.0 to mid-2009 on the Austr...The span of coordinate time series affects the determination of an optimal noise model. We analyzed position data recorded for 10 continuous Global Positioning System (GPS) sites from 1998.0 to mid-2009 on the Australian Plate to estimate the best noise model and thereafter obtain the true uncertainties of the velocity, employing the maximum likelihood estimation (MLE) method. MLE was employed to analyze the data in four ways. In the first two analyses, the noise was assumed to be a combination of flicker noise and white noise for the raw time series and spatially filtered time series. In the final two analyses, the spectral indices and amplitudes were simultaneously estimated for a power law noise plus white noise model for the raw time series and spatially filtered time series. We conclude that the noise model of GPS time series in Australia can be best described as the combination of flicker noise and white noise. Velocity uncertainties fall below -0.2 mm/yr when the time span exceeds -9.5 years. A comparison of noise amplitudes and maximum likelihood estimation values between the raw and spatially filtered time series suggests that traditional spatial filtering to remove common-mode errors might not be applicable to the raw time series of this region.展开更多
基金The National Natural Science Foundation of China(No.61105048,60972165)the Doctoral Fund of Ministry of Education of China(No.20110092120034)+2 种基金the Natural Science Foundation of Jiangsu Province(No.BK2010240)the Technology Foundation for Selected Overseas Chinese Scholar,Ministry of Human Resources and Social Security of China(No.6722000008)the Open Fund of Jiangsu Province Key Laboratory for Remote Measuring and Control(No.YCCK201005)
文摘An improved Gaussian mixture model (GMM)- based clustering method is proposed for the difficult case where the true distribution of data is against the assumed GMM. First, an improved model selection criterion, the completed likelihood minimum message length criterion, is derived. It can measure both the goodness-of-fit of the candidate GMM to the data and the goodness-of-partition of the data. Secondly, by utilizing the proposed criterion as the clustering objective function, an improved expectation- maximization (EM) algorithm is developed, which can avoid poor local optimal solutions compared to the standard EM algorithm for estimating the model parameters. The experimental results demonstrate that the proposed method can rectify the over-fitting tendency of representative GMM-based clustering approaches and can robustly provide more accurate clustering results.
文摘Since in most blind source separation(BSS)algorithms the estimations of probability density function(pdf)of sources are fixed or can only switch between one sup-Gaussian and other sub-Gaussian model,they may not be efficient to separate sources with different distributions.So to solve the problem of pdf mismatch and the separation of hybrid mixture in BSS,the generalized Gaussian model(GGM)is introduced to model the pdf of the sources since it can provide a general structure of univariate distributions.Its great advantage is that only one parameter needs to be determined in modeling the pdf of different sources,so it is less complex than Gaussian mixture model.By using maximum likelihood(ML)approach,the convergence of the proposed algorithm is improved.The computer simulations show that it is more efficient and valid than conventional methods with fixed pdf estimation.
文摘A novel estimation algorithm is introduced to handle the popular undersea problem called torpedo tracking with angle-only measurements with a better approach compared to the existing filters. The new algorithm produces a better estimate from the outputs produced by the traditional nonlinear approaches with the assistance of simple noise minimizers like maximum likelihood filter or any other algorithm which belongs to their family. The introduced method is extended to the higher version in two ways. The first approach extracts a better estimate and covariance by enhancing the count of the intermediate filters, while the second approach accepts more inputs so as to attain improved performance without enhancement of the intermediate filter count. The ideal choice of the placement of towed array sensors to improve the performance of the proposed method further is suggested as the one where the line of sight and the towed array are perpendicular. The results could get even better by moving the ownship in the direction of reducing range. All the results are verified in the MATLAB environment.
基金supported by funds from the Chinese Global Change Research Program (Grant No.2010CB951804)the National Natural Science Foundation of China (Grant No.40830103)the China Postdoctoral Science Foundation (Grant No.20100480436)
文摘The probability distribution analysis is per-formed for multi-timescale aerosol optical depth (AOD) using AErosol RObotic NETwork (AERONET) level 2.0 data.The maximum likelihood estimation is employed to determine the best-fit probability density function (PDF),and the statement that the fitting Weibull distribution will be light-tailed is proved true for these AOD samples.The best-fit PDF results for multi-site data show that the PDF of AOD samples with longer timescale in most sites tends to be stably represented by lognormal distribution,while Weibull distribution is a better fit for AOD samples with short timescales.The reason for this difference is ana-lyzed through tail characteristics of the two distributions,and an indicator for the selection between Weibull and lognormal distributions is suggested and validated.The result of this research is helpful for determining the most accurate AOD statistics for a given site and a given time-scale and for validating the retrieved AOD through its PDF.
基金supported by the National Natural Science Foundation of China(Grant Nos.41304007,41074022)the Chinese Universities Scientific Fund(Grant No.121103)+1 种基金the Surveying and Mapping Basic Research Program of the National Administration of Surveying,Mapping and Geoinformation(Grant No.11-02-02)the China Scholarship Council and College of Science of the University of Nevada,Reno
文摘The span of coordinate time series affects the determination of an optimal noise model. We analyzed position data recorded for 10 continuous Global Positioning System (GPS) sites from 1998.0 to mid-2009 on the Australian Plate to estimate the best noise model and thereafter obtain the true uncertainties of the velocity, employing the maximum likelihood estimation (MLE) method. MLE was employed to analyze the data in four ways. In the first two analyses, the noise was assumed to be a combination of flicker noise and white noise for the raw time series and spatially filtered time series. In the final two analyses, the spectral indices and amplitudes were simultaneously estimated for a power law noise plus white noise model for the raw time series and spatially filtered time series. We conclude that the noise model of GPS time series in Australia can be best described as the combination of flicker noise and white noise. Velocity uncertainties fall below -0.2 mm/yr when the time span exceeds -9.5 years. A comparison of noise amplitudes and maximum likelihood estimation values between the raw and spatially filtered time series suggests that traditional spatial filtering to remove common-mode errors might not be applicable to the raw time series of this region.