期刊文献+

Parallel Expectation-Maximization Algorithm for Large Databases

Parallel Expectation-Maximization Algorithm for Large Databases
下载PDF
导出
摘要 A new parallel expectation-maximization (EM) algorithm is proposed for large databases. The purpose of the algorithm is to accelerate the operation of the EM algorithm. As a well-known algorithm for estimation in generic statistical problems, the EM algorithm has been widely used in many domains. But it often requires significant computational resources. So it is needed to develop more elaborate methods to adapt the databases to a large number of records or large dimensionality. The parallel EM algorithm is based on partial Esteps which has the standard convergence guarantee of EM. The algorithm utilizes fully the advantage of parallel computation. It was confirmed that the algorithm obtains about 2.6 speedups in contrast with the standard EM algorithm through its application to large databases. The running time will decrease near linearly when the number of processors increasing. A new parallel expectation-maximization (EM) algorithm is proposed for large databases. The purpose of the algorithm is to accelerate the operation of the EM algorithm. As a well-known algorithm for estimation in generic statistical problems, the EM algorithm has been widely used in many domains. But it often requires significant computational resources. So it is needed to develop more elaborate methods to adapt the databases to a large number of records or large dimensionality. The parallel EM algorithm is based on partial Esteps which has the standard convergence guarantee of EM. The algorithm utilizes fully the advantage of parallel computation. It was confirmed that the algorithm obtains about 2.6 speedups in contrast with the standard EM algorithm through its application to large databases. The running time will decrease near linearly when the number of processors increasing.
出处 《Journal of Beijing Institute of Technology》 EI CAS 2006年第4期420-424,共5页 北京理工大学学报(英文版)
基金 the National Natural Science Foundation of China(79990584)
关键词 expectation-maximization (EM) algorithm incremental EM lazy EM parallel EM expectation-maximization (EM) algorithm incremental EM lazy EM parallel EM
  • 相关文献

参考文献9

  • 1Bo Thiesson,Christopher Meek,David Heckerman.Accelerating EM for Large Databases[J].Machine Learning.2001(3)
  • 2Moor A W.Very fast EM-based mixture model cluster-ing using multi-resolution kd-trees[].Proceedings of the NIPSConference.1999
  • 3Kruengkrai C,Jaruskulchai C.A parallel learning algo-rithmfor text classification[].KDD-The Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.2002
  • 4Larranaga P,Poza M,Yurramendi Y,et al.Structure learning of Bayesian networks by genetic algorithms:A performance analysis of control parameters[].IEEE Transactions on Pattern Analysis and Machine Intelligence.1996
  • 5Neal R,Hinton G.Aviewof the EMalgorithmthat jus-tifies incremental,sparse,and other variants[].Learning in Graphical Models.1999
  • 6Dempster A P,Laird N M,Rubin D B.Maxi mum-like-lihoodfromincomplete data via the EM algorithm[].Journal of the Royal Statistical SocietySeries B(Methodological).1977
  • 7Bil mes J A.Agentle tutorial of the EMalgorithmandits application to parameter esti mation for Gaussian mixture and hidden Markov modals. ICSI-TR-97-021 . 1997
  • 8Kanungo T,Mount D M,Netanyahu N,et al.An effi-cient k-means clustering algorithm:Analysis and i mple-mentation[].IEEE Transactions on Pattern Analysis and Machine Intelligence.2002
  • 9Verbeek J J,Vlassis N,Nunnink J R J.A variational EMalgorithmfor large-scale mixture modeling[].Procth Annual Conf of the Advanced School for Computing and I maging.2003

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部