期刊文献+

基于EM和贝叶斯网络的丢失数据填充算法 被引量:21

Imputation algorithm of missing values based on EM and Bayesian network
下载PDF
导出
摘要 实际应用中存在大量的丢失数据的数据集,对丢失数据的处理已成为目前分类领域的研究热点。分析和比较了几种通用的丢失数据填充算法,并提出一种新的基于EM和贝叶斯网络的丢失数据填充算法。算法利用朴素贝叶斯估计出EM算法初值,然后将EM和贝叶斯网络结合进行迭代确定最终更新器,同时得到填充后的完整数据集。实验结果表明,与经典填充算法相比,新算法具有更高的分类准确率,且节省了大量开销。 Dataset with missing values is quite common in real applications,and handling missing values has become a research hot issue in the classification field.This paper analyzes and compares several popular missing values imputation algorithms,and has proposed a novel imputation algorithm for missing values based on EM(Expectation Maximization) and Bayesian network.In this algorithm,the Nave Bayesian is employed to estimate the initial values of EM algorithm,and the EM inspired approach for filling up missing values is incorporated to Bayesian network learning with the objective of ensuring the ultimate updater.As a result,the complete dataset is got after imputation.Experiment results demonstrate that the proposed algorithm enables much higher classification accuracy and lower cost when compared with other classical imputation algorithm.
出处 《计算机工程与应用》 CSCD 北大核心 2010年第5期123-125,共3页 Computer Engineering and Applications
基金 国家杰出青年基金No.60425310~~
关键词 丢失数据填充 参数更新器 最大期望值算法(EM) 贝叶斯网络 missing values imputation parameter updater Expectation-Maximization(EM) Bayesian network
  • 相关文献

参考文献8

  • 1Lakshminarayan K,Harp S A,Samad T.Imputation of missing data in industrial databases[J].Applied Intelligence,1999,11:259-275.
  • 2Li K H.Imputation using Markov chains[J].Journal of Statisticalt Comput Simul,1988,30:57-79.
  • 3Little R J,Rubin D B.Statistical analysis with missing data[M].[S.l] :John Wiley and Sons,1987.
  • 4Gustavo E A,Batista P A,Monard M C.An analysis of four missing data treatment methods for supervised learning[J].Applied Artificial Intelligence,2003,17(5/6):519-533.
  • 5Huang C,Lee H.A grey-based nearegt neighbor approach for missing attribute value prediction[J].Applied Artificial Intelligence,2004,20(3):239-252.
  • 6Hruschka E R,Jr,Ebecken N F F.Missing values prediction with K2[J].Intelligent Data Allalysis,2002,6(6):557-566.
  • 7Petersen B,Winther O,Hansen L K.On the slow convergence of EM and VBEM in low-noise linear models[J].Neural Computation,2005,17(9):1921-1926.
  • 8Stanimirova I,Duszykowski M,Walczak B.Dealing with missing values and outliers in principle component analysis[J].Tanlonta,2007,72:172-178.

同被引文献127

引证文献21

二级引证文献77

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部