期刊文献+

基于不完全数据的TAN学习算法 被引量:1

Learning TAN from incomplete data
下载PDF
导出
摘要 TAN算法是一种针对复杂数据且在实际中具有极强的学习能力的有效算法,它已被广泛应用于数据挖掘、机器学习和模式识别领域。由于现实世界中的数据大多是不完全数据,研究了怎样使TAN有效地从不完全数据中学习。首先,用一种有效的方法直接从不完全数据中估计条件互信息,然后应用估计条件互信息法去扩展基本的TAN算法来处理不相关数据,最后实验比较了扩展的TAN算法和基本的TAN算法。实验结果表明,在大多数不完全数据集合上扩展的TAN算法精确性明显高于基本的TAN算法。虽然扩展的TAN算法时间复杂度高于基本的TAN算法,但在可接受范围之内。此估计条件互信息的方法能够容易地和其它技术相结合来进一步提高TAN算法的性能。 TAN is a good trade-off between the model complexity and learn ability in practice,which has been widely used in data mining,machine learning and pattern recognition etc.Sinee there are few complete datasets in real-world,the paper develops research on how to efficiently learn TAN from incomplete data.Firstly an efficient method that could estimate conditional Mutual Information directly from incomplete data is presented.And then the basic TAN learning algorithm is extended to incomplete data using the conditional Mutual Information estimation method.Finally,experiments are carried out to evaluate the extended TAN and compare it with basic TAN.The experimental results show that the accuracy of the extended TAN is much higher than that of basic TAN on most of the incomplete datasets.Despite more time consumption of the extended TAN compared with basic TAN,it is still acceptable.The conditional Mutual Information estimation method can be easily combined with other techniques to improve TAN further.
出处 《计算机工程与应用》 CSCD 北大核心 2007年第36期181-184,共4页 Computer Engineering and Applications
关键词 TAN 学习 不完全数据 条件互信息 TAN learning incomplete data conditional mutual information
  • 相关文献

参考文献12

  • 1Pazzani M J.Searching for dependencies in Bayesian classifiers[M]// Learning from data:artificial intelligence and statistics V.New York : Springer-Verlag, 1996, 239-248.
  • 2Domingos P,Pazzani M.On the optimality of the simple Bayesian classifier under zero-one loss [J].Machine Learning, 1997,29(2/ 3 ) : 103-130.
  • 3Friedman D G N,Goldszmidt M.Bayesian network classifiers[J].Machine Learning, 1997,29:131-161.
  • 4Keogh E J,Pazzani M J.Learning augmented Bayesian classifiers:a comparison of distribution-based and classification-based approaches[C]//Proceedings of the 7th International Workshop on AI and Statistics, 1999 , 225-230.
  • 5Shi G W H,Wang Z,Huang H.A new restricted Bayesian network classifier[C]//Proceedings of the 7th Pacific-Asia Conference on Knowledge Discovery and Data Mining,2003,501-512.
  • 6Wang G I W Z,Zheng F.Selective augmented Bayesian network classifiers based on rough set theory[C]//Proceedings of the 8th Pacific-Asia Conference on Knowledge Discovery and Data Mining, 2004,161-165.
  • 7Quinlan J R.C4.5: Programs for machine learning[M].San Francisco, CA : Morgan Kaufmann, 1993.
  • 8Kohavi B B R,Sommerfield D.Improving simple bayes[C]//van Someren M,Widmer G.Proceedings of ECML-97,1997,78-87.
  • 9Ramoni M,Sebastiani P.Robust bayes classifiers[J].Artificial Intelligence, 2001,125 : 209-226.
  • 10Singh M.Learning Bayesian networks from incomplete data[C]//the 14th National Conf on Artificial Intelligence, 1997.

同被引文献8

引证文献1

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部