摘要
数据挖掘经常会碰到缺失数据。本文结合回归和决策树的优点提出一种混合算法能较好地处理文本和连续性数据,同时考虑到当文本型数据类别很大时,决策树处理方法效果不佳,提出了一种C4.5的改进算法。
In this paper,by combining advantages of regression and decision tree,we provide a hybrid algorithm which can process data of text type and continuous data. Also an improved algorithm based on C4.5 is offered.
出处
《数字技术与应用》
2010年第9期24-24,26,共2页
Digital Technology & Application