期刊文献+

使用PCA建立基于规则的组合分类器

Constructing Ensembles of Rule-based Classifiers Using PCA
下载PDF
导出
摘要 提出了一种使用基于规则的基分类器建立组合分类器的新方法PCARules。尽管新方法也采用基分类器预测的加权投票来决定待分类样本的类,但是为基分类器创建训练数据集的方法与bagging和boosting完全不同。该方法不是通过抽样为基分类器创建数据集,而是随机地将特征划分成K个子集,使用PCA得到每个子集的主成分,形成新的特征空间,并将所有训练数据映射到新的特征空间作为基分类器的训练集。在UCI机器学习库的30个随机选取的数据集上的实验表明:算法不仅能够显著提高基于规则的分类方法的分类性能,而且与bagging和boosting等传统组合方法相比,在大部分数据集上都具有更高的分类准确率。 A new method,called PCARules,is presented for constructing ensembles of rule-based classifiers. Although the class label of a sample to be classified is also determined by taking weighted vote among the predictions made by each base classifier,this method is very different from bagging and boosting in the way of creating the training data for a base classifier. Instead of creating a training data for each base classifier by sampling,this method splits the feature set into K subsets randomly,upon each of which principal component analysis (PCA) is applied to find the corresponding principal components. And then all principal components are put together to form a new feature space,into which all original training data are mapped to create the training set for a base classifier. Experiments carried on 30 benchmark datasets selected randomly from the UCI machine learning repository show that the method not only improves performance of rule-based classifiers significantly,but also achieves higher accuracy in most of data sets than traditional combining methods such as bagging and boosting.
出处 《计算机科学与探索》 CSCD 2010年第5期455-463,共9页 Journal of Frontiers of Computer Science and Technology
基金 国家自然科学基金No.60773048 国家"十一五"计划科技支撑课题子课题No.2006BAF01A00~~
关键词 组合分类器 特征提取 主成分分析 classifier ensemble feature extraction principal component analysis (PCA)
  • 相关文献

参考文献16

  • 1Roli F,Klittler J.Multiple classifier systems[C]//Proc of the 8th International Workshop on Multiple Classifier Systems,2009.
  • 2Haindl M,Kittler J,Roli F.Multiple classifier systems[C]//Proc of the 7th International Workshop on Multiple Classi-fier Systems,2007.
  • 3Oza N C.Multiple classifier systems[C]//Proc of the 6th International Workshop on Multiple Classifier Systems,2005.
  • 4Roli F.Multiple classifier systems[C]//Proc of the 5th International Workshop on Multiple Classifier Systems,2004.
  • 5Kuncheva L I.Combining pattern classifiers:Methods and algorithms[M].[S.l.]:John Wiley and Sons,2004.
  • 6Han J,Kamber M.Data mining:Concepts and techniques[M].[S.I.]:Morgan Kaufinann,2001.
  • 7Breiman L.Bagging predictors[J].Machine Learning,1996,24(2):123-140.
  • 8Quinlan J R.Bagging,boosting,and C4.5[C]//Proceed-ings of the 13th National Conference on Artificial Intel-ligence.[S.I.]:AAAI/MIT Press,1996:725-730.
  • 9Freund Y,Schapire R E.A decision-theoretic generaliza-tion of on-line learning and an application to boosting[J].J Computer and System Sciences,1997,55(1):119-139.
  • 10Rodriguez J J,Kuncheva L I,Alonso C J.Rotation forest:A new classifier ensemble method[J].IEEE Trans Pattern Anal Mach Intell,2006,28:1619-1630.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部