期刊文献+

基于多值分解和多类标学习的分类框架设计

Framework of Classification Based on Multi-Value Decomposition and Multi-Label Learning
下载PDF
导出
摘要 多值多类标的数据分类是研究一个样本不但同时属于多个类别,而且在某些属性下也可能存在多个取值的问题。提出了一种结合多值分解和多类标学习的多值多类标分类框架(MDML),采用4种不同的多值分解策略,将问题转化为多类标问题,然后利用3种经典的多类标算法进行学习。实验结果表明,MDML与已有的多值多类标决策树算法相比,有效地提高了分类的性能,而且不同的组合方法适用于不同特点的数据集。 Classification of multi-valued and multi-labeled data is about a sample which is not only associated with a set of labels,but also with several values that include some attributes.This paper proposes a multi-valued and multi-labeled learning framework that combines multi-value decomposition with multi-label learning(MDML),using four strategies to deal with multi-valued attributes and three classical,multi-label algorithms to learn.Experimental results demonstrate that MDML significantly outperforms the decision tree based method.Meanwhile,combined methods can be applied to various types of datasets.
出处 《计算机系统应用》 2010年第10期187-190,22,共5页 Computer Systems & Applications
关键词 分类 多值属性分解 多类标数据 数据转化 classification multi-label data multi-valued attribute decomposition data transformation
  • 相关文献

参考文献7

  • 1赵蕊,李宏.一种多值属性和多类标数据的决策树算法[J].计算机工程,2007,33(13):87-89. 被引量:1
  • 2Chou S, Hsu C. MMDT: a multi-valued and multi-labeled decision tree classifier for data mining. Expert Systems with Applications, 2005,28(2):799- 812.
  • 3Tsoumakas G, Katakis I. Multi-Label Classification: An Overview. International Journal of Data Warehousing and Mining, 2007,3(3): 1 - 13.
  • 4Clare A, King RD. Knowledge Discovery in Multilabel Phenotype Data. Lecture Notes in Computer Science Vol. 2168, Springer, Berlin 2001.
  • 5Eisseeff A, Weston J. A kernel method for multilabelled classification. Dietterich TG, Becker S, Ghahramani Z, Editors, Advances in Neural Information Processing Systems 14, MIT Press, Cambridge, MA, 2002:681 - 687.
  • 6Zhang ML, Zhou ZH. ML-KNN: A lazy learning approach to multi-label learning. Pattern recognition, 2007,40(7):2038 - 2048.
  • 7Witten IH, Frank E. Data Mining: Practical machine learning tools and techniques. Morgan Kaufinann, San Francisco, 2005.

二级参考文献9

  • 1Han J W Kamber M 范明 孟小峰译.数据挖掘概念与技术[M].北京:机械工业出版杜,2001.147-158.
  • 2Witten I H,Frank E.Data Mining:Practical Machine Learning Tools and Techniques with Java Implementations[M].Morgan Kaufman,2003.
  • 3Agrawal R,Ghosh S,Imielinski T,et al.An Interval Classifier for Database Mining Applications[C]//Proceedings of the 18th International Conference on Very Large Databases.2005:560-573.
  • 4Ruggieri S.Efficient C4.5[J].IEEE Transactions on Knowledge and Data Engineering,2002,14(2):438-444.
  • 5Wang H,Zaniolo C.CMP:A Fast Decision Tree Classifier Using Multivariate Predictions[C]//Proceedings of the 16th International Conference on Data Engineering.2002:449-460.
  • 6Chen Y.Hsu C.Constructing a Multi-valued and Multi-labeled Decision Tree[J].Expert Systems with Applications,2003,25(2):199-209.
  • 7Shafer J C,Agrawal R,Mehta M.SPRINT:A Scalable Parallel Classifier for Data Mining[C]//Proceedings of the 22th International Conference on Very Large Databases.1996.
  • 8Mantaras R L D.A Distance-based Attribute Selection Measure for Decision Tree induction[J].Machine Learning,1991,6(1):81-92.
  • 9Chou S,Hsu C.MMDT:a Multi-valued and Multi-labeled Decision Tree Classifier for Data Mining[J].Expert Systems with Applications,2005,28(2):799-812.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部