Hybrid Bayesian estimation tree learning with discrete and fuzzy labels 被引量：2

Hybrid Bayesian estimation tree learning with discrete and fuzzy labels

导出

摘要 Classical decision tree model is one of the classical machine learning models for its simplicity and effectiveness in applications. However, compared to the DT model, probability estimation trees （PETs） give a better estimation on class probability. In order to get a good probability estimation, we usually need large trees which are not desirable with respect to model transparency. Linguistic decision tree （LDT） is a PET model based on label semantics. Fuzzy labels are used for building the tree and each branch is associated with a probability distribution over classes. If there is no overlap between neighboring fuzzy labels, these fuzzy labels then become discrete labels and a LDT with discrete labels becomes a special case of the PET model. In this paper, two hybrid models by combining the naive Bayes classifier and PETs are proposed in order to build a model with good performance without losing too much transparency. The first model uses naive Bayes estimation given a PET, and the second model uses a set of small-sized PETs as estimators by assuming the independence between these trees. Empirical studies on discrete and fuzzy labels show that the first model outperforms the PET model at shallow depth, and the second model is equivalent to the naive Bayes and PET. Classical decision tree model is one of the classical machine learning models for its simplicity and effectiveness in applications. However, compared to the DT model, probability estimation trees （PETs） give a better estimation on class probability. In order to get a good probability estimation, we usually need large trees which are not desirable with respect to model transparency. Linguistic decision tree （LDT） is a PET model based on label semantics. Fuzzy labels are used for building the tree and each branch is associated with a probability distribution over classes. If there is no overlap between neighboring fuzzy labels, these fuzzy labels then become discrete labels and a LDT with discrete labels becomes a special case of the PET model. In this paper, two hybrid models by combining the naive Bayes classifier and PETs are proposed in order to build a model with good performance without losing too much transparency. The first model uses naive Bayes estimation given a PET, and the second model uses a set of small-sized PETs as estimators by assuming the independence between these trees. Empirical studies on discrete and fuzzy labels show that the first model outperforms the PET model at shallow depth, and the second model is equivalent to the naive Bayes and PET.

作者 Zengchang QIN Tao WAN

机构地区 Intelligent Computing and Machine Learning Lab School of Biological Science and Medical Engineering Department of Biomedical Engineering

出处《Frontiers of Computer Science》 SCIE EI CSCD 2013年第6期852-863,共12页 中国计算机科学前沿（英文版）

关键词 fuzzy labels label semantics random set probability estimation tree mass assignment linguistic decision tree naive Bayes fuzzy labels, label semantics, random set, probability estimation tree, mass assignment, linguistic decision tree, naive Bayes

分类号 TP18 [自动化与计算机技术—控制理论与控制工程] O212.1 [理学—概率论与数理统计]

引文网络
相关文献

参考文献29

1Quinlan J R. Induction of decision trees[J].{H}Machine Learning,1986,(01):81-106.
2Olaru C,Wehenkel L. A complete fuzzy decision tree technique[J].{H}Fuzzy Sets and Systems,2003,(02):221-254.doi:10.1016/S0165-0114(03)00089-7.
3Quinlan J R. C4.5:programs for machine learning[M].{H}Morgan Kaufmann Publishers Inc,1993.
4Baldwin J,Lawry J,Martin T. Mass assignment fuzzy ID3 with applications[A].1997.278-294.
5Janikow C Z. Fuzzy decision trees:issues and methods[J].IEEE Transactions on Systems Man and Cybernetics Part B:Cybernetics,1998,(01):1-14.
6Huang Z,Gedeon T D,Nikravesh M. Pattern trees induction:a new machine learning method[J].{H}IEEE Transactions on Fuzzy Systems,2008,(04):958-970.
7Qin B,Xia Y,Li F. Dtu:a decision tree for uncertain data[J].Advances in Knowledge Discovery and Data Mining,2009.4-15.
8Provost F,Domingos P. Tree induction for probability-based ranking[J].{H}Machine Learning,2003,(03):199-215.doi:10.1023/A:1024099825458.
9Qin Z,Lawry J. Decision tree learning with fuzzy labels[J].{H}Information Sciences,2005,(01):91-129.
10Qin Z,Lawry J. Prediction trees using linguistic modelling[A].2005.

同被引文献6

1Liangxiao Jiang (1) ljiang@cug.edu.cn.Learning random forests for ranking[J].Frontiers of Computer Science,2011,5(1):79-86. 被引量：2
2Xiaochen LI,Wenji MAO,Daniel ZENG.Forecasting complex group behavior via multiple plan recognition[J].Frontiers of Computer Science,2012,6(1):102-110. 被引量：2
3Kuo-Wei HSU,Jaideep SRIVASTAVA.Improving bagging performance through multi-algorithm ensembles[J].Frontiers of Computer Science,2012,6(5):498-512. 被引量：2
4Anca Maria IVANESCU Marc WICHTERICH Christian BEECKS Thomas SEIDL.The ClasSi coefficient for the evaluation of ranking quality in the presence of class similarities[J].Frontiers of Computer Science,2012,6(5):568-580. 被引量：1
5Cheqing JIN,Jingwei ZHANG,Aoying ZHOU.Continuous ranking on uncertain streams[J].Frontiers of Computer Science,2012,6(6):686-699. 被引量：3
6Lianyin JIA,Jianqing XI,Mengjuan LI,Yong LIU,Decheng MIAO.ETI： an efficient index for set similarity queries[J].Frontiers of Computer Science,2012,6(6):700-712. 被引量：2

引证文献2

1Hebah ELGIBREEN,Mehmet Sabih AKSOY.RULES-IT： incremental transfer learning with RULES family[J].Frontiers of Computer Science,2014,8(4):537-562.
2Pushpinder SINGH.Some new distance measures for type-2 fuzzy sets and distance measure based ranking for group decision making problems[J].Frontiers of Computer Science,2014,8(5):741-752. 被引量：2

二级引证文献2

1Sukhveer SINGH,Harish GARG.Comments on ＂Some new distance measures for type-2 fuzzy sets and distance measure based ranking for group decision making problems＂[J].Frontiers of Computer Science,2018,12(2):396-400. 被引量：1
2GENG Juan-juan,YE Wan-hong,ZHANG Ju,Xu Dong-sheng.Entropy and Similarity Measure for T2SVNSs and Its Application[J].Chinese Quarterly Journal of Mathematics,2021,36(2):160-175.

1Shunyi Zhao,Fei Liu.Bayesian estimation for nonlinear stochastic hybrid systems with state dependent transitions[J].Journal of Systems Engineering and Electronics,2012,23(2):242-249.
2Myungjin Cho.Three-dimensional color photon counting microscopy using Bayesian estimation with adaptive priori information[J].Chinese Optics Letters,2015,13(7):36-39.
3代磊,马卫东,王凌楠,马建国.基于权重的朴素贝叶斯分类器设计与实现[J].情报理论与实践,2008,31(3):440-442. 被引量：9
4李楚进,付泽正.对朴素贝叶斯分类器的改进[J].统计与决策,2016,32(21):9-11. 被引量：11
5边平勇,石永奎,张序萍.基于贝叶斯分类器的煤与瓦斯突出强度预测研究[J].佳木斯大学学报（自然科学版）,2013,31(6):890-894. 被引量：7
6卢锦玲,李洪伟,刘海军.基于集成贝叶斯分类器的暂态稳定评估方法研究[J].华北电力大学学报（自然科学版）,2010,37(3):14-20. 被引量：2
7华锐,梁娜.特征加权朴素贝叶斯分类器在小样本中的应用[J].统计与决策,2012,28(23):69-71. 被引量：4
8冯成进.THE PROOF OF BODENDIEK'S CON-JECTURE ON GRACEFUL GRAPHS[J].Chinese Science Bulletin,1983,28(9):1152-1155.
9Joo Manuel Martins Casaca,Pedro Jorge Bele Mateus,Joeo de Jesus Isidoro Coelho.Bayesian Estimation in Dam Monitoring Networks[J].Journal of Civil Engineering and Architecture,2011,5(2):185-190. 被引量：1
10包丽莉.基于数据挖掘的旅客运输量分析[J].天水师范学院学报,2016,36(5):1-4. 被引量：1

Frontiers of Computer Science

2013年第6期

浏览历史

内容加载中请稍等...

Hybrid Bayesian estimation tree learning with discrete and fuzzy labels 被引量：2

参考文献29

同被引文献6

引证文献2

二级引证文献2

相关作者

相关机构

相关主题

浏览历史