基于分布不完整数据选择性分类器被引量：1

Distribution-Based Selective Classifiers for Incomplete Data

下载PDF

导出

摘要通过删除数据集中的无关属性和冗余属性构建的选择性分类器可以有效地提高分类精度和效率.由于处理不完整数据的复杂性,已有的选择性分类器大都是针对完整数据的.然而,现实中的数据通常是不完整的并且包含许多冗余属性或无关属性.为解决这一问题,在构建的不完整数据分类器DBNB的基础上给出了一种有效的选择性分类器:SDBNB.在12个标准的不完整数据集上的实验结果显示,SDBNB的分类准确率比分类效果较好的选择性不完整数据分类器SNB和SRBC平均高出0.69%和0.58%,而其标准离差比SNB和SRBC平均低0.11和0.05.这表明SDBNB不仅有较高的分类准确率,而且分类效果更稳定. Selective classifiers are a kind of algorithms that can effectively improve the accuracy and efficiency of classification by deleting irrelevant or redundant attributes of a data set. Due to the complexity of processing incomplete data, however, most of them deal with complete data. Yet actual data are often incomplete and have many redundant or irrelevant attributes, a selective classifier for incomplete data （SDBNB）, which is based on a newly constructed Bayes classifier （DBNB）, is presented. Experiments results from twelve benchmark incomplete data sets show that the average accuracy of SDBNB is 0.69 percent and 0.58 percent higher than that of the effective selective classifiers： SNB and SRBC. Furthermore, its standard deviation is 0.11 and 0.05 lower than that of SNB and SRBC. This shows that not only SDBNB has higher accuracy, but also performs more stably as well.

作者陈景年黄厚宽杨莉萍田凤占

机构地区北京交通大学计算机与信息技术学院

出处《北京交通大学学报》 EI CAS CSCD 北大核心 2008年第2期26-29,共4页 JOURNAL OF BEIJING JIAOTONG UNIVERSITY

基金国家自然科学基金资助项目(6050301760673089)

关键词数据分类特征选择贝叶斯方法不完整数据 data classification feature selection Bayesian method incomplete data

分类号 TP181 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献10

1Langley P, Sage S. Induction of Selective Bayesian Classitiers[C] .Proc. of the 10th Conference on Uncertainty in Artificial Intelligence. Morgan Kaufmann, 1994:399 - 406.
2Singh M, Provan G M. Efficient Learning of Selective Bayesian Network Classifiers[C].Proc. of the 13th International Conference on Machine Learning. Morgan Kaufman, 1996.
3Quinlan J R. C4.5:Programs for Machine Learning[ M]. San Francisco: Morgan Kaufmann, 1993.
4Friedman N, Geiger D, Goldszmidt M. Bayesian Network Classifters[J]. Machine Learning, 1997,29(2 - 3) : 131 - 163.
5Little R J A, Rubin D B. Statistical Analysis with Missing Data[ M]. New York:Wiley, 1987.
6Ramoni M, Sebastiani P. Robust Bayes Classifiers[J]. Artificial Intelligence, 2001, 125(1 - 2) :209 - 226.
7Winston P H. Artificial Intelligence[ M]. M A: Addison-Wesley, 1992.
8陈景年,黄厚宽,田凤占,付树军.用于不完整数据的选择性贝叶斯分类器[J].计算机研究与发展,2007,44(8):1324-1330. 被引量：11
9Blake C, Keogh, E Merz C J. UCI Repository of Machine Learning Databases [ EB/OL ] (1998) [ 2007 ]. Department of Information and Computer Sciences, University of California, Irvine, http: // www. ics. uci. edu/-mlearn/MLRepository. html.
10Witten I H, Frank E. Data Mining: Practical Machine Learning Tools and Techniques[ M]. 2ed. San Francisco: Morgan Kaufmann, 2005.

二级参考文献17

1尚文倩,黄厚宽,刘玉玲,林永民,瞿有利,董红斌.文本分类中基于基尼指数的特征选择算法研究[J].计算机研究与发展,2006,43(10):1688-1694. 被引量：38
2J R Quinlan.C4.5:Programs for Machine Learning[M].San Francisco:Morgan Kaufmann,1993.
3R Kohavi,B Becker,D Sommerfield.Improving simple Bayes[C].In:M van Someren,G Widmer,eds.Poster Papers of the ECML-97.Prague:Charles University,1997.78-87.
4N Friedman,D Geiger,M Goldszmidt.Bayesian network classifiers[J].Machine Learning,1997,29(2-3):131-163.
5S L Lauritzen.The EM algorithm for graphical association models with missing data[J].Computational Statistics and Data Analysis,1995,19(2):191-201.
6S Russell,J Binder,D Koller,et al.Local learning in probabilistic networks with hidden variables[C].In:Proc of IJCAI-95.San Francisco:Morgan Kaufmann,1995.1146-1151.
7S Geman,D Geman.Stochastic relaxation,Gibbs distributions and the Bayesian restoration of images[J].IEEE Trans on Pattern Analysis and Machine Intelligence,1984,6(6):721-741.
8R J A Little,D B Rubin.Statistical Analysis with Missing Data[M].New York:Wiley,1987.
9D J Spiegelhalter,R G Cowell.Learning in probabilistic expert systems[C].In:J Bernardo,J Berger,A P Dawid,eds.Bayesian Statistics 4.Oxford:Oxford University Press,1992.447-466.
10M Ramoni,P Sebastiani.Robust Bayes classifiers[J].Artificial Intelligence,2001,125(1-2):209-226.

共引文献10

1赵文清.基于选择性贝叶斯分类器的变压器故障诊断[J].电工文摘,2011(5):34-37. 被引量：1
2蔡月红,朱倩,孙萍,程显毅.基于属性选择的半监督短文本分类算法[J].计算机应用,2010,30(4):1015-1018. 被引量：8
3赵文清.基于选择性贝叶斯分类器的变压器故障诊断[J].电力自动化设备,2011,31(2):44-47. 被引量：21
4陶永才,薛正元,石磊.基于MapReduce的贝叶斯垃圾邮件过滤机制[J].计算机应用,2011,31(9):2412-2416. 被引量：14
5许明英,尉永清,赵静.一种结合反馈信息的贝叶斯分类增量学习方法[J].计算机应用,2011,31(9):2530-2533. 被引量：5
6张亚萍,胡学钢,方振国,姜恩华.数据缺失条件下的贝叶斯优化算法[J].计算机工程与应用,2012,48(11):111-114. 被引量：3
7冷泳林,张清辰,鲁富宇.不完整数据的聚类研究[J].河南科学,2014,32(11):2259-2262.
8李凌霞,李冰冰,王建.物联网环境下智能应用的信息支持和决策技术研究[J].物联网技术,2017,7(10):70-73. 被引量：3
9程炜东,王洪亚,郭开彦.面向脏数据的贝叶斯统计建模研究[J].智能计算机与应用,2019,9(2):104-107. 被引量：1
10刘永裕,巩晓婷,方炜杰,傅仰耿.数据缺失的扩展置信规则库推理方法[J].计算机研究与发展,2022,59(3):661-673. 被引量：1

同被引文献10

1Langley P, Sage S. Induction of Selective Bayesian Classitiers[ C]// Proc. of the 10th Conference on Uncertainty in Artificial Intelligence. Morgan Kaufmann, 1994:399 - 406.
2Singh M, Provan G M. Efficient Learning of Selective Bayesian Network Classifiers[C]// Proe. of the 13th International Conference on Machine Learning. Morgan Kaufman, 1996:453 - 461.
3Quinlan J R. C4.5: Programs for Machine Learning[ M]. San Francisco, CA: Morgan Kaufmann, 1993.
4Kohavi R, Becker B, Sommerfield D. Improving Simple Bayes[C]// M. Van Someren, G. Widmer. Poster Papers of the ECML-97. Charles University, Prague, 1997: 78 - 87.
5Friedman N, Geiger D, Goldszmid M T. Bayesian Network Classifiers[J]. Machine Learning, 1997, 29(2/3): 131 - 163.
6Little R J A, Rubin D B. Statistical Analysis with Missing Data[ M]. New York:Wiley, 1987.
7Spiegelhalter D J, Cowell R G. Learning in Probabilistic Expert Systems[C]//Bernardo J, Berger J, Dawid A P, Smith A F M, Bayesian Statistics 4. Oxford University Press, Oxford, UK, 1992:447-466.
8Ramoni M, Sebastiani P. Robust Bayes Classifiers[J]. Artificial Intelligence, 2001, 125 (1/2) : 209 - 226.
9Witten I H, Frank E. Data Mining: Practical Machine Learning Tools and Techniques (Second Edition) [ M ]. Morgan Kaufmann, 2005.
10Blake C, Keogh, E Merz C J. UCI Repository of Machine Learning Databases [ OB/OL ]. (1998). [ 2008 ]. Department of Information andComputer Sciences, University of California, Irvine, http://www. ics. uci. edu/ mlearn/M LRepository. html.

引证文献1

1陈景年,黄厚宽,徐力,伊传环.利用增益率构建混合型选择性不完整数据分类器[J].北京交通大学学报,2009,33(5):117-120. 被引量：2

二级引证文献2

1吕靖,舒礼莲.基于AdaBoost的不完整数据的信息熵分类算法[J].计算机与现代化,2013(9):31-34. 被引量：3
2赵姝,吕靖,张燕平,张以文.不完整数据集的信息熵集成分类算法[J].模式识别与人工智能,2014,27(3):193-198. 被引量：6

1陈景年,黄厚宽,田凤占,付树军.用于不完整数据的选择性贝叶斯分类器[J].计算机研究与发展,2007,44(8):1324-1330. 被引量：11
2陈景年,黄厚宽,田凤占,薛小平.一种基于特征选择的不完整数据分类方法[J].计算机工程与应用,2007,43(31):23-24. 被引量：2
3向卓元,张蕾.粗糙集理论对C4．5算法的优化研究[J].电脑知识与技术,2012,8(6):3782-3785. 被引量：1
4牛晓博,赵虎,陈新来.基于信息熵和概率神经网络的海战场目标识别[J].电光与控制,2010,17(4):83-86. 被引量：3
5路松峰,胡波.基于核属性依赖的属性约简算法研究[J].计算机仿真,2007,24(4):69-71. 被引量：2
6苏鹏,李玉忱,刘慧.一种新的加权k-最临近分类方法[J].计算机工程与应用,2003,39(35):183-185. 被引量：3
7张静,王建民,何华灿.基于属性相关性的属性约简新方法[J].计算机工程与应用,2005,41(28):55-57. 被引量：18
8赵文清.基于选择性贝叶斯分类器的变压器故障诊断[J].电力自动化设备,2011,31(2):44-47. 被引量：21
9闫玲博,魏维,王艳.一种改进的基于标准离差的相关反馈图像检索方法[J].微计算机应用,2008,29(12):98-102.
10陈莉,焦李成.基于关系代数的关联规则挖掘算法[J].西北大学学报（自然科学版）,2005,35(6):691-694. 被引量：16

北京交通大学学报

2008年第2期

浏览历史

内容加载中请稍等...

基于分布不完整数据选择性分类器被引量：1

参考文献10

二级参考文献17

共引文献10

同被引文献10

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

基于分布不完整数据选择性分类器 被引量：1

参考文献10

二级参考文献17

共引文献10

同被引文献10

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

基于分布不完整数据选择性分类器被引量：1