一种改进的CAIM算法被引量：1

Modified Algorithm of CAIM

下载PDF

导出

摘要在CAIM算法中,离散判别式仅考虑了区间中最多的类与属性间的依赖度,使离散化过度而导致结果不精确。基于此,提出对CAIM的改进算法,该算法考虑到按属性重要性从小到大顺序进行离散,同时根据粗糙集理论提出条件属性可分辨率概念,与近似精度同时控制信息表最终的离散程度,有效解决了离散化过度问题。实验通过C4.5和支持向量机分别对离散化后的数据进行识别和分类预测,结果证明了该算法的有效性。 In Class-Attribute Interdependency Maximization（CAIM） algorithm, discretization criterion only accounts for the trend of maximizing the number of values belonging to a leading class within each interval. The disadvantage makes CAIM generate irrational discrete results and further leads to the decrease of predictive accuracy of a classifier. This paper proposes a modified algorithm of CALM. With the algorithm, the importance of attributes is adopted in discretization process, and a concept of attribute discernibility rate is proposed based on rough set. Both attribute discernibility rate and approximate quality are used for discretization intervals, which effectively resolve the problem of over-discretization. By using C4.5 and SVM, experiments are performed respectively with the results of discreted data, which show that the presented algorithm is effective.

作者李慧闫德勤张迎春

机构地区辽宁师范大学计算机与信息技术学院

出处《计算机工程》 CAS CSCD 北大核心 2010年第4期77-78,81,共3页 Computer Engineering

基金国家自然科学基金资助项目(60372071) 中国科学院自动化研究所复杂系统与智能科学重点实验室开放课题基金资助项目(20070101) 辽宁省教育厅高等学校科学研究基金资助项目(2008344) 大连市科技局科技计划基金资助项目(2007A10GX117)

关键词连续属性离散化粗糙集属性可分辨率 discretization of continuous attributes rough set attribute discernibility rate

分类号 TP18 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献4

1Kurgan L A, Cios K J. CAIM Discretization Algorithm[J]. IEEE Transactions on Knowledge and Data Engineering, 2004, 16(2): 145-153.
2徐燕,怀进鹏,王兆其.基于区分能力大小的启发式约简算法及其应用[J].计算机学报,2003,26(1):97-103. 被引量：39
3Ching J Y, Wong A K C, Chan K C C. Class-dependent Discretization for Inductive Learning from Continuous and Mixed-mode Data[J]. IEEE Trans. on Pattern Analysis and Machine Intelligence, 1995, 17(7): 641-651.
4李国正,王猛.支持向量机导论[M].北京:电子工业出版社,2000.

二级参考文献10

1[5]Starzyk J, Nelson D E, Sturtz K. Reducts. A mathematical foundation for improved reduct generation in information systems. Journal of Knowledge and Information Systems, 2000, 2(2):131～146
2[6]Bazan J G, Skowron A, Synak P. Dynamic reducts as a tool for extracting laws from decisions tables. In: Ras Z W, Zemankiva M eds. Methodologies for Intelligent Systems. Berlin: Springer-Verlag,1994. 346～355
3[7]Ziarko W. Variable precision rough sets model. Journal of Computer and Systems Sciences, 1993, 46(1):39～59
4[8]Pawlak Z. Grzymala-Busse J, Slowinski R etal. Rough sets.Communications of the ACM, 1995, 38(11): 89～95
5[11]Ying Wu, Thomas S Huang. Hand moeling, analysis, and recognition. IEEE Signal Processing Magazine, 2001(5):51～60
6[12]Lin J, Wu Y, Huang T S. Modeling human hand constraint. In: Proceedings of Workshop on Human Motion. Austin, Texas USA,2000. 121～126
7[1]Pawlak Z. Rough sets. International Journal of Computer and Information Science, 1982, 11(5): 341～356
8[2]Wong S K M, Ziarko W. Optimal decision rules in decision table. Bulletin of Polish Academy of Sciences, 1985,33(11～12):693～696
9[3]Hu Xiao-Hua. Knowledge discovery in databases:an attrbute oriented rough set approach[Ph D dissertation]. University of Regina, Regina, Canada,1995
10[4]Starzyk J, Nelson D E, Sturtz K. Reducts in composed information systems. Bulletin of International Rough Set Society,1999,3(1～2):19～22

共引文献40

1徐燕,怀进鹏,苏林萍,王兆其.粗糙集理论在中国手语合成中的应用[J].复旦学报（自然科学版）,2004,43(5):874-876.
2刘斌,王莉.PCI术后cTnT升高与术后早期临床事件的关系[J].中国科技信息,2005(12):157-158.
3陈堂敏.基于知识量的粗集理论在风机故障预测中的应用[J].制造业自动化,2005,27(7):11-15.
4曹付元,梁吉业,钱宇华.基于信息熵的决策表约简[J].计算机应用,2005,25(11):2630-2631. 被引量：6
5武岩,崔广才.基于信息熵的属性约简算法的研究与实现[J].长春理工大学学报（自然科学版）,2005,28(3):48-51. 被引量：1
6陈堂敏.基于区分能力大小的启发式约简算法的研究[J].计算机学报,2006,29(3):480-487. 被引量：12
7陈堂敏.面向用户的知识量最佳属性约简算法在数控机床故障预测中的应用[J].机械科学与技术,2006,25(2):163-167. 被引量：3
8许长志,闵帆.带权约简及其在汉语词性标注自动校对中的应用[J].控制与决策,2007,22(7):740-744. 被引量：1
9袁楚明,胡广华,陈幼平,周祖德.基于RS和GA结合的属性约简在设备诊断维护中的应用[J].机械与电子,2007,25(10):43-45.
10冯琴荣,苗夺谦,程昳.基于知识划分粒度的信息系统约简算法[J].计算机工程与应用,2007,43(34):19-21.

同被引文献12

1谢宏,程浩忠,牛东晓.基于信息熵的粗糙集连续属性离散化算法[J].计算机学报,2005,28(9):1570-1574. 被引量：134
2谷小红,蔡晋辉,周泽魁.基于声发射传感器与ChiMerge粗糙集的埋地水管泄漏检测[J].传感技术学报,2006,19(6):2470-2473. 被引量：2
3Wu X D. Top 10 algorithms in data mining. Knowledge Information System, 2008, 14(1): 1-37.
4Su C T, Hsu J H. An extended Chi2 algorithm for discretization of re- al value attributes. IEEE Transactions on Knowledge and Data Engi- neering, 2005, 17(3) : 437-441.
5Dougherty J, Kohavi R, Sahami M. Supervised and unsupervised dis- cretization of continuous feature. Proceedings of 12th International Conference on Machine learning, 1995:194-202.
6Schmidberger G, Frank E. Unsupervised discretization using tree- based density estimation. The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Data- bases (ECML PKDD) , 2005:240-251.
7Biba M, Esposito F, Ferilli S, et al. Unsupervised discretization u- sing kernel density estimation. The Twentieth International Joint Conference on Artificial Intelligence (IJCAI), 2007:696-701.
8Hettich S, Bay S D. The UCI KDDArchive. http://kdd, its. uci. edu/, 1999.
9Demsar J. Statistical comparisons of classifiers over multiple data- sets. Journal of Machine Learning Research, 2006 , 7 ( 1 ) : 1-30.
10Weka 3 Data mining software in Java. http ://www. cs. waikato, ac. nz/ml/weka, 2007.

引证文献1

1单桂军,胡伟.基于连续数据量化的声纳传感器数据识别方法[J].科学技术与工程,2013,21(22):6605-6609.

1CA Technologies发布聚合基础设施管理方案[J].中国信息化,2012(22):70-70.
2CA Technologies发布聚合基础设施管理解决方案[J].数字通信世界,2012(11):73-73.
3孙昌儿,刘秉瀚.基于粗糙集理论的病理诊断规则提取算法研究[J].福州大学学报（自然科学版）,2007,35(2):175-179. 被引量：3
4郭启铭,樊玮.基于Cramer's V的连续属性离散化算法[J].计算机工程,2008,34(4):111-112. 被引量：2
5鞠久朋,张伟伟,宁建军,周国栋.CRF与规则相结合的地理空间命名实体识别[J].计算机工程,2011,37(7):210-212. 被引量：31
6毛聪莉,易波.基于决策协调度的最简决策树生成算法[J].计算机工程与设计,2008,29(5):1250-1252. 被引量：7
7徐晶,刘旭敏,关永,董睿.基于条件误分类的决策树剪枝算法[J].计算机工程,2010,36(23):50-52. 被引量：4
8张松涛.离散T-S模糊系统的稳定条件[J].控制与决策,2012,27(8):1175-1179. 被引量：2
9沈炜.基于离线条件可信第三方的挂号邮件协议[J].计算机工程,2004,30(7):108-110.
10小企业成长论坛之40 员工常找借口加薪怎么办[J].光彩,2013(9):41-41.

计算机工程

2010年第4期

浏览历史

内容加载中请稍等...

一种改进的CAIM算法被引量：1

参考文献4

二级参考文献10

共引文献40

同被引文献12

引证文献1

相关作者

相关机构

相关主题

浏览历史

一种改进的CAIM算法 被引量：1

参考文献4

二级参考文献10

共引文献40

同被引文献12

引证文献1

相关作者

相关机构

相关主题

浏览历史

一种改进的CAIM算法被引量：1