Research on Discretization Algorithms in Rough Sets


Abstract: Current discretization algorithms for rough sets are classified and discussed. The theoretical basis and implementation steps of the discretization algorithm based on information entropy are analyzed in detail, and the algorithm's behavior on the same attribute across different sample data sets is examined. The experimental results show that the algorithm is data-sensitive for some attributes, and that choosing these attributes as the basis for decisions weakens the decision-making ability of the system.
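The abstract does not reproduce the algorithm's steps, so the following is only a minimal sketch of a generic entropy-based discretization, assuming the common scheme in which candidate cut points are the midpoints between adjacent distinct attribute values and the cut with the largest information gain is chosen recursively. With class entropy H(S) = -Σ p_i log2 p_i, the gain of a cut T that splits S into S1 and S2 is Gain(T) = H(S) - (|S1|/|S|) H(S1) - (|S2|/|S|) H(S2). The function names and the `min_gain` and `max_depth` stopping parameters are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def best_cut_point(values, labels):
    """Return (cut, gain) for the binary split with the highest information gain.

    Candidate cuts are midpoints between adjacent distinct sorted values.
    Returns (None, 0.0) when no split improves on the unsplit entropy.
    """
    pairs = sorted(zip(values, labels), key=lambda p: p[0])
    n = len(pairs)
    base = entropy([lab for _, lab in pairs])
    best_cut, best_gain = None, 0.0
    for i in range(1, n):
        lo, hi = pairs[i - 1][0], pairs[i][0]
        if lo == hi:
            continue  # no boundary between equal attribute values
        left = [lab for _, lab in pairs[:i]]
        right = [lab for _, lab in pairs[i:]]
        weighted = (len(left) / n) * entropy(left) + (len(right) / n) * entropy(right)
        gain = base - weighted
        if gain > best_gain:
            best_cut, best_gain = (lo + hi) / 2, gain
    return best_cut, best_gain


def discretize(values, labels, min_gain=0.1, max_depth=3):
    """Recursively collect cut points while the gain stays at or above min_gain.

    min_gain and max_depth are illustrative stopping rules (an assumption),
    not the stopping criterion used in the paper.
    """
    if max_depth == 0 or len(values) < 2:
        return []
    cut, gain = best_cut_point(values, labels)
    if cut is None or gain < min_gain:
        return []
    left = [(v, lab) for v, lab in zip(values, labels) if v <= cut]
    right = [(v, lab) for v, lab in zip(values, labels) if v > cut]
    cuts = discretize([v for v, _ in left], [lab for _, lab in left],
                      min_gain, max_depth - 1)
    cuts.append(cut)
    cuts += discretize([v for v, _ in right], [lab for _, lab in right],
                       min_gain, max_depth - 1)
    return cuts


# Made-up sample: one continuous attribute with two decision classes.
sample_values = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
sample_labels = ["a", "a", "a", "b", "b", "b"]
sample_cuts = discretize(sample_values, sample_labels)
```

Because the cut points are computed from the sample itself, running `discretize` on two different sample sets of the same attribute can give different intervals, which is the kind of data sensitivity the abstract reports.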

Authors: Shi Zhicai (史志才), Wang Yihan (王益涵), Zhao Minyuan (赵敏媛)

Affiliation: School of Electronic and Electrical Engineering, Shanghai University of Engineering Science

Source: Journal of Shanghai University of Engineering Science (《上海工程技术大学学报》), CAS, 2010, No. 3, pp. 240-244 (5 pages)

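Funding: Scientific Research Innovation Project of the Shanghai Municipal Education Commission (09YZ370); Research Fund Project of Shanghai University of Engineering Science (校启07-22)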

Keywords: rough set; entropy; discretization; information gain

Classification code: TP393.08 [Automation and Computer Technology: Computer Application Technology]


References (7)

[1] PAWLAK Z. Rough set theory and its application to data analysis [J]. Cybernetics and Systems, 1998, 29(7): 661-688.
[2] 谢宏, 程浩忠, 牛东晓. Discretization algorithm for continuous attributes in rough sets based on information entropy [J]. Chinese Journal of Computers (计算机学报), 2005, 28(9): 1570-1574.
[3] FAYYAD U, IRANI K. Multi-interval discretization of continuous-valued attributes for classification learning [C] // Proceedings of the 13th International Joint Conference on Artificial Intelligence. San Mateo: Morgan Kaufmann Publishers, 1993: 1022-1027.
[4] KERBER R C. Discretization of numeric attributes [C] // Proceedings of the 10th National Conference on Artificial Intelligence. MIT Press, 1992: 123-128.
[5] 李刚, 李霁伦, 童兆页. WILD: a discretization algorithm based on weighted information loss [J]. Journal of Nanjing University (Natural Sciences) (南京大学学报（自然科学版）), 2001, 37(2): 148-153.
[6] 赵曦滨, 井然哲, 顾明. Adaptive intrusion detection algorithm based on rough sets [J]. Journal of Tsinghua University (Science and Technology) (清华大学学报（自然科学版）), 2008, 48(7): 1165-1168.
[7] STOLFO S J, FAN W, LEE W K, et al. Cost-based modeling and evaluation for data mining with application to fraud and intrusion detection: results from the JAM project [EB/OL]. (1999-08-27) [2010-05-30]. http://www.weifan.info/PAPERS/JAM99.PDF.


