云平台下基于粗糙集的并行算法的研究被引量：1

Parallel algorithm for rough set approximates in cloud computing platform

下载PDF

导出

摘要随着科技不断发展和新技术的不断涌现,数据的重要性变得越来越明显,与此同时数据也在以超出人们预期的速度快速地增长。物联网技术和云计算技术的出现给数据挖掘和知识发现等相关领域既带来了巨大挑战,也赋予了新的活力,物联网的出现和成功运用使得数据具有时间特性和空间特性,在增加数量的同时也增加了数据的维度,从而使一些传统的数据挖掘的工具和算法变得效率低下;而云计算平台提供的计算能力和简易的并行编程思想使得大量数据所带来的问题在一定程度上得到解决。粗糙集是一种成功数据发掘工具,但在面对日益增长的数据时,效率也变得不理想。借助Map/Reduce思想将传统串行运行算法成功转移到云环境中。首先简单介绍了Map/Reduce流程和粗糙集的相关理论,然后扩展云环境下编程理论和提出相应的算法,最后通过复杂度和相应实验验证了算法的有效性。 With the continuous development of science and technologies, many new technologies are brought out, and the function of data becomes more and more important. At the same time, the data is also growing at a fast speed which exceeds our expectations. The emergence of the Internet of things and the cloud computing has brought enormous challenges to the data mining, knowledge discovery and other related fields, hut＇ they also give them new and energetic lives. The data produced by the Internet of things processes spatial characteristics and temporal characteristics, which increase the dimension of the data. For some traditional tools and algorithms, the process has become inefficient. However, the powerful computing ability and the simply parallel programming way of the cloud make the problem solved at some extent. Rough set is a successful data-mining tool, but in the face of the increasing data, it becomes inefficient. In this paper, with the help of the Map/Reduce we have transferred the traditional serial algorithm into the cloud environment. Firstly we briefly introduces the Map/Reduce process and the rough set theory, and then expand the cloud environment programming theories. At last, we put forward the corresponding algorithm, and verify the validity of the algorithm by the complexity and the corresponding experimental.

作者李朋刘天华

机构地区沈阳师范大学科信软件学院

出处《沈阳师范大学学报（自然科学版）》 CAS 2015年第2期274-278,共5页 Journal of Shenyang Normal University:Natural Science Edition

基金国家自然科学基金资助项目(60970112)

关键词粗糙集 MAP/REDUCE 并行算法 Rough set Map/Reduce parallel algorithm

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献15

1PAWLAK Z, GRZYMALA-BUSSE J, SLOWINSKI R, et al. Rough sets[J]. Communications of the ACM, 1995, 38(11) :88 - 95.
2Blt;IASZCZYIdSKI J, SOWIt;ISKI R, SZELAG M. Sequential covering rule induction algorithm for variable consistency rough set approaches[J]. Information Sciences, 2011,181(5):987- 1002.
3HONG T P, TSENG L H, CHIEN B (2. Mining from incomplete quantitative data by fuzzy rough sets[J]. Expert Systems with Applications, 2010,37(3) : 2644 - 2653.
4INUIGUCHI M, MIYAJIMA T. Rough set based rule induction from two decision tables[J]. Euro J Operational Res, 2007,181(3) :1540 - 1553.
5KANElWA K. A rough set approach to multiple dataset analysis[J]. Appl Soft Comput, 2011,11(2) : 2538 - 2547.
6LEUNG Y, FISCHER M, WU W Z, et al. A rough set approach for the discovery of classification rules in interval --valued information systems[J]. Int J Approximate Reasoning, 2008,47 (2):233- 246.
7MIAO D Q, DUAN Q G, ZHANG H, et al. Rough set based hybrid algorithm for text elassification[J]. Exp Syst Appl, 2009,36(5) :9168 - 9174.
8CHEN Y M, MIAO D Q, WANG R Z, et al. A rough set approach to feature selection based on power set tree[J]. Knowledge-Based Syst, 2011,24(2) : 275 - 281.
9张岩,郭松,赵国海.基于Hadoop的云计算试验平台搭建研究[J].沈阳师范大学学报（自然科学版）,2013,31(1):85-89. 被引量：14
10DEAN J, GHEMAWAT S. MapReduce: a flexible data processing tool[J]. Communications of the ACM, 2010,53 (1):72 - 77.

二级参考文献26

1崔杰,李陶深,兰红星.基于Hadoop的海量数据存储平台设计与开发[J].计算机研究与发展,2012,49(S1):12-18. 被引量：141
2叶东毅,陈昭炯.一个新的二进制可辨识矩阵及其核的计算[J].小型微型计算机系统,2004,25(6):965-967. 被引量：49
3Liang Ji Ye,Xu Zong-Ben.The algorithm on knowledge reduction in incomplete information systems.International Journal of Uncertainty,Fuzziness and Knowledge Based Systems,2002,10(1):95～103
4Pawlak Z.et al.Rough set.Communications of the ACM,1995,38(11):89～95
5Pawlak Z.et al.Rough set theory and its application to data a nalysis.Cybernetics and Systems,1998,29(7):661～688
6Wang S.K.M.,Ziarko W..On optional decision rules in de cision table.Bulletin of Polish Academy of Sciences,1985,33(11～12):693～696
7Hu X.H.,Nick C..Learning in relational databases:A rough set approach.International Journal of Computational Intelligence,1995,11(2):323～338
8黎春兰,邓仲华.论云计算的价值[J].图书与情报,2009(4):42-46. 被引量：87
9张建勋,古志民,郑超.云计算研究进展综述[J].计算机应用研究,2010,27(2):429-433. 被引量：589
10多雪松,张晶,高强.基于Hadoop的海量数据管理系统[J].微计算机信息,2010,26(13):202-204. 被引量：27

共引文献246

1马捷,葛岩,蒲泓宇.属性约简方法研究综述[J].数据分析与知识发现,2020,4(1):40-50. 被引量：11
2梁吉业,李超伟,魏巍.基于Rough Sets的特征选择研究进展[J].山西大学学报（自然科学版）,2012,35(2):211-218. 被引量：2
3王希雷,马永军,苏静.基于Rough集的数据挖掘中知识变化的研究[J].华中科技大学学报（自然科学版）,2012,40(S1):320-323.
4李华雄,周献中.基于0-1分辨矩阵的启发式属性约简[J].中南大学学报（自然科学版）,2009,40(S1):304-308. 被引量：2
5周江卫,冯博琴,刘洋.粗糙集高效遗传约简算法[J].西安交通大学学报,2007,41(4):444-447. 被引量：8
6孙林,徐久成,马媛媛.基于新的条件熵的决策树规则提取方法[J].计算机应用,2007,27(4):884-887. 被引量：11
7桂现才.决策表化简及其属性约简方法[J].计算机工程与设计,2007,28(8):1765-1767. 被引量：1
8姜伟,徐章艳,杨炳儒.基于数据库的属性约简模型的快速求核算法[J].计算机工程与应用,2007,43(16):189-190. 被引量：5
9周江卫,冯博琴,刘洋.一种新的快速求核算法[J].西安交通大学学报,2007,41(6):688-691. 被引量：10
10孙林,徐久成,马媛媛.基于决策熵的决策树规则提取方法[J].计算机技术与发展,2007,17(6):97-100. 被引量：6

同被引文献3

1董俊,王锁萍,熊范纶.可变相似性度量的近邻传播聚类[J].电子与信息学报,2010,32(3):509-514. 被引量：49
2刘晓勇,付辉.一种快速AP聚类算法[J].山东大学学报（工学版）,2011,41(4):20-23. 被引量：20
3郭秀娟,陈莹.AP聚类算法的分析与应用[J].吉林建筑工程学院学报,2013,30(4):58-61. 被引量：12

引证文献1

1铉岩,周传生.基于张量距离的高阶近邻传播聚类算法[J].沈阳师范大学学报（自然科学版）,2016,34(1):96-99.

1寇晓雨.CAXA在数控技术中的编程理论[J].计算机光盘软件与应用,2011(16):98-98.
2刘鑫,李鹏飞.下位机PLC和上位机组态软件在恒压供水系统中的应用[J].电气传动自动化,2016,38(4):29-34. 被引量：9
3王轶男.轶男随笔[J].电脑高手,2002(9):95-99.
4欧阳为民,郑诚,张燕.国际知识发现与数据发掘工具评述[J].计算机科学,2001,28(3):101-108. 被引量：10
5马志芳.基于西门子T-XP3000的设定值调节器算法的开发[J].电力学报,2010,25(2):165-166.
6邹长忠,傅清祥.一种新的加权关联规则增量更新算法[J].福州大学学报（自然科学版）,2008,36(4):501-505. 被引量：1
7赵德新,刘瑾.设计模式思想及其应用[J].天津理工大学学报,2007,23(5):58-61. 被引量：10
8Cypress 0．13umeFlash工艺成功移植华虹NEC[J].集成电路应用,2008(8):13-13.
9贾维,张胜文,张亮,方喜峰.数控编程KBE系统的基础性使能技术研究[J].CAD/CAM与制造业信息化,2008(7):88-91. 被引量：1
10Charles Lohr.将音乐变成色彩[J].科技纵览,2016,0(8):19-20.

沈阳师范大学学报（自然科学版）

2015年第2期

浏览历史

内容加载中请稍等...

云平台下基于粗糙集的并行算法的研究被引量：1

参考文献15

二级参考文献26

共引文献246

同被引文献3

引证文献1

相关作者

相关机构

相关主题

浏览历史

云平台下基于粗糙集的并行算法的研究 被引量：1

参考文献15

二级参考文献26

共引文献246

同被引文献3

引证文献1

相关作者

相关机构

相关主题

浏览历史

云平台下基于粗糙集的并行算法的研究被引量：1