Classifying Uncertain and Evolving Data Streams with Distributed Extreme Learning Machine 被引量：1

Classifying Uncertain and Evolving Data Streams with Distributed Extreme Learning Machine

导出

摘要 Conventional classification algorithms are not well suited for the inherent uncertainty, potential concept drift, volume, and velocity of streaming data. Specialized algorithms are needed to obtain efficient and accurate classifiers for uncertain data streams. In this paper, we first introduce Distributed Extreme Learning Machine （DELM）, an optimization of ELM for large matrix operations over large datasets. We then present Weighted Ensemble Classifier Based on Distributed ELM （WE-DELM）, an online and one-pass algorithm for efficiently classifying uncertain streaming data with concept drift. A probability world model is built to transform uncertain streaming data into certain streaming data. Base classifiers are learned using DELM. The weights of the base classifiers are updated dynamically according to classification results. WE-DELM improves both the efficiency in learning the model and the accuracy in performing classification. Experimental results show that WE-DELM achieves better performance on different evaluation criteria, including efficiency, accuracy, and speedup. Conventional classification algorithms are not well suited for the inherent uncertainty, potential concept drift, volume, and velocity of streaming data. Specialized algorithms are needed to obtain efficient and accurate classifiers for uncertain data streams. In this paper, we first introduce Distributed Extreme Learning Machine （DELM）, an optimization of ELM for large matrix operations over large datasets. We then present Weighted Ensemble Classifier Based on Distributed ELM （WE-DELM）, an online and one-pass algorithm for efficiently classifying uncertain streaming data with concept drift. A probability world model is built to transform uncertain streaming data into certain streaming data. Base classifiers are learned using DELM. The weights of the base classifiers are updated dynamically according to classification results. WE-DELM improves both the efficiency in learning the model and the accuracy in performing classification. Experimental results show that WE-DELM achieves better performance on different evaluation criteria, including efficiency, accuracy, and speedup.

作者韩东红张昕王国仁

机构地区 College of Information Science and Engineering Key Laboratory of Medical Image Computing ( NEU)

出处《Journal of Computer Science & Technology》 SCIE EI CSCD 2015年第4期874-887,共14页 计算机科学技术学报（英文版）

基金 This work was supported by the National Natural Science Foundation of China under Grant Nos. 61173029 and 61272182. Acknowledgement The authors would like to thank anonymous reviewers and editors for their valuable comments.

关键词 uncertain data stream CLASSIFICATION extreme learning machine distributed computing concept drift uncertain data stream, classification, extreme learning machine, distributed computing, concept drift

分类号 TP393 [自动化与计算机技术—计算机应用技术] TP273.22 [自动化与计算机技术—检测技术与自动化装置]

引文网络
相关文献

参考文献32

1Babcock B, Babu S, Datar M et al. Models and issues in data stream systems. In Proc. the 21st ACM SIGMOD- SIGACT-SGART Symposium on Principles of Database Systems, June 2002, pp.1-16.
2Tran T T, Peng L, Li Bet al. PODS: A new model and pro- cessing algorithms for uncertain data streams. In Proc. the 2010 ACM SIGMOD International Conference on Man- agement of Data, June 2010, pp.159-170.
3Cao K Y, Wang G R, Han D H et al. Continuous outlier monitoring on uncertain data streams. Yournal of Computer Science and Technology, 2014, 29(3): 436-448.
4Zhao L, Yang Y Y, Zhou X. Continuous probabilistic sub- space skyline query processing using grid projections. Your- nal of Computer Science and Technology, 2014, 29(2): 332- 344.
5Zhou A Y, Jin C Q, Wang G R et al. A survey on the man- agement of uncertain data. Chinese Journal of Computers, 2009, 32(1): 1-16.
6He Q, Shang T, Zhuang F et al. Parallel extreme learning machine for regression based on MapReduce. Neurocomput- ing, 2013, 102: 52-58.
7Aggarwal C C, Yu P S. A survey of uncertain data algo- rithms and applications. IEEE Transactions on Knowledge and Data Engineering, 2009, 21(5): 609-623.
8Masud M M, Gao J, Khan L et al. A practical approach to classify evolving data streams: Training with limited amount of labeled data. In Proc. the 8th IEEE International Conference on Data Mining, December 2008, pp.929-934.
9Xu W, Qin Z, Chang Y. A framework for classifying un- certain and evolving data streams. Information Technology Journal, 2011, 10(10): 1926-1935.
10Domingos P, Hulten G. Mining high-speed data streams. In Proc. the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 2000, pp.71-80.

同被引文献1

1DENG ChenWei,HUANG GuangBin,XU Jia,TANG JieXiong.Extreme learning machines: new trends and applications[J].Science China(Information Sciences),2015,58(2):1-16. 被引量：52

引证文献1

1You-Xi Wu,Dong Liu,He Jiang.Length-Changeable Incremental Extreme Learning Machine[J].Journal of Computer Science & Technology,2017,32(3):630-643. 被引量：2

二级引证文献2

1王延斌,武优西,刘洪普.梯度优化决策树的集成学习及其应用[J].计算机科学,2018,45(B11):121-125. 被引量：3
2张轶,高雪冬,郭亚伟,赵丙贺.加权k-means算法及其在高校贫困生判别中的应用[J].产业与科技论坛,2022,21(19):40-44. 被引量：2

1Zhang Yuru (Beijing University of Aeronautics and Astronautics William A. Gruver Simon Fraser University , Canada).METHOD OF CLASSIFYING GRASPS BY ROBOT HANDS[J].Chinese Journal of Mechanical Engineering,1996,9(4):271-277. 被引量：1
2张忠平,李岩,杨静.基于矩阵的频繁项集挖掘算法[J].计算机工程,2009,35(1):84-86. 被引量：19
3尹志武,黄上腾,薛贵荣.Logistic Regression for Evolving Data Streams Classification[J].Journal of Shanghai Jiaotong university(Science),2007,12(2):197-203.
4Windows 7库使用技巧:把文件收藏起来[J].网络与信息,2009(11):40-40.
5徐俊芬,叶俊杰,刘业政.基于相似领域共享特征的分类学习模型[J].计算机工程与应用,2014,50(17):137-141.
6屠新兵.中职C语言中矩阵操作编程方法探索[J].山西农经,2016(14):118-119.
7DU Weiwei.Colorization by classifying the prior knowledge[J].智能系统学报,2011,6(6):556-560.
8Min Cheng,Guojin Wang.Approximate merging of multiple Bézier segments[J].Progress in Natural Science:Materials International,2008,18(6):757-762. 被引量：10
9榆树市加快信息化发展步伐[J].吉林农业,2011(10):95-95.
10付伟,叶清,吴晓平.基于矩阵操作的必然性QoS约束副本放置方法[J].系统工程理论与实践,2012,32(12):2796-2801.

Journal of Computer Science & Technology

2015年第4期

浏览历史

内容加载中请稍等...

Classifying Uncertain and Evolving Data Streams with Distributed Extreme Learning Machine 被引量：1

参考文献32

同被引文献1

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史