期刊文献+

Classifying Uncertain and Evolving Data Streams with Distributed Extreme Learning Machine 被引量:1

Classifying Uncertain and Evolving Data Streams with Distributed Extreme Learning Machine
原文传递
导出
摘要 Conventional classification algorithms are not well suited for the inherent uncertainty, potential concept drift, volume, and velocity of streaming data. Specialized algorithms are needed to obtain efficient and accurate classifiers for uncertain data streams. In this paper, we first introduce Distributed Extreme Learning Machine (DELM), an optimization of ELM for large matrix operations over large datasets. We then present Weighted Ensemble Classifier Based on Distributed ELM (WE-DELM), an online and one-pass algorithm for efficiently classifying uncertain streaming data with concept drift. A probability world model is built to transform uncertain streaming data into certain streaming data. Base classifiers are learned using DELM. The weights of the base classifiers are updated dynamically according to classification results. WE-DELM improves both the efficiency in learning the model and the accuracy in performing classification. Experimental results show that WE-DELM achieves better performance on different evaluation criteria, including efficiency, accuracy, and speedup. Conventional classification algorithms are not well suited for the inherent uncertainty, potential concept drift, volume, and velocity of streaming data. Specialized algorithms are needed to obtain efficient and accurate classifiers for uncertain data streams. In this paper, we first introduce Distributed Extreme Learning Machine (DELM), an optimization of ELM for large matrix operations over large datasets. We then present Weighted Ensemble Classifier Based on Distributed ELM (WE-DELM), an online and one-pass algorithm for efficiently classifying uncertain streaming data with concept drift. A probability world model is built to transform uncertain streaming data into certain streaming data. Base classifiers are learned using DELM. The weights of the base classifiers are updated dynamically according to classification results. WE-DELM improves both the efficiency in learning the model and the accuracy in performing classification. Experimental results show that WE-DELM achieves better performance on different evaluation criteria, including efficiency, accuracy, and speedup.
出处 《Journal of Computer Science & Technology》 SCIE EI CSCD 2015年第4期874-887,共14页 计算机科学技术学报(英文版)
基金 This work was supported by the National Natural Science Foundation of China under Grant Nos. 61173029 and 61272182. Acknowledgement The authors would like to thank anonymous reviewers and editors for their valuable comments.
关键词 uncertain data stream CLASSIFICATION extreme learning machine distributed computing concept drift uncertain data stream, classification, extreme learning machine, distributed computing, concept drift
  • 相关文献

参考文献32

  • 1Babcock B, Babu S, Datar M et al. Models and issues in data stream systems. In Proc. the 21st ACM SIGMOD- SIGACT-SGART Symposium on Principles of Database Systems, June 2002, pp.1-16.
  • 2Tran T T, Peng L, Li Bet al. PODS: A new model and pro- cessing algorithms for uncertain data streams. In Proc. the 2010 ACM SIGMOD International Conference on Man- agement of Data, June 2010, pp.159-170.
  • 3Cao K Y, Wang G R, Han D H et al. Continuous outlier monitoring on uncertain data streams. Yournal of Computer Science and Technology, 2014, 29(3): 436-448.
  • 4Zhao L, Yang Y Y, Zhou X. Continuous probabilistic sub- space skyline query processing using grid projections. Your- nal of Computer Science and Technology, 2014, 29(2): 332- 344.
  • 5Zhou A Y, Jin C Q, Wang G R et al. A survey on the man- agement of uncertain data. Chinese Journal of Computers, 2009, 32(1): 1-16.
  • 6He Q, Shang T, Zhuang F et al. Parallel extreme learning machine for regression based on MapReduce. Neurocomput- ing, 2013, 102: 52-58.
  • 7Aggarwal C C, Yu P S. A survey of uncertain data algo- rithms and applications. IEEE Transactions on Knowledge and Data Engineering, 2009, 21(5): 609-623.
  • 8Masud M M, Gao J, Khan L et al. A practical approach to classify evolving data streams: Training with limited amount of labeled data. In Proc. the 8th IEEE International Conference on Data Mining, December 2008, pp.929-934.
  • 9Xu W, Qin Z, Chang Y. A framework for classifying un- certain and evolving data streams. Information Technology Journal, 2011, 10(10): 1926-1935.
  • 10Domingos P, Hulten G. Mining high-speed data streams. In Proc. the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 2000, pp.71-80.

同被引文献1

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部