
A Missing Data Imputation Algorithm for Data Streams Based on a Nested Sliding Window (基于嵌套滑动窗口的数据流缺失数据填充算法). Cited by: 4

On a Missing Data Imputation Algorithm Based on the Nested Sliding Window
Abstract: Data streams are continuous, massive, and fast, so traditional imputation algorithms cannot be applied to them directly. This paper proposes a missing data imputation algorithm based on a nested sliding window. Because sensor data streams are time-sensitive, the algorithm uses a nested sliding window to select, as sample data, the readings that are both highly spatially correlated with the missing value and nearest to it, and then imputes the missing data in one of two ways. In the first case, Pearson correlation is used to analyze the spatial relationship among the data, the nested sliding window samples the data related to the missing value to obtain strongly correlated data, and the MKNN algorithm then performs an accurate imputation. Pearson correlation analysis and nested-window sampling greatly reduce the sample size and improve the real-time performance of missing data processing. In the second case, for missing data without strong spatial correlation, the algorithm relies on the strong temporal correlation among data collected within a short period and imputes the value with a linear correlation method, which lowers the algorithm's complexity. Experiments show that the algorithm imputes missing data in data streams accurately and in real time.
Source: Journal of Southwest China Normal University (Natural Science Edition) (《西南师范大学学报(自然科学版)》), indexed in CAS and the Peking University Core Journals list, 2015, No. 11, pp. 130-136 (7 pages)
Keywords: sensor networks; data stream; nested sliding window; missing data; data imputation
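
The two-case procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not define MKNN, so a plain distance-weighted k-nearest-neighbour average stands in for it, and a two-point linear extrapolation stands in for the linear method. The function names (`pearson`, `impute_latest`), the window sizes `outer` and `inner`, the correlation `threshold`, and `k` are all hypothetical choices made for illustration.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if undefined)."""
    n = len(xs)
    if n < 2:
        return 0.0
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)


def impute_latest(history, neighbours, outer=20, inner=8, threshold=0.8, k=3):
    """Estimate the missing reading at step t = len(history).

    history    -- target sensor readings for steps 0..t-1
    neighbours -- {sensor_id: readings for steps 0..t} of nearby sensors
    outer      -- assumed outer window: steps used for the correlation test
    inner      -- assumed inner window: most recent steps sampled for kNN
    """
    if not history:
        raise ValueError("at least one past reading is required")
    t = len(history)
    lo = max(0, t - outer)

    # Case split: keep only sensors strongly correlated within the outer window.
    strong = [s for s, r in neighbours.items()
              if abs(pearson(history[lo:t], r[lo:t])) >= threshold]

    if not strong:
        # No strong spatial correlation: fall back on temporal correlation
        # (assumed here to be a two-point linear extrapolation).
        if t >= 2:
            return 2 * history[-1] - history[-2]
        return history[-1]

    # Strong spatial correlation: stand-in for MKNN, a distance-weighted
    # kNN over the inner window of the most recent steps.
    current = [neighbours[s][t] for s in strong]
    candidates = []
    for i in range(max(0, t - inner), t):
        past = [neighbours[s][i] for s in strong]
        dist = sqrt(sum((a - b) ** 2 for a, b in zip(current, past)))
        candidates.append((dist, history[i]))
    candidates.sort(key=lambda c: c[0])
    nearest = candidates[:k]
    weights = [1.0 / (d + 1e-9) for d, _ in nearest]
    return sum(w * v for w, (_, v) in zip(weights, nearest)) / sum(weights)
```

Restricting the neighbour search to the inner window is what keeps the sample small in this sketch; in practice the window sizes and the threshold would need to be tuned to the sensor deployment.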

