
Data partitioning based on sampling for power load streams

A sampling-based partitioning method for parallel power load data streams (article in English)
Abstract: A novel data stream partitioning method is proposed for range-aggregation continuous queries over parallel streams in the power industry. The first step samples the data in parallel using an extended reservoir-sampling algorithm. A skip factor based on the change ratio of the data values is introduced so that the sample adaptively reflects how the load data are distributed. The second step partitions the flux of the data streams evenly, using two alternative equal-depth histogram generation algorithms suited to different cases: a heuristic method for incremental maintenance and a faster method for periodic updates. Both generate an approximate partition vector from the sample. Experimental results on actual data sets show that the method is efficient and practical, and that it is suitable for processing high-speed, time-varying data streams.
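The two steps described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's algorithm: it uses the plain reservoir sampling scheme (Algorithm R) without the skip factor, and it derives equal-depth boundaries by simply sorting the sample rather than by the incremental or periodic methods the paper proposes. The function names `reservoir_sample`, `equal_depth_boundaries`, and `assign_partition` are hypothetical, chosen only for illustration.

```python
import random
from bisect import bisect_right


def reservoir_sample(stream, k, rng=None):
    """Keep a uniform random sample of at most k items from a stream.

    Plain Algorithm R; the paper's extended version with a skip factor
    is NOT reproduced here.
    """
    rng = rng or random.Random()
    reservoir = []
    for i, value in enumerate(stream):
        if i < k:
            # Fill the reservoir with the first k items.
            reservoir.append(value)
        else:
            # Replace a random slot with probability k / (i + 1).
            j = rng.randint(0, i)  # inclusive on both ends
            if j < k:
                reservoir[j] = value
    return reservoir


def equal_depth_boundaries(sample, parts):
    """Return parts - 1 cut points that split the sample into
    buckets holding roughly the same number of values."""
    if not sample or parts < 1:
        raise ValueError("need a non-empty sample and parts >= 1")
    ordered = sorted(sample)
    n = len(ordered)
    return [ordered[(i * n) // parts] for i in range(1, parts)]


def assign_partition(value, boundaries):
    """Map a value to a partition index in 0 .. len(boundaries)."""
    return bisect_right(boundaries, value)


if __name__ == "__main__":
    # Hypothetical load readings; real data would arrive as a stream.
    readings = (random.gauss(50.0, 10.0) for _ in range(10_000))
    sample = reservoir_sample(readings, k=200)
    vector = equal_depth_boundaries(sample, parts=4)
    print(vector, assign_partition(50.0, vector))
```

In this sketch the list returned by `equal_depth_boundaries` plays the role of the approximate partition vector: each incoming value is routed to the partition whose range contains it, so partitions receive roughly equal shares of the flow as long as the sample reflects the current distribution.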
Source: Journal of Southeast University (English Edition), indexed in EI and CAS, 2005, No. 3, pp. 293-298 (6 pages)
Funding: The High Technology Research Plan of Jiangsu Province (No. BG2004034); the Foundation of Graduate Creative Program of Jiangsu Province (No. xm04-36).
Keywords: data streams; continuous queries; parallel processing; sampling; data partitioning
