
Algorithm for Bursty Term Query in Cloud Computing
Abstract: Distributed bursty term query under the Spark Streaming framework is an active research topic that aims to detect when terms burst in data streams. Most existing methods count and store every term without distinguishing hot terms. As data volumes grow explosively, the bursty time of hot terms is the more valuable thing to obtain. To address this problem, we present a distributed bursty term query algorithm. The algorithm uses a dynamic update strategy and a checkpoint mechanism to extract hot terms, and it finds the bursty time range in linear time. Experimental results show that the proposed algorithm performs better than existing algorithms.
Authors: ZHENG Shi-min, QIN Xiao-lin, LIU Liang, ZHOU Qian (College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China)
Source: Computer Science (《计算机科学》), indexed in CSCD and the Peking University Core Journals list, 2017, No. 3, pp. 10-15, 35 (7 pages)
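Funding: Supported by the National Natural Science Foundation of China (61373015, 61300052), the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD), and the Jiangsu Province Major Scientific and Technological Achievements Transformation Fund (BA2013049)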
Keywords: Cloud computing; Bursty term query; Data streams; Spark Streaming
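The abstract names two steps, extracting hot terms at checkpoints and finding a bursty time range in linear time, but this record does not give the paper's actual procedure. The following is only a minimal single-machine sketch of those two ideas under stated assumptions: hot terms are taken to be the `top_k` most frequent terms since the last checkpoint, and the bursty range is taken to be the contiguous window with the largest summed excess over a fixed `baseline`, found with a Kadane-style maximum-subarray scan. The function names, `top_k`, and `baseline` are all hypothetical and are not taken from the paper, and no Spark Streaming API is used.

```python
from collections import Counter


def hot_terms_at_checkpoint(batches, top_k):
    """Count terms across the batches seen since the last checkpoint
    and keep only the top_k most frequent ones (assumed definition of 'hot')."""
    totals = Counter()
    for batch in batches:
        totals.update(batch)
    return [term for term, _ in totals.most_common(top_k)]


def bursty_range(counts, baseline):
    """Return (start, end, score) for the contiguous window whose summed
    excess over `baseline` is largest, or None if no window has positive excess.

    This is a single pass over `counts` (Kadane's maximum-subarray scan),
    so it runs in time linear in len(counts).
    """
    best = None
    run_sum, run_start = 0.0, 0
    for i, count in enumerate(counts):
        # A run with non-positive sum cannot help a later window: restart here.
        if run_sum <= 0:
            run_sum, run_start = 0.0, i
        run_sum += count - baseline
        if run_sum > 0 and (best is None or run_sum > best[2]):
            best = (run_start, i, run_sum)
    return best


if __name__ == "__main__":
    batches = [["spark", "flu", "spark"], ["flu", "spark", "rain"]]
    print(hot_terms_at_checkpoint(batches, top_k=2))

    per_batch_counts = [1, 1, 6, 7, 1, 1]  # occurrences of one hot term per batch
    print(bursty_range(per_batch_counts, baseline=2))
```

Pruning to hot terms at each checkpoint keeps the per-term state bounded, which matches the motivation the abstract gives for not storing every term; how the paper distributes this work across Spark Streaming workers is not described here.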
