期刊文献+

一种面向分布式数据流的闭频繁模式挖掘方法 被引量:6

Closed frequent patterns mining method over distributed data streams
下载PDF
导出
摘要 对智能交通系统中面向分布式数据流的频繁模式挖掘问题进行了研究。针对智能交通系统中传感器网络数据流的特点,提出一种基于分布式窗口树的分布式数据流闭频繁模式挖掘方法。该方法在分布式节点中构建分布式窗口树,通过对分布式窗口树进行更新、剪枝及挖掘,能够快速响应用户的查询请求,返回任意时间窗口内数据中的闭频繁模式。实验表明,在保证挖掘准确性的前提下,该方法能够有效缩短查询响应时间,并具有良好的可扩展性。 The paper concentrated on frequent patterns mining problem over distributed data streams in intelligent transportation systems. According to the characteristics of sensor network data streams in intelligent transportation systems, this paper proposed a method for mining closed frequent patterns in arbitrary time window of distributed data streams. As data stream flows, the method captured the contents of data streams with a distributed compact prefix-tree, which was preserved in the distributed computing nodes. It deleted the obsolete and infrequent items by periodically pruning the tree. With mining the tree paralled, the result could be returned to user rapidly after submitting the query. The experimental results show that, ensuring the accuracy of mining, the method can effectively reduce the query response time, and has good scalability.
出处 《计算机应用研究》 CSCD 北大核心 2015年第12期3560-3564,3595,共6页 Application Research of Computers
基金 国家自然科学基金资助项目(61472256 61170277) 上海市教委科研创新重点资助项目(12zz137) 上海市一流学科建设项目(S1205YLXK)
关键词 智能交通系统 分布式数据流 闭频繁模式挖掘 MAPREDUCE 传感器网络 intelligent transportation system distributed data streams mining closed frequent patterns MapReduee sensor network
  • 相关文献

参考文献17

  • 1Ichrio MASAKI. A brief History of ITS[R] . Cambridge:Massachusetts Institute of Technology, 1999.
  • 2谢嘉孟,彭宏,周兵,杨东辉.基于数据挖掘技术的智能交通信息分析与决策研究[J].公路,2004,49(4):154-158. 被引量:14
  • 3李国徽,陈辉.挖掘数据流任意滑动时间窗口内频繁模式[J].软件学报,2008,19(10):2585-2596. 被引量:45
  • 4陈辉.挖掘数据流滑动时间窗口内Top-K频繁模式[J].小型微型计算机系统,2010,31(6):1123-1128. 被引量:2
  • 5曲武,隋海峰,杨炳儒,谢永红.分布式数据流挖掘的研究进展[J].计算机科学,2012,39(1):1-8. 被引量:8
  • 6Jiang Nan, Grnenwald L. CFI-stream:mining closed frequent itemsets in data streams[C] //Proc of the 12th ACM SIGKDD Interl Conference on Knowledge Discovery and Data Mining. New York:ACM Press, 2006:592-597.
  • 7Pramudiono I, Kitsuregawa M. Parallel FP-growth on PC cluster[C] //Advances in Knowledge Discovery and Data Mining. Berlin:Springer, 2003:467-473.
  • 8Li Haoyuan, Wang Yi, Zhang Dong, et al. PFP:parallel FP-growth for query recommendation[C] //Proc of ACM Conference on Recommender Systems. 2008:107-114.
  • 9Wang Suqi, Yang Yubin, Chen Guangpeng, et al. MapReduce-based closed frequent itemset mining with efficient redundancy filtering[C] //Proc of the 12th IEEE International Conference on Data Mining. 2012:449-453.
  • 10Otey M E, Wang Chao, Parthasarathy S, et al. Mining frequent itemsets in distributed and dynamic databases[C] //Proc of the 3rd IEEE International Conference on Data Mining. 2003:617-620.

二级参考文献127

  • 1金澈清,钱卫宁,周傲英.流数据分析与管理综述[J].软件学报,2004,15(8):1172-1181. 被引量:161
  • 2陆介平,杨明,孙志挥,鞠时光.快速挖掘全局最大频繁项目集[J].软件学报,2005,16(4):553-560. 被引量:27
  • 3张冬冬,李建中,王伟平,郭龙江.数据流历史数据的存储与聚集查询处理算法[J].软件学报,2005,16(12):2089-2098. 被引量:17
  • 4何波,王华秋,刘贞,王越.快速挖掘频繁项集的并行算法[J].计算机应用,2006,26(2):391-392. 被引量:5
  • 5Cormode G,Muthukrishnan S.What′s hot and what′s not:tracking most frequent items dynamically[C].In:Proceedings of the Twenty-Second ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems,ACM Press,June,2003,296-306.
  • 6Han Jia-wei,Pei Jian,Yin Yi-wen.Mining frequent patterns without candidate generation[A].In:the 2000 ACM SIGMOD International Conference of Management of Data[C],ACM Press,2000.1-12.
  • 7Han Jia-wei,Wang Jian-yong,Lu Ying,et al.Mining top-K frequent closed patterns without minimum support[C].In:the 2002 IEEE International Conference on Data Mining (ICDM′02),IEEE Press,2002,211-218.
  • 8Li H F,Lee S Y.Mining frequent itemsets over data streams using efficient window sliding techniques[J].Expert Systems with Applications,2009,36(2):1466-1477.
  • 9Li H F,Shan M K,Lee S Y.DSM-FI:an efficient algorithm for mining frequent itemsets in data streams[J].Knowledge and Information Systems,2008,17(1):79-97.
  • 10Raymond Chi-Wing Wong,Ada Wai-Chee Fu.Mining top-K frequent itemsets from data streams[J].Data Mining and Knowledge Discovery,2006,13(2):193-217.

共引文献87

同被引文献36

引证文献6

二级引证文献21

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部