期刊文献+

基于数据流程变换的Mashup性能优化方法 被引量:1

Performance Optimization of Mashup Through Data Flow Transformation
下载PDF
导出
摘要 Mashup是一种流行的web2.0应用,由开发者将互联网上多个web数据源的数据进行聚合构建而成.大多数mashup工具支持通过可视化的数据流程设计来开发mashup,但是缺少编程经验的终端用户设计的数据流程可能执行效率很低,当处理较大规模数据时mashup的响应时间会大幅增加.本文研究如何通过数据处理操作的合并拆分、次序交换、并行化等技术实现mashup的数据流程优化,提高mashup的性能及可扩展性.本文提出一种新的mashup性能优化方法,对多样化的mashup组件标注其操作语义特征属性及代价模型,定义适用于mashup的流程变换规则,针对用户设计的mashup数据流程生成所有与其语义等价的流程,并提出算法建立流程之间的代价偏序关系图从而快速选择执行代价最小的流程.文中实现了一个mashup工具,实验表明该方法可以有效提高终端用户设计的mashup的执行效率. Mashup is a new kind of web2.0 applications createxi by aggregating and manipulating data from several web data sources. Mashup tools usually support visually designing data flows to create mashup. Because mashup developers are of varying degrees of technical expertise, the data flows may be of high cost because of inefficient design. This will definitely increase the response time and impair the QoS of mashup. In this paper, we target on enhancing the performance of mashup base on data flow transformation techniques such as operator merging, operator swapping, and operator parallelism. A new optimization method is presented for mash- up, which models a mashup as a data flow graph, annotates operation semantics features and cost model for mashup component, generates semantics equivalent data flows by transforming rules and construct a partially ordered diagram based on the cost of these data flows for quickly optimal selection. Key implementation techniques are provided and efficiency improvement of mashup is demonstrated by experiments.
出处 《小型微型计算机系统》 CSCD 北大核心 2011年第9期1716-1722,共7页 Journal of Chinese Computer Systems
基金 国家"九七三"重点基础研究发展计划项目(2009CB320704)资助 国家"八六三"高技术研究发展计划项目(2007AA010301)资助 国家"核高基"重大专项项目(2009ZX01043-003-002)资助
关键词 MASHUP 数据流程 WEB2.0 WEB服务 性能优化 mashup data flow web2.0 web service performance optimization
  • 相关文献

参考文献13

  • 1Giusy D L, Hakim H, Hye-young P, et al. Data integration in mashups[J]. SIGMOD Record, 2009, 38(1): 59-66.
  • 2Yahoo Inc. Yahoo pipes [ EB/OL]. http://pipes. yahoo. com, Apr. 2010.
  • 3Alkis S, Panos V, Timos S. State-space optimization of ETL work- flows[ J]. IEEE Transactions on Knowledge and Data Engineer- ing, 2005, 17(10) : 1404-1419.
  • 4Alkis S, Kevin W, Umeshwar D, et al. Optimizing ETL workflows for fault-tolerance[ A]. In: Proceedings of IEEE International Con- ference on Data Engineering[C] , New York: IEEE Press, 2010: 385-396.
  • 5Serge A, Ohad G, Tova M. Modeling the mashup space [ A ]. In: Proceeding of the 10th ACM Workshop on Web Information and Data Management[ C ], New York: ACM, 2008: 87-94.
  • 6Serge A, Ohad G, Tova M, et al. Matchup: autocompletion for mashups[ A]. In: Proceedings of IEEE International Conference on Data Engineering[ C], New York: IEEE Press, 2009: 1479-1482.
  • 7Biorm B, Cesare P. Let it flow: building mashups with data pro- eessing pipelines[ A]. In: Proceedings of Servioe-Oriented Com- puting - ICSOC 2007 Workshops[ C], Berlin, Heidelberg: Spring- er-Verlag, 2007 : 15-28.
  • 8Osama A H,Lakshmish R,John A M. MACE: a dynamic caching framework for mashups[ C]. In: Proceedings of IEEE International Conference on Web Service. Washington, DC, USA: IEEE Com- puter Society, 2009 : 75-82.
  • 9Eric W, Peng L, Brett C. Web service mashup middleware with partitioning of xml pipelines[ C]. In: Proceedings of IEEE Interna- tional Conference on Web Service. Washington, DC, USA: IEEE Computer Society, 2009: 91-98.
  • 10Dong L, Ralph D. The reverse C10K problem for server-side mashups[ A ]. In: Proceedings of ICSOC 2008 International Workshops[ C], Berlin, Heidelberg: Springer-Verlag, 2008: 166-177.

同被引文献1

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部