期刊文献+

强化学习方法在Web服务组合中的应用比较研究 被引量:1

A COMPARATIVE STUDY ON THE APPLICATIONS OF REINFORCEMENT LEARNING METHODS IN WEB SERVICE COMPOSITION
下载PDF
导出
摘要 为了提高服务组合适应动态环境的能力,将强化学习技术引入到Web服务组合。目前常用的强化学习方法有三种:蒙特卡罗、时序差分和Q-Learning,为了发现最适合于服务组合的强化学习方法,对这三种方法进行了对比研究。首先将Web服务组合建模为马尔科夫决策过程,然后介绍了这三种强化学习方法并分析了它们的异同,同时,提出了Web服务组合领域的奖赏值确定方法。最后,通过实验比较了这三种强化学习方法的学习效果,实验结果显示,在Web服务组合应用中,Q-Learning比另外两种方法收敛速度更快,因此更适合执行服务组合。 In order to improve the ability of service composition to be adaptive to the dynamic environment,this paper applies reinforcement learning(RL) to Web service composition(Wsc).At present there are three commonly used RL methods: Monte Carlo,temporal difference and Q-Learning.The paper makes comparisons and studies among the three methods.Firstly Wsc is modeled with Markov Decision Process,then the above three RL methods are introduced and compared with each other.An approach to define reward in Wsc is also proposed.Finally experiments are carried out to compare effects of the three RL methods.Experiment results illustrate that the Q-Learning method is faster at convergence than the other two RL methods,so it is better fit for execution of service composition.
出处 《计算机应用与软件》 CSCD 2011年第7期128-131,共4页 Computer Applications and Software
基金 安徽省教育厅重点资助项目(KJ2008A102)
关键词 WEB服务组合 强化学习 马尔科夫决策过程 Web service composition Reinforcement learning Markov Decision Process
  • 相关文献

参考文献12

  • 1Milanovic N, Malek M. Current Solutions for Web Service Composition [ J]. 1EEE INTERNET COMPUTING. IEEE Computer Society,2004, 8(6) :51 -59.
  • 2Seog Chan Oh, Dongwon Lee, Soundar R T Kumara. Effective Web Service Composition in Diverse and Large-Scale Service Networks[J]. IEEE Transactions on Services Computing,2008,1 (1) :15 -22.
  • 3DongHoon Shin, KyongHo Lee, Tatsuya Suda. Automated generation of composite web services based on functional semantics [ J ]. J. Web Sem,2009,7 (4) :332-343.
  • 4Daniela Berardi, Diego Calvanese, Giuseppe De Giacomo, et al. Auto- matic Service Composition Based on Behavioral Descriptions[ J]. Int. J. Cooperative Inf. Syst. 2005,14 (4) :333 - 376.
  • 5Zeng L, Benatallah B, Ngu A H, et al. QoS-aware middleware for web services composition[ J ]. IEEE Transactions on Software Engineering, 2004,30(5) :311 -327.
  • 6Danilo Ardagna, Barbara Pemici. Global and Local QoS Guarantee in Web Service Selection [ C ]//Business Process Management Work' shops ,2005:32 - 46.
  • 7Danilo Ardagna,Barbara Pemici. Adaptive Service Composition in Flexi- ble Processes [ J ]. IEEE TRANSACTIONS ON SOFTWARE ENGI- NEERING,2007,33 (6) :369 - 384.
  • 8高阳.强化学习研究进展[EB/OL].2004.http://cs.nju.edu.ca/gaoy/documents/Agent/RL.doc.
  • 9Kaelbling L P, Littman M L, Moore A P. Reinforcement learning:A survey[J].J. Artif. Intell. Res. (JAIR),1996,4:237-285.
  • 10Richard S Sutton, Andrew G Barto. Reinforcement Learning:An Introduction [ M ]. Cambridge, MA : MIT Press, 1998.

同被引文献4

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部