强化学习方法在Web服务组合中的应用比较研究被引量：1

A COMPARATIVE STUDY ON THE APPLICATIONS OF REINFORCEMENT LEARNING METHODS IN WEB SERVICE COMPOSITION

下载PDF

导出

摘要为了提高服务组合适应动态环境的能力,将强化学习技术引入到Web服务组合。目前常用的强化学习方法有三种:蒙特卡罗、时序差分和Q-Learning,为了发现最适合于服务组合的强化学习方法,对这三种方法进行了对比研究。首先将Web服务组合建模为马尔科夫决策过程,然后介绍了这三种强化学习方法并分析了它们的异同,同时,提出了Web服务组合领域的奖赏值确定方法。最后,通过实验比较了这三种强化学习方法的学习效果,实验结果显示,在Web服务组合应用中,Q-Learning比另外两种方法收敛速度更快,因此更适合执行服务组合。 In order to improve the ability of service composition to be adaptive to the dynamic environment,this paper applies reinforcement learning（RL） to Web service composition（Wsc）.At present there are three commonly used RL methods： Monte Carlo,temporal difference and Q-Learning.The paper makes comparisons and studies among the three methods.Firstly Wsc is modeled with Markov Decision Process,then the above three RL methods are introduced and compared with each other.An approach to define reward in Wsc is also proposed.Finally experiments are carried out to compare effects of the three RL methods.Experiment results illustrate that the Q-Learning method is faster at convergence than the other two RL methods,so it is better fit for execution of service composition.

作者刘卫红周义莲

机构地区安徽工业大学计算机学院

出处《计算机应用与软件》 CSCD 2011年第7期128-131,共4页 Computer Applications and Software

基金安徽省教育厅重点资助项目(KJ2008A102)

关键词 WEB服务组合强化学习马尔科夫决策过程 Web service composition Reinforcement learning Markov Decision Process

分类号 TP393.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献12

1Milanovic N, Malek M. Current Solutions for Web Service Composition [ J]. 1EEE INTERNET COMPUTING. IEEE Computer Society,2004, 8(6) :51 -59.
2Seog Chan Oh, Dongwon Lee, Soundar R T Kumara. Effective Web Service Composition in Diverse and Large-Scale Service Networks[J]. IEEE Transactions on Services Computing,2008,1 (1) :15 -22.
3DongHoon Shin, KyongHo Lee, Tatsuya Suda. Automated generation of composite web services based on functional semantics [ J ]. J. Web Sem,2009,7 (4) :332-343.
4Daniela Berardi, Diego Calvanese, Giuseppe De Giacomo, et al. Auto- matic Service Composition Based on Behavioral Descriptions[ J]. Int. J. Cooperative Inf. Syst. 2005,14 (4) :333 - 376.
5Zeng L, Benatallah B, Ngu A H, et al. QoS-aware middleware for web services composition[ J ]. IEEE Transactions on Software Engineering, 2004,30(5) :311 -327.
6Danilo Ardagna, Barbara Pemici. Global and Local QoS Guarantee in Web Service Selection [ C ]//Business Process Management Work' shops ,2005:32 - 46.
7Danilo Ardagna,Barbara Pemici. Adaptive Service Composition in Flexi- ble Processes [ J ]. IEEE TRANSACTIONS ON SOFTWARE ENGI- NEERING,2007,33 (6) :369 - 384.
8高阳.强化学习研究进展[EB/OL].2004.http://cs.nju.edu.ca/gaoy/documents/Agent/RL.doc.
9Kaelbling L P, Littman M L, Moore A P. Reinforcement learning:A survey[J].J. Artif. Intell. Res. (JAIR),1996,4:237-285.
10Richard S Sutton, Andrew G Barto. Reinforcement Learning:An Introduction [ M ]. Cambridge, MA : MIT Press, 1998.

同被引文献4

1王皓,高阳,陈兴国.强化学习中的迁移:方法和进展[J].电子学报,2008,36(B12):39-43. 被引量：26
2刘春阳,谭应清,柳长安,马莹巍.多智能体强化学习在足球机器人中的研究与应用[J].电子学报,2010,38(8):1958-1962. 被引量：19
3刘全,傅启明,龚声蓉,伏玉琛,崔志明.最小状态变元平均奖赏的强化学习方法[J].通信学报,2011,32(1):66-71. 被引量：15
4韩道军,夏兰亭,卓汉逵,李磊.基于强化学习的业务流程中的柔性约束研究[J].计算机科学,2011,38(3):166-171. 被引量：2

引证文献1

1李冠峰,贺学剑,韩道军.强化学习在中职招生系统中的应用[J].计算机应用与软件,2013,30(4):252-254.

1王一飞,吴素芹,王榕.Web服务组合建模的研究[J].通信技术,2009,42(7):140-143. 被引量：6
2王一飞,吴素芹,王榕.基于图的Web服务组合的研究[J].微型机与应用,2010,29(1):41-43. 被引量：1
3李淑芝,彭洁,杨书新.基于着色Petri网的Web服务组合建模[J].江西理工大学学报,2009,30(6):30-33. 被引量：2
4杨志辉,王小民,张雄,许满武.一种新的具适应性的程序结构[J].计算机工程,2008,34(23):47-49. 被引量：1
5夏妍.基于扩展颜色Petri网的Web服务组合建模及应用[J].电脑知识与技术,2011,7(7X):5154-5156.
6李景霞,侯紫峰.基于颜色Petri网的Web服务组合建模及应用[J].计算机应用研究,2006,23(9):149-151. 被引量：12
7李嶒.基于Petri网的WEB服务组合建模及验证[J].宿州学院学报,2014,29(3):75-77.
8高君,包晓安,谢晓鸣,孙献策.基于广义随机Petri网的Web服务组合建模与可达性分析[J].电脑编程技巧与维护,2013(6):46-48.
9陈丁剑,吴健,马满福,胡正国.基于Petri网的Web服务组合建模[J].计算机科学,2006,33(5):128-130. 被引量：11
10陈学松,杨宜民.基于递推最小二乘法的多步时序差分学习算法[J].计算机工程与应用,2010,46(8):52-55. 被引量：5

计算机应用与软件

2011年第7期

浏览历史

内容加载中请稍等...

强化学习方法在Web服务组合中的应用比较研究被引量：1

参考文献12

同被引文献4

引证文献1

相关作者

相关机构

相关主题

浏览历史

强化学习方法在Web服务组合中的应用比较研究 被引量：1

参考文献12

同被引文献4

引证文献1

相关作者

相关机构

相关主题

浏览历史

强化学习方法在Web服务组合中的应用比较研究被引量：1