基于Q学习的适应性进化规划算法被引量：5

An Adaptive Evolutionary Programming Algorithm Based on Q Learning

下载PDF

导出

摘要进化规划中,个体选择变异策略特别重要.适应性变异策略因在进化过程中动态选择个体变异策略,能够取得较好的性能.传统适应性变异策略都依据个体一步进化效果考察个体适应性,没有从多步进化效果上对变异策略进行评价.本文提出一种新的基于Q学习的适应性进化规划算法QEP(Q learning based evolutionary programming),该算法将变异策略看成行动,考察个体多步进化效果,并通过计算Q函数值,学习个体最优变异策略.实验表明,QEP能够获得好的性能. Selection of mutation strategies plays an important role in evolutionary programming, and adaptively selecting a mutation strategy in each evolutionary step can achieve good performance. A mutation strategy is evaluated and selected only based on the one-step performance of mutation operators in classical adaptive evolutionary programming, and the performance of mutation operators in the delayed mutation steps is ignored. This paper proposes a novel adaptive mutation strategy based on Q learning-- QEP （Q learning based evolutionary program- ming）. In this algorithm, several candidate mutation operators are used and each is considered as an action. The evolutionary performance of delayed mutation steps is considered in calculating the Q values for each mutation operator and the mutation operator that maximizes the learned Q values is the optimal one. Experimental results show that the proposed mutation strategy achieves better performance than the existing algorithms.

作者张化祥陆晶

机构地区山东师范大学计算机系山东财政学院计算机系

出处《自动化学报》 EI CSCD 北大核心 2008年第7期819-822,共4页 Acta Automatica Sinica

基金国家自然科学基金(90612003) 山东省中青年科学家科研奖励基金(2006BS01020) 山东省自然科学基金(Y2007G16)资助~~

关键词进化规划变异策略 Q学习收益 Evolutionary programming, mutation strategy, Q learning, reward

分类号 TP18 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献10

1Fogel L J, Owens A J, Walsh M J. Artificial Intelligence Through Simulated Evolution: Forty Years of Evolutionary Programming. New York: Wiley-Interscience, 1999.
2Yao X, Liu Y, Lin G M. Evolutionary programming made faster. IEEE Transactions on Evolutionary Computation, 1999, 3(2): 82-102.
3Lee C Y, Yao X. Evolutionary programming using mutations based on the Levy probability distribution. IEEE Transactions on Evolutionary Computation, 2004, 8(1): 1-13.
4Ji M J, Tang H W, Guo J. A single-point mutation evolutionary programming. Information Processing Letters, 2004, 90(6): 293-299.
5Dong H, He J, Huang H, Hou W. Evolutionary programming using a mixed mutation strategy IOnline], available: http://www.cs.bham.ac.uk/jxh/hejunpl.html, December 20, 2006.
6Fogel D B. Evolving Artificial Intelligence [Ph.D. dissertation].California, USA: University of California. 1992.
7Iwamatsu M. Generalized evolutionary programming with Levy-type mutation. Computer Physics Communications, 2002, 147(1): 729-732.
8Lee S H, Jun H B, Sim K B. Performance improvement of evolution strategies using reinforcement learning. In: Proceedings of IEEE International Fuzzy Systems Conference. Seoul, Korea: IEEE, 1999. 639-644.
9刘习春,喻寿益.局部快速微调遗传算法[J].计算机学报,2006,29(1):100-105. 被引量：37
10Sutton R S, Barto A C. Reinforcement Learning: An Introduction. Cambridge: MIT Press, 1998.

二级参考文献10

1Rowlins G. ed.. Foundations of Genetic Algorithm. Los Altos: Morgan Kanfmann, 1991.
2Powll D. , Tong S. , Skolnik M.. Domain independent machine for design optimization. In: Proceedings of the AAAI-90,George Mason University, USA, 1989, 151-159.
3Cho S. B.. Combining modular neural networks developed by evolutionary algorithm. In: Proceedings of the 1997 IEEE International Conference on Evolutionary Computation, Indianapolis, 1997, 647-650.
4Zhao Q. F. , Arlo, Study on Co-evolutionary Learning of Neural Networks. Heidelberg: Springer-Verlag, 1997.
5Michalewicz Z. et. al. eds.. In: Proceeding of the 1st International Conference on Evolutionary Computation (ICEC' 94),Orlando, Florida, USA, 1994, 665-669.
6Goldberg D. E.. Real-coded genetic algorithms, virtual alphabets, and blocking. University of Illinois at Urbana-Champaign: Technical Report No. 90001,1990.
7Holland J. H.. Adaptation in Natural and Artificial Systems.Ann Arbor: The University of Michigan Press, 1975.
8Belew R. , Booker L.. Proceedings of the 4th International Conference on Genetic Algorithms. Los Altos, CA: Morgan Kaufmann Publishers, 1991.
9Whitley D. , Mathias K. , Fitzhorn P.. Delta Coding: An Iterative Search Strategy for Genetic Algorithms. Los Altos, Morgan Kaufmann Publishers, 1991, 77-84.
10Michalewicz Z.. Genetic Algorithms+ Delta Strucures= Evolution Programs. Berlin Heidelberg: Springer-Verlag, 1996.

共引文献36

1彭阳,廖子贞.遗传算法在入侵检测中的应用研究[J].科技资讯,2008,6(24):22-23.
2王朝辉,张伟丰.基于混合编码的多智能体遗传算法[J].武汉科技大学学报,2006,29(6):603-606. 被引量：1
3李海滨.自适应局部微调遗传算法[J].电机与控制学报,2007,11(2):191-195. 被引量：8
4邵平凡,万程鹏.求解全局优化问题的遗传退火算法[J].计算机工程与应用,2007,43(12):62-65. 被引量：13
5张光卫,康建初,李鹤松,李德毅.基于云模型的全局最优化算法[J].北京航空航天大学学报,2007,33(4):486-490. 被引量：37
6高永超,李歧强.网络拓扑进化算法[J].计算机工程与应用,2007,43(27):91-94.
7李尊朝,张瑞智,张效娟,林尧.基于遗传算法的亚100nm SOI MOSFET模型参数提取[J].电子学报,2007,35(11):2033-2037. 被引量：3
8符国庆,徐维祥,李晓争.一种随机搜索优化算法——网鱼算法[J].北京交通大学学报,2007,31(6):123-127. 被引量：3
9张伟丰.基于进化计算的薄板冷连轧轧制规程优化系统设计与应用[J].湖北汽车工业学院学报,2007,21(4):29-33.
10杨兴春,李进.一种基于多种群隔代融合的遗传算法[J].计算机与数字工程,2008,36(5):30-32. 被引量：1

同被引文献45

1韩江洪,李正荣,魏振春.一种自适应粒子群优化算法及其仿真研究[J].系统仿真学报,2006,18(10):2969-2971. 被引量：122
2胡建秀,曾建潮.微粒群算法中惯性权重的调整策略[J].计算机工程,2007,33(11):193-195. 被引量：62
3Kennedy J, Eberhart R C. Particle swarm optimization[C]. Proc of the IEEE Int Conf on Neural Network. Perth: IEEE Inc, 1995: 1942-1948.
4Shi Y, Eberhart R C. A modified particle swarm optimizer[C]. IEEE World Conf on Computational Intelligence. Piscataway: IEEE Press, 1998: 69-73.
5Shi Y, Eberhart R C. Fuzzy adaptive particle swarm optimization[C]. Proc of the IEEE Conf on Evolutionary Computation. Piscataway: 1EEE Press, 2001: 101-106.
6Zhang L P, Yu H J, Hu S X. A new approach to improve particle swarm optimization[C]. Lecture Notesin Computer Science. Chicago: Springer-Verlag, 2003: 134-139.
7Zhang H X, Lu J. Adaptive evolutionary programming based on reinforcement learning[J]. Information Sciences, 2008, 178(4): 971-984.
8Chatterjee A, Siarry E Nonlinear inertia weight variation for dynamic adaptation in particle swarm optimization[J]. Computers and Operations Research, 2006, 33(3): 859- 871.
9Sutton R S, Barto A G. Reinforcement learning: An introduction[M]. Cambridge: MIT Press, 1998.
10刘建华,樊晓平,瞿志华.一种基于相似度的新型粒子群算法[J].控制与决策,2007,22(10):1155-1159. 被引量：19

引证文献5

1邢长明,刘方爱.基于强化学习的适应性微粒群算法[J].控制与决策,2011,26(1):54-58. 被引量：4
2丁彬楚,汤洪涛.面向作业车间重调度的改进合同网机制研究[J].机电工程,2013,30(2):147-151.
3盛歆漪,孙俊,周頔,须文波.一种Q学习的量子粒子群优化方法[J].计算机工程与应用,2014,50(21):8-13. 被引量：4
4于金亮,涂山山,孟远.移动雾计算中基于强化学习的伪装攻击检测算法[J].计算机工程,2020,46(1):38-44. 被引量：5
5王君逸,王志,李华雄,陈春林.基于自适应噪声的最大熵进化强化学习方法[J].自动化学报,2023,49(1):54-66. 被引量：2

二级引证文献15

1魏赟,邵清.基于Q-学习和粒子群算法的区域交通控制模型[J].系统仿真学报,2011,23(10):2108-2111. 被引量：5
2曾现峰,张勇.基于动态邻域和自适应惯性权重的微粒群算法[J].计算机工程与设计,2013,34(5):1817-1821.
3柯文德,彭志平,蔡则苏,陈珂.仿人机器人相似性阶梯行走约束与优化控制[J].机器人,2014,36(2):233-240. 被引量：1
4关学忠,皇甫旭,李欣,佟宇,聂品磊.基于正态云模型的自适应变异量子粒子群优化算法[J].电子设计工程,2016,24(8):64-67. 被引量：10
5杨杰,万仁霞,刘楷.基于相似度的改进粒子群优化算法[J].计算机工程与应用,2016,52(17):49-53.
6张晓芳,谢俊.可控家用电器负荷优化模型及用电策略研究[J].计算机工程与应用,2016,52(24):246-250. 被引量：3
7朱兰婷,孙丽珺,闫杨.车辆雾计算中基于反向拍卖的停车辅助方案[J].计算机工程,2020,46(7):14-20.
8张小峰,秦丽娜.基于用户特征码的网络信息RSA密钥生成算法[J].信息与电脑,2021,33(10):60-62.
9周睿.一种基于Stackelberg博弈的雾计算风险管理[J].信息技术,2021,45(9):62-68.
10Ling Wang,Zixiao Pan,Jingjing Wang.A Review of Reinforcement Learning Based Intelligent Optimization for Manufacturing Scheduling[J].Complex System Modeling and Simulation,2021,1(4):257-270. 被引量：27

1易云飞,陈国鸿.基于k-means的改进粒子群算法求解TSP问题[J].微计算机信息,2012(9):475-477. 被引量：5
2王兰春.基于统计的关系数据库查询优化器模型分析与研究[J].现代计算机,2011,17(11):13-16. 被引量：2
3Jun Wei,Zhenaiun Pan,Lishang Kang(State Key Lab of Software Engincering, Wuhan UniversityWuhan 430072, P.R. China).Dynamic Behavior Modeling in Multi-Agent System By Evolutionary Programming[J].Wuhan University Journal of Natural Sciences,1996,1(Z1):651-657.
4赵季红,丁小婷,王炜,曲桦.基于用户个体的混合CoMP模式选择算法[J].电信科学,2015,31(3):54-60.
5崔敏.论遗传算法在旅行商问题中的应用[J].办公自动化（综合月刊）,2011(4):50-51.
6胡乃静,罗远,胡金华.基于Petri网的查询计划模型适应性进化的一致性保证[J].计算机应用与软件,2007,24(7):135-137.
7沈掌泉,孔繁胜.基于个体选择的动态权重神经网络集成方法研究[J].计算机工程与应用,2005,41(12):8-11. 被引量：2
8郭李艳,何萍,李美莲.一种应用TMS320F2812和编码器测量电机转速的方法[J].桂林航天工业高等专科学校学报,2007,12(3):13-15. 被引量：5
9欧阳航空,陆林海,侯彦丽.基于DSP的光栅莫尔条纹信号辨向与细分电路研究[J].制造业自动化,2005,27(5):5-7. 被引量：9
10Ren, Qingsheng, Zeng, Jin, Qi, Feihu.Evolutionary Programming for IP/MIP Problems with Linear Constraints[J].Journal of Systems Engineering and Electronics,2000,11(3):59-64. 被引量：2

自动化学报

2008年第7期

浏览历史

内容加载中请稍等...

基于Q学习的适应性进化规划算法被引量：5

参考文献10

二级参考文献10

共引文献36

同被引文献45

引证文献5

二级引证文献15

相关作者

相关机构

相关主题

浏览历史

基于Q学习的适应性进化规划算法 被引量：5

参考文献10

二级参考文献10

共引文献36

同被引文献45

引证文献5

二级引证文献15

相关作者

相关机构

相关主题

浏览历史

基于Q学习的适应性进化规划算法被引量：5