Parallel Reinforcement Learning-Based Energy Efficiency Improvement for a Cyber-Physical System 被引量：17

Parallel Reinforcement Learning-Based Energy Efficiency Improvement for a Cyber-Physical System

下载PDF

导出

摘要 As a complex and critical cyber-physical system(CPS),the hybrid electric powertrain is significant to mitigate air pollution and improve fuel economy.Energy management strategy(EMS)is playing a key role to improve the energy efficiency of this CPS.This paper presents a novel bidirectional long shortterm memory(LSTM)network based parallel reinforcement learning(PRL)approach to construct EMS for a hybrid tracked vehicle(HTV).This method contains two levels.The high-level establishes a parallel system first,which includes a real powertrain system and an artificial system.Then,the synthesized data from this parallel system is trained by a bidirectional LSTM network.The lower-level determines the optimal EMS using the trained action state function in the model-free reinforcement learning(RL)framework.PRL is a fully data-driven and learning-enabled approach that does not depend on any prediction and predefined rules.Finally,real vehicle testing is implemented and relevant experiment data is collected and calibrated.Experimental results validate that the proposed EMS can achieve considerable energy efficiency improvement by comparing with the conventional RL approach and deep RL. As a complex and critical cyber-physical system (CPS),the hybrid electric powertrain is significant to mitigate air pollution and improve fuel economy.Energy management strategy(EMS) is playing a key role to improve the energy efficiency of this CPS.This paper presents a novel bidirectional long shortterm memory (LSTM) network based parallel reinforcement learning (PRL) approach to construct EMS for a hybrid tracked vehicle (HTV).This method contains two levels.The high-level establishes a parallel system first,which includes a real powertrain system and an artificial system.Then,the synthesized data from this parallel system is trained by a bidirectional LSTM network.The lower-level determines the optimal EMS using the trained action state function in the model-free reinforcement learning (RL)framework.PRL is a fully data-driven and learning-enabled approach that does not depend on any prediction and predefined rules.Finally,real vehicle testing is implemented and relevant experiment data is collected and calibrated.Experimental results validate that the proposed EMS can achieve considerable energy efficiency improvement by comparing with the conventional RL approach and deep RL.

作者 Teng Liu Bin Tian Yunfeng Ai Fei-Yue Wang

机构地区 Department of Automotive Engineering the Vehicle Intelligence Pioneers Inc. the State Key Laboratory of Management and Control for Complex Systems the School of Artificial Intelligence

出处《IEEE/CAA Journal of Automatica Sinica》 EI CSCD 2020年第2期617-626,共10页 自动化学报（英文版）

基金 supported in part by the National Natural Science Foundation of China(61533019,91720000) Beijing Municipal Science and Technology Commission(Z181100008918007) the Intel Collaborative Research Institute for Intelligent and Automated Connected Vehicles(pICRI-IACVq)

关键词 Bidirectional long short-term memory(LSTM)network cyber-physical system(CPS) energy management parallel system reinforcement learning(RL) Bidirectional long short-term memory(LSTM) network cyber-physical system(CPS) energy management parallel system reinforcement learning(RL)

分类号 U469.7 [机械工程—车辆工程]

引文网络
相关文献

参考文献3

1王飞跃.人工社会、计算实验、平行系统——关于复杂社会经济系统计算研究的讨论[J].复杂系统与复杂性科学,2004,1(4):25-35. 被引量：234
2Fei-Yue Wang.Control 5.0: From Newton to Merton in Popper's Cyber-Social-Physical Spaces[J].IEEE/CAA Journal of Automatica Sinica,2016,3(3):233-234. 被引量：13
3Li Li,Yilun Lin,Nanning Zheng,Fei-Yue Wang.Parallel Learning:a Perspective and a Framework[J].IEEE/CAA Journal of Automatica Sinica,2017,4(3):389-395. 被引量：36

二级参考文献45

1王飞跃,李乐飞,黄星,邹余敏.关于长周期连续安全节能有效生产基础理论的探讨[J].计算机与应用化学,2007,24(12):1711-1713. 被引量：16
2王飞跃.人工社会、计算实验、平行系统——关于复杂社会经济系统计算研究的讨论[J].复杂系统与复杂性科学,2004,1(4):25-35. 被引量：234
3王飞跃,汤淑明.人工交通系统的基本思想与框架体系[J].复杂系统与复杂性科学,2004,1(2):52-59. 被引量：40
4王飞跃.平行系统方法与复杂系统的管理和控制[J].控制与决策,2004,19(5):485-489. 被引量：333
5王飞跃.计算实验方法与复杂系统行为分析和决策评估[J].系统仿真学报,2004,16(5):893-897. 被引量：147
6王飞跃.关于复杂系统研究的计算理论与方法[J].中国基础科学,2004,6(5):3-10. 被引量：97
7玻恩李宝恒译.我的一生和我的观点[M].北京:商务印书馆,1979..
8[26]王飞跃.从一无所有到万象所归:人工社会与复杂系统研究[N].科学时报(纵横版),2003-03-17.
9[3]Kydland F E, Prescott E C. The Computational Experiment: An Econometric Tool[J]. Journal of Economic Perspectives, 1996, 10(1): 69-85.
10[5]Shoven J B, Whalley J. A General Equilibrium Calculation of the Differential Taxation of Income from Capital in the US[J]. Journal of Public Economics, 1972, 1: 281-321.

共引文献267

1李浥东,张俊,陶耀东,王伟,顾元祥,王飞跃.平行安全:基于CPSS的生成式对抗安全智能系统[J].智能科学与技术学报,2020(2):194-202. 被引量：7
2郭超,鲁越,林懿伦,卓凡,王飞跃.平行艺术:人机协作的艺术创作[J].智能科学与技术学报,2019,0(4):335-341. 被引量：14
3吕宜生,王飞跃,张宇,张晓东.虚实互动的平行城市:基本框架、方法与应用[J].智能科学与技术学报,2019,1(3):311-317. 被引量：15
4白天翔,沈震,刘雅婷,董西松.平行机器:一种智能机器的管理与控制框架[J].智能科学与技术学报,2019,0(2):181-191. 被引量：5
5付朝博,蔡卓函,冯琦琦,亓鹏程.装备体系平行试验基本概念及流程设计[J].装甲兵学报,2022(3):50-55.
6刘建军,王磊,刘希未,马龙江.生产车间物流平行系统体系研究[J].兰州大学学报（自然科学版）,2018,54(5):698-704. 被引量：3
7谷雨,聂帅.基于平行仿真的无人集群试验管控方法[J].中国电子科学研究院学报,2023,18(5):461-468.
8杜晓明.空天海地一体化观测网络的任务管理探讨与展望[J].科技促进发展,2020,16(2):184-191. 被引量：2
9谢成龙,田露,刘培邦,李进,汤晨瑾,马斌,聂文.核电厂DCS平行系统研发与应用[J].电子技术应用,2023,49(S01):218-223.
10张凯.软件质量形成的复杂性分析[J].复杂系统与复杂性科学,2006,3(4):19-27. 被引量：3

同被引文献174

1王飞跃,王艳芬,陈薏竹,田永林,齐红威,王晓,张卫山,张俊,袁勇.联邦生态:从联邦数据到联邦智能[J].智能科学与技术学报,2020,2(4):305-311. 被引量：31
2郭超,鲁越,林懿伦,卓凡,王飞跃.平行艺术:人机协作的艺术创作[J].智能科学与技术学报,2019,0(4):335-341. 被引量：14
3杨超,高玉,艾云峰,田滨,陈龙,王健,王飞跃.端对端平行无人矿山系统及其关键技术[J].智能科学与技术学报,2019,1(3):228-240. 被引量：14
4刘腾,王晓,邢阳,高玉,田滨,陈龙.基于数字四胞胎的平行驾驶系统及应用[J].智能科学与技术学报,2019,0(1):40-51. 被引量：14
5郑南宁.人工智能新时代[J].智能科学与技术学报,2019,0(1):1-3. 被引量：64
6陈振宇,刘金波,李晨,季晓慧,李大鹏,黄运豪,狄方春,高兴宇,徐立中.基于LSTM与XGBoost组合模型的超短期电力负荷预测[J].电网技术,2020,44(2):614-620. 被引量：228
7王飞跃,曹东璞,李升波,邢阳,郭洪艳,吕宜生,李力,吴甘沙.自动驾驶技术的挑战与展望[J].电子科学技术,2018,0(6):111-119. 被引量：9
8王飞跃.人工社会、计算实验、平行系统——关于复杂社会经济系统计算研究的讨论[J].复杂系统与复杂性科学,2004,1(4):25-35. 被引量：234
9Dimitri P.BERTSEKAS.Approximate policy iteration:a survey and somenew methods[J].控制理论与应用（英文版）,2011,9(3):310-335. 被引量：6
10王飞跃,汤淑明.人工交通系统的基本思想与框架体系[J].复杂系统与复杂性科学,2004,1(2):52-59. 被引量：40

引证文献17

1Xin Huang,Jiuxiang Dong.Learning-Based Switched Reliable Control of Cyber-Physical Systems With Intermittent Communication Faults[J].IEEE/CAA Journal of Automatica Sinica,2020,7(3):711-724. 被引量：1
2Ali Forootani,Raffaele Iervolino,Massimo Tipaldi,Joshua Neilson.Approximate Dynamic Programming for Stochastic Resource Allocation Problems[J].IEEE/CAA Journal of Automatica Sinica,2020,7(4):975-990. 被引量：4
3Mohammad Al-Sharman,David Murdoch,Dongpu Cao,Chen Lv,Yahya Zweiri,Derek Rayside,William Melek.A Sensorless State Estimation for A Safety-Oriented Cyber-Physical System in Urban Driving:Deep Learning Approach[J].IEEE/CAA Journal of Automatica Sinica,2021,8(1):169-178. 被引量：3
4Xing Yang,Lei Shu,Jianing Chen,Mohamed Amine Ferrag,Jun Wu,Edmond Nurellari,Kai Huang.A Survey on Smart Agriculture:Development Modes,Technologies,and Security and Privacy Challenges[J].IEEE/CAA Journal of Automatica Sinica,2021,8(2):273-302. 被引量：13
5陈龙,王晓,杨健健,艾云峰,田滨,李宇宸,滕思宇,王健,曹东璞,葛世荣,王飞跃.平行矿山:从数字孪生到矿山智能[J].自动化学报,2021,47(7):1633-1645. 被引量：55
6Yanni Wan,Jiahu Qin,Xinghuo Yu,Tao Yang,Yu Kang.Price-Based Residential Demand Response Management in Smart Grids:A Reinforcement Learning-Based Approach[J].IEEE/CAA Journal of Automatica Sinica,2022,9(1):123-134. 被引量：2
7Mohamed Amine Ferrag,Lei Shu,Othmane Friha,Xing Yang.Cyber Security Intrusion Detection for Agriculture 4.0: Machine Learning-Based Solutions, Datasets,and Future Directions[J].IEEE/CAA Journal of Automatica Sinica,2022,9(3):407-436. 被引量：1
8Majid Mazouchi,Subramanya Nageshrao,Hamidreza Modares.Conflict-Aware Safe Reinforcement Learning:A Meta-Cognitive Learning Framework[J].IEEE/CAA Journal of Automatica Sinica,2022,9(3):466-481. 被引量：2
9赖晨光,伍朝兵,李家曦,孙友长,胡博.并行深度强化学习的柴油机动力系统VGT智能控制[J].重庆理工大学学报（自然科学）,2022,36(6):302-308.
10Yue Ming,Nannan Hu,Chunxiao Fan,Fan Feng,Jiangwan Zhou,Hui Yu.Visuals to Text:A Comprehensive Review on Automatic Image Captioning[J].IEEE/CAA Journal of Automatica Sinica,2022,9(8):1339-1365. 被引量：4

二级引证文献103

1郭一楠,杨帆,葛世荣,黄遥,尤秀松.知识驱动的智采数字孪生主动管控模式[J].煤炭学报,2023,48(S01):334-344. 被引量：5
2王岩,张旭辉,曹现刚,赵友军,杨文娟,杜昱阳,石硕.掘进工作面数字孪生体构建与平行智能控制方法[J].煤炭学报,2022,47(S01):384-394. 被引量：12
3孔若琪,崔琳,董勇.机器学习算法在脱硫系统智能运行及优化中的应用[J].洁净煤技术,2023,29(S02):406-414.
4曹博,吕明家,汪帅,赵波,李青怡,刘光伟.不规则境界露天矿剥离物动态规划研究[J].辽宁工程技术大学学报（自然科学版）,2023(4):427-437.
5Othmane Friha,Mohamed Amine Ferrag,Lei Shu,Leandros Maglaras,Xiaochan Wang.Internet of Things for the Future of Smart Agriculture: A Comprehensive Survey of Emerging Technologies[J].IEEE/CAA Journal of Automatica Sinica,2021,8(4):718-752. 被引量：21
6黄凯,舒磊,李凯亮,杨星,朱艳,汪小旵,苏勤.太阳能杀虫灯物联网节点的防盗防破坏设计及展望[J].智慧农业（中英文）,2021,3(1):129-143. 被引量：1
7桑健,周婷,金彦亮.D2D通信中信道分配的智能优化算法研究[J].工业控制计算机,2021,34(7):117-119. 被引量：2
8王晓军,程禹人.探讨煤矿矿山安全的特殊性与安全管理[J].科技创新导报,2021,18(16):100-102.
9李海,李谊骏,陈诗果,杨谋.苹果树病虫害智能识别系统设计与实现[J].科学技术与工程,2021,21(25):10639-10645. 被引量：6
10陈晓红,张威威,易国栋,唐湘博.新一代信息技术驱动下资源环境协同管理的理论逻辑及实现路径[J].中南大学学报（社会科学版）,2021,27(5):1-10. 被引量：14

1Mohamed Zaher,Sabri Cetinkunt.Fuel Saving and Control for Hybrid Electric Powertrains[J].Energy and Power Engineering,2013,5(5):343-351.
2Call for Contributions The 3^rd International Conference on Data-driven Knowledge Discovery[J].Journal of Data and Information Science,2019,4(4):96-96.
3发射消息[J].中国航天,2019,0(11):86-87.
4PENG Xiangyang,FANG Pengfei.Surface Structure of Aged Composite Insulator Studied by Slow Positron Beam[J].Journal of Wuhan University of Technology(Materials Science),2019,34(5):1008-1012. 被引量：1
5Bernd Teufel,Anton Sentic,Mathias Barmet.Blockchain Energy: Blockchain in Future Energy Systems[J].Journal of Electronic Science and Technology,2019,17(4):317-331. 被引量：5
6Yong Hak Kim,Jeon Yeol Han,You Jin Lee,Yong Ho An,In Jun Song.Development of IEC61850 Based Substation Engineering Tools with IEC61850 Schema Library[J].Smart Grid and Renewable Energy,2011,2(3):271-277. 被引量：1
7G.R.Venkatakrishnan,R.Rengaraj,S.Salivahanan.Grey Wolf Optimizer to Real Power Dispatch with Non-Linear Constraints[J].Computer Modeling in Engineering & Sciences,2018(4):25-45.
8PEI JiaZheng,SU YiXin,ZHANG DanHong,QI Yue,LENG ZhiWen.Velocity forecasts using a combined deep learning model in hybrid electric vehicles with V2V and V2I communication[J].Science China(Technological Sciences),2020,63(1):55-64. 被引量：7
9Filip Milojkovic,Fernando Zuniga,Arash Zandi,Knuth Posern,Erol Uen.The Quantification and Reporting of Negawatt-Hours with Flexible Energy Conservation Measure Verification Software (ECM-Tool)[J].Open Journal of Energy Efficiency,2019,8(4):179-201.
10姜丽,乔延国,刘明贺.血管紧张素受体脑啡肽酶抑制剂对心力衰竭患者心功能及B型利钠肽、一氧化氮表达的影响[J].中华保健医学杂志,2020,22(1):30-33. 被引量：13

IEEE/CAA Journal of Automatica Sinica

2020年第2期

浏览历史

内容加载中请稍等...