2 articles found
The "拉表" (Meter-Pulling) Phenomenon at Catalytic Cracking Unit Monitoring Sites and Its Handling
1
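Authors: 李春明, 邢正宇, 陈志娟 《电脑学习》 1995, No. 1, pp. 14-16 (3 pages)
Drawing on many years of experience in field-signal processing for computer-based monitoring and optimization control of refinery catalytic cracking units, this paper describes solutions to the instrument "拉表" phenomenon and methods for acquiring field signals through a computer input interface.
Keywords: catalytic cracking management; monitoring; instruments; "拉表" phenomenon; computer control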
Hierarchical Reinforcement Learning Adversarial Algorithm Against Opponent with Fixed Offensive Strategy
2
Authors: 赵英策, 张广浩, +1 more author, 邢正宇, 李建勋 《Journal of Shanghai Jiaotong University (Science)》 EI 2024, No. 3, pp. 471-479 (9 pages)
Based on the option-critic algorithm, a new adversarial algorithm named deterministic policy network with option architecture is proposed to improve an agent's performance against an opponent with a fixed offensive algorithm. An option network is introduced in the upper-level design, which generates an activated signal from the defensive and offensive strategies according to the current situation. The lower-level executive layer then determines the interactive action under the guidance of the activated signal, and the values of both the activated signal and the interactive action are evaluated together by the critic structure. This method effectively relaxes the requirement of a semi-Markov decision process and simplifies the network structure by eliminating the termination-probability layer. Experimental results show that the new algorithm switches cleanly between offensive and defensive strategy styles and acquires more reward from the environment than the classical deep deterministic policy gradient algorithm.
Keywords: hierarchical reinforcement learning; fixed offensive strategy; option architecture; deterministic gradient policy