期刊文献+

基于再励学习的主动队列管理算法 被引量:7

A Robust Active Queue Management Algorithm Based on Reinforcement Learning
下载PDF
导出
摘要 从最优决策的角度出发,将人工智能中的再励学习方法引入主动队列管理的研究中,提出了一种基于再励学习的主动队列管理算法RLGD(reinforcementlearninggradient-descent).RLGD以速率匹配和队列稳定为优化目标,根据网络状态自适应地调节更新步长,使得队列长度能够很快收敛到目标值,并且抖动很小.此外,RLGD不需要知道源端的速率调整算法,因而具有很好的可扩展性.通过不同网络环境下的仿真显示,RLGD与REM,PI等AQM算法相比,具有更好的性能和鲁棒性. From the viewpoint of decision theory, AQM (active queue management) can be considered as an optimal decision problem. In this paper, a new AQM scheme, Reinforcement Learning Gradient-Descent (RLGD), is described based on the optimal decision theory of reinforcement learning. Aiming to maximize the throughput and stabilize the queue length, RLGD adjusts the update step adaptively, without the demand of knowing the rate adjustment scheme of the source sender. Simulation demonstrates that RLGD can lead to the convergence of the queue length to the desired value quickly and maintain the oscillation small. The results also show that the RLGD scheme is very robust to disturbance under various network conditions and outperforms the traditional REM and PI controllers significantly.
出处 《软件学报》 EI CSCD 北大核心 2004年第7期1090-1098,共9页 Journal of Software
基金 国家高技术研究发展计划(863)~~
关键词 拥塞控制 主动队列管理 再励学习 congestion control active queue management reinforcement learning
  • 相关文献

参考文献20

  • 1[1]Jacobson V, Karels MJ. Congestion avoidance and control. ACM SIGCOMM Computer Communication Review, 1988,18(4):314~329.
  • 2[2]Floyd S, Jacobson V. Random early detection gateways for congestion avoidance. IEEE/ACM Trans. on Networking, 1993,1(4):397~413.
  • 3[3]Floyd S. A report on some recent development in TCP congestion control. IEEE Communication Magazine, 2001,39(4):84~90.
  • 4[4]Christiansen M, Jeffay K, Ott D, Smith FD. Tuning RED for Web traffic. In: Proc. of the ACM SIGCOMM 2000. Stockholm: ACM Press, 2000. 139~150.
  • 5[5]Ott TJ, Lakshman TV, Wong LH. SRED: Stabilized RED. In: Proc. of the INFOCOM'99. New York: IEEE Communications Society, 1999. 1346~1355.
  • 6[6]Lin D, Morris R. Dynamics of random early detection. In: Proc. of the SIGCOMM'97. Cannes: ACM Press, 1997. 127~137.
  • 7[7]Anjum F, Tassiulas L. Balanced-RED: An algorithm to achieve fairness in Internet. http://www.isr.umd.edu/CSHCN/
  • 8[8]Feng W, Kandlur DD, Saha D, Shin KG. A self-configuring RED gateway. In: Proc. of the INFOCOM'99. New York: IEEE Communications Society, 1999. 1320~1328.
  • 9[9]Feng W, Kandlur DD, Saha D, Shin KG. Blue: A new class of active queue management algorithms. Technical Report, UM CSE-TR-387-99, 1999.
  • 10[10]Hollot CV, Misra V, Towsley D, Gong WB. A control theoretic analysis of RED. In: Proc. of the INFOCOM 2001. Anchorage, AK:IEEE Communications Society, 2001. 1510~1519.

同被引文献76

引证文献7

二级引证文献36

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部