期刊文献+

供应链分销系统奖金优化与仿真分析

Bonus Optimization & Simulation Analysis of Distribution System in SCM
下载PDF
导出
摘要 研究由多个制造商与一个零售商组成的分销系统,他们以各自的利润最大化为目标,制造商给零售商提供奖金激励,零售商提供对应于奖金激励的服务水平,制造商需要进行为零售商提供多大奖金激励的决策。利用强化学习的启发式学习算法来优化制造商应提供的最优奖金激励。 The paper studies a distribution system consisting of some manufacturers and a single retailer in SCM and uses heuristic learning algorithm which can reinforce learning to optimize the optimal bonus incentive provided by manufacturers to the retailer.
出处 《物流技术》 2007年第9期86-89,共4页 Logistics Technology
基金 国家自然科学基金项目(70401007)
关键词 供应链管理 分销系统 强化学习 SCM distribution system reinforced learning
  • 相关文献

参考文献12

二级参考文献73

  • 1PR科恩 周少柏等(译).人工智能手册(第三卷)[M].科学出版社,1991..
  • 2Sutton R S,Barto A G. Reimforcement learning: an introduction[M] .MA:MIT Press, 1998.
  • 3Brown X T. Low power wireless communication via reinforcement learning[A]. In: Advances in Neural Information Processing Systems[C] .MIT press,2000(12):893 ~ 899.
  • 4Mataric M J. Cetting humanoids to move and imitate[J].IEEE Intelligent Systems,2000(7): 18 ~ 24.
  • 5Mill' an R, Posenato D, Dedieu E. Continuous - Action Qlearning[ J]. Machine Learning,2002(49):247 ~ 265.
  • 6Shapiro D. Value - driven agents[ D]. Ph. D. thesis, Stanford University, 2001.
  • 7Rennie J, McCallum A. Using reinforcement leaming to spider the web efficiently[A]. In: Pwroc of International Conference on Machine Learning (ICML)[C] .1999.
  • 8Sutton R S. Open theoretical questions in reinforcement leaming[A]. In:Proc of EuroCOLT'99[ C] .1999,11 ~ 17.
  • 9Barto A G, Mahadevan S. Recent advances in hierarchical reinforcement learning [ J ]. Special Issue on Reinforcement Learning, Discrete Event Systems,2003,23(4): 197 ~ 223.
  • 10Hailu G,Sommer G.On amount and quality of bias in reinforcement learning[ A]. In: Proc of IEEE SMC' 99[ C].1999, 1491 ~ 1495.

共引文献95

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部