期刊文献+

用户偏好提取MDP建模研究 被引量:2

Modeling of User Preference Based on MDP
下载PDF
导出
摘要 将马尔可夫判决过程和智能强化学习算法相结合,给出了异构无线网络环境下用户业务偏好评估模型的技术框架。为动态环境下用户需求的感知、量化和适配特征的研究提供了基本的数学描述,对解决用户体验的评价问题和业务与业务环境的适配问题提供了新的研究思路。仿真结果表明所构建的MDP模型能够在多状态条件下学习用户偏好,根据用户需求智能选择业务。 A technical architecture for user preference model is presented,and the nature of the problem represented within a Markov Decision Process(MDP) combined with adaptive reinforcement learning algorithm is displayed.We provided a possible candidate solution for user modeling dynamically to satisfy the user's expected preference based on minimal or missing information.It is also a exploration for the evaluation of the user experience when selecting service providers.Simulations of the user models show that the ...
出处 《国防科技大学学报》 EI CAS CSCD 北大核心 2006年第6期81-85,共5页 Journal of National University of Defense Technology
基金 国家863高技术资助项目(2003AA12331004)
关键词 效用理论 用户偏好 马尔可夫判决过程 强化学习 utility theory user preference Markov decision process reinforcement learning
  • 相关文献

参考文献7

  • 1[1]Chajewska U,Koller D,Parr R.Making Rational Decisions during Adaptive Utility Elicitation[A].In Proceedings of the Seventeenth National Conference on Artificial Intelligence[C],Austin,TX,2000:363-369.
  • 2[2]Boutilier C.A POMDP Formulation of Preference Elicitation Problems[A].In Proceedings of American Association of Artificial Intelligence[C],Edmonton,Alberta,Canada,2002:239-640.
  • 3[3]French S.Decision Theory:An Introduction to the Mathematics of Rationality[M].New York,USA Halsted Press,1986.
  • 4[4]Kaelbling L P,Moore A W.Reinforcement Learning:A Survey[J].Journal of Artificial Intelligence Research,1996,4:237-285.
  • 5[5]Sutton R S,Barto A G.Reinforcement Learning[M].MIT Press,Cambridge,MA,1998.
  • 6[6]Boulet D P,Fraser N M.Improving Preference Elicitation for Decision Support Systems[J].IEEE,1995:1574-1579.
  • 7[7]Ha V,Haddawy P.A Hybrid Approach to Reasoning with Partial Preference Models[A].In Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence[C],1999:263-270.

同被引文献16

  • 1刘晓光,金烨.网络服务自动化中服务功能匹配研究[J].计算机集成制造系统,2006,12(5):782-787. 被引量:4
  • 2张志政,翟玉庆,邢汉承.偏好推理的逻辑链实现[J].软件学报,2006,17(12):2518-2528. 被引量:4
  • 3Cooper A, Reimann R, Cronin D.About face 3 : the essentials of interaction design[M].[S.1.]:Wiley Publishing,2007.
  • 4Olewnik A T.Conjoint-HOQ: a quantitative methodology for consumer-driven design[C]//DETC2007,2008 : 207-217.
  • 5Pruitt J, Adlin T.The persona lifecycle:keeping people in mind throughout product design[M].[S.1.]: Morgan Kaufman Pub- lisher, 2006.
  • 6Junior P T A,Filgueiras L V L.User modeling with perso- nas[C]//Proceedings of CLIHC.New York:ACM Press, 2005 : 277-282.
  • 7McGinn J,Kotamraju N.Data-driven persona development[C]// CHI 2008, Florence, Italy, 2008 : 1045-1054.
  • 8Tsai H,Hsiao S.Evaluation of alternatives for product customi- zation using fuzzy logic[J].Information Sciences, 2004, 158( 1 ): 232-262.
  • 9Green P E.Thirty years of conjoint analysis:reflections and prospects[J].Interfaces, 2004,31 (3) : 59-61.
  • 10Wu Kan, Lu Changde.Personas construction based on utility analysis in industrial design[C]//MASS 2010,2010.

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部