摘要
在次用户无法准确掌握信道被占用情况下,为了减少对主用户的干扰,提出了一种基于部分可观察马尔科夫决策过程(Partially Observable Markov DecisionProcesses,POMDP)的机会式频谱接入方法.该方法把次用户在每个决策时刻从多个信道中选择其中一个信道进行接入这一过程模型化为一个无限阶部分可观察马尔科夫决策过程.仿真结果表明,通过不断从外界环境中学习,次用户总可以按照目标函数最大准则选择满意的频谱空穴.该模型为动态频谱接入提供了思路.
A scheme of opportunistic spectrum access based on POMDP is pro- posed, to avoid the interference to the primary user for the secondary user does not have the exact knowledge of which channel is occupied. The process of secondary user selecting a satisfied channel is formulated as a partial observable Markov deci- sion process which is infinite horizon. The simulation results show that through learning from the extern world, secondary users can search for and exploit spec- trum holes according to the rule of maximizing the expected rewards. The simula- tion results validate this model. It provides a new idea to dynamic spectrum access method.
出处
《电波科学学报》
EI
CSCD
北大核心
2013年第3期553-558,共6页
Chinese Journal of Radio Science
基金
电子信息系统复杂电磁环境效应国家重点实验室基金项目(CEMEE2012K0107B)