With the intensive deployment of users and the drastic increase of traffic load, a millimeter wave (mmWave) backhaul network was widely investigated. A typical mmWave backhaul network consists of the macro base stat...With the intensive deployment of users and the drastic increase of traffic load, a millimeter wave (mmWave) backhaul network was widely investigated. A typical mmWave backhaul network consists of the macro base station (MBS) and the small base stations (SBSs). How to efficiently associate users with the MBS and the SBSs for load balancing is a key issue in the network. By adding a virtual power bias to the SBSs, more users can access to the SBSs to share the load of the MBS. The bias values shall be set reasonably to guarantee the backhaul efficiency and the quality of service (QoS). An improved Q-learning algorithm is proposed to effectively adjust the bias value for each SBS. In the proposed algorithm, each SBS becomes an agent with independent learning and can achieve the best behavior, namely the optimal bias value through a series of training. Besides, an improved behavior selection mechanism is adopted to improve the learning efficiency and accelerate the convergence of the algorithm. Finally, simulations conducted in the 60 GHz band demonstrate the superior performance of the proposed algorithm in backhaul efficiency and user outage probability.展开更多
基金supported by the State Major Science and Technique Project (MJ-2014-S-37)the 111 Project (B08038)
文摘With the intensive deployment of users and the drastic increase of traffic load, a millimeter wave (mmWave) backhaul network was widely investigated. A typical mmWave backhaul network consists of the macro base station (MBS) and the small base stations (SBSs). How to efficiently associate users with the MBS and the SBSs for load balancing is a key issue in the network. By adding a virtual power bias to the SBSs, more users can access to the SBSs to share the load of the MBS. The bias values shall be set reasonably to guarantee the backhaul efficiency and the quality of service (QoS). An improved Q-learning algorithm is proposed to effectively adjust the bias value for each SBS. In the proposed algorithm, each SBS becomes an agent with independent learning and can achieve the best behavior, namely the optimal bias value through a series of training. Besides, an improved behavior selection mechanism is adopted to improve the learning efficiency and accelerate the convergence of the algorithm. Finally, simulations conducted in the 60 GHz band demonstrate the superior performance of the proposed algorithm in backhaul efficiency and user outage probability.