Millimeter-Wave Concurrent Beamforming:A Multi-Player Multi-Armed Bandit Approach 被引量：1

下载PDF

导出

摘要 The communication in the Millimeter-wave(mmWave)band,i.e.,30~300 GHz,is characterized by short-range transmissions and the use of antenna beamforming(BF).Thus,multiple mmWave access points(APs)should be installed to fully cover a target environment with gigabits per second(Gbps)connectivity.However,inter-beam interference prevents maximizing the sum rates of the established concurrent links.In this paper,a reinforcement learning(RL)approach is proposed for enabling mmWave concurrent transmissions by finding out beam directions that maximize the long-term average sum rates of the concurrent links.Specifically,the problem is formulated as a multiplayer multiarmed bandit(MAB),where mmWave APs act as the players aiming to maximize their achievable rewards,i.e.,data rates,and the arms to play are the available beam directions.In this setup,a selfish concurrent multiplayer MAB strategy is advocated.Four different MAB algorithms,namely,ϵ-greedy,upper confidence bound(UCB),Thompson sampling(TS),and exponential weight algorithm for exploration and exploitation(EXP3)are examined by employing them in each AP to selfishly enhance its beam selection based only on its previous observations.After a few rounds of interactions,mmWave APs learn how to select concurrent beams that enhance the overall system performance.The proposed MAB based mmWave concurrent BF shows comparable performance to the optimal solution.

作者 Ehab Mahmoud Mohamed Sherief Hashima Kohei Hatano Hani Kasban Mohamed Rihan

机构地区 Electrical Engineering Department Electrical Engineering Department Computational Learning Theory Team Engineering Department Faculty of Arts and Science Electronics and Electrical Communication Engineering

出处《Computers, Materials & Continua》 SCIE EI 2020年第12期1987-2007,共21页 计算机、材料和连续体（英文）

关键词 Millimeter wave(mmWave) concurrent transmissions reinforcement learning multiarmed bandit(MAB)

分类号 TP3 [自动化与计算机技术—计算机科学与技术]

引文网络
相关文献

同被引文献3

1Rihem Farkh,Haykel Marouani,Khaled Al Jaloud,Saad Alhuwaimel,Mohammad Tabrez Quasim,Yasser Fouad.Intelligent Autonomous-Robot Control for Medical Applications[J].Computers, Materials & Continua,2021(8):2189-2203. 被引量：3
2Xiaorui Zhang,Xun Sun,Xingming Sun,Wei Sun,Sunil Kumar Jha.Robust Reversible Audio Watermarking Scheme for Telemedicine and Privacy Protection[J].Computers, Materials & Continua,2022(5):3035-3050. 被引量：63
3Xiaorui Zhang,Wenfang Zhang,Wei Sun,Xingming Sun,Sunil Kumar Jha.A Robust 3-D Medical Watermarking Based on Wavelet Transform for Data Protection[J].Computer Systems Science & Engineering,2022,41(6):1043-1056. 被引量：70

引证文献1

1Biswaranjan Panda,Nitin Kumar Tripathy,Shibashankar Sahu,Bikash K.Behera,Walaa E.Elhady.Controlling Remote Robots Based on Zidan’s Quantum Computing Model[J].Computers, Materials & Continua,2022(12):6225-6236.

1Kang Liu,Wei Quan,Deyun Gao,Chengxiao Yu,Mingyuan Liu,Yuming Zhang.Distributed Asynchronous Learning for Multipath Data Transmission Based on P-DDQN[J].China Communications,2021,18(8):62-74. 被引量：1
2Yaping Wang,Zhicheng Peng,Riquan Zhang,Qian Xiao.Robust sequential design for piecewise-stationary multi-armed bandit problem in the presence of outliers[J].Statistical Theory and Related Fields,2021,5(2):122-133.
3Alessandra De Paola,Salvatore Gaglio,Andrea Giammanco,Giuseppe Lo Re,Marco Morana.A multi-agent system for itinerary suggestion in smart environments[J].CAAI Transactions on Intelligence Technology,2021,6(4):377-393.
4Ping Lu.A Position Self-Adaptive Method to Detect Fake Access Points[J].Journal of Quantum Computing,2020,2(2):119-127.
5张富春,朱孔林.异质能量约束的集群学习节点选择机制[J].无线电工程,2022,52(1):45-52.
6Jingjing Du,Zhongwei Chen.Applying Organizational Ambidexterity in strategic management under a“VUCA”environment:Evidence from high tech companies in China[J].International Journal of Innovation Studies,2018,2(1):42-52. 被引量：6
7Yazhou Hu,Fengzhen Tang,Jun Chen,Wenxue Wang.Quantum-enhanced reinforcement learning for control:a preliminary study[J].Control Theory and Technology,2021,19(4):455-464.
8Shengchun Wang,Xiaozhong Yu,Lianye Liu,Jingui Huang,Tsz Ho Wong,Chengcheng Jiang.An Approach for Radar Quantitative Precipitation Estimation Based on Spatiotemporal Network[J].Computers, Materials & Continua,2020(10):459-479. 被引量：1
9敖天宇,刘全.一种快速收敛的最大置信上界探索方法[J].计算机科学,2022,49(1):298-305.
10Q.Cheng,X.D.Xu,P.Xie,L.L.Han,J.Y.He,X.Q.Li,J.Zhang,Z.T.Li,Y.P.Li,B.Liu,T.G.Nieh,M.W.Chen,J.H.Chen.Unveiling anneal hardening in dilute Al-doped Al_(x)CoCrFeMnNi(x = 0,0.1) high-entropy alloys[J].Journal of Materials Science & Technology,2021(32):270-277. 被引量：6

Computers, Materials & Continua

2020年第12期

浏览历史

内容加载中请稍等...

Millimeter-Wave Concurrent Beamforming:A Multi-Player Multi-Armed Bandit Approach 被引量：1

同被引文献3

引证文献1

相关作者

相关机构

相关主题

浏览历史