
Energy Efficiency Optimization Task Processing Mechanism Based on Reinforcement Learning in WSN
Abstract: Aiming at improving the energy efficiency of task processing in wireless sensor networks, a near-optimal task processing mechanism is proposed, in which wireless sensor nodes can dynamically offload tasks to edge servers or process them locally according to the number of tasks in the task buffer and the channel conditions. The task processing mechanism is modeled as a Markov decision process. Since the wireless sensor node does not know the state transition probabilities of this process, the A3C algorithm is used to explore and learn under unknown environmental parameters, so as to obtain an approximately optimal task processing strategy. Under given buffer and channel conditions, this strategy selects the optimal number of tasks, modulation level and transmission power, improving the average energy efficiency of task processing. Simulation results show that, compared with other mechanisms, the proposed task processing mechanism improves node energy efficiency and converges faster.
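The abstract's core idea, a node learning when to offload versus process locally from buffer and channel state, can be illustrated with a toy sketch. All numbers and the environment below are illustrative assumptions, not the paper's model, and a tabular one-step actor-critic stands in for A3C (which adds neural-network function approximation and asynchronous parallel workers on top of the same actor-critic learning rule):

```python
import math
import random

# Assumed toy costs: local processing has a fixed energy cost, while
# offloading costs little on a "good" channel and a lot on a "bad" one.
E_LOCAL = 4.0
E_TX = {"good": 1.0, "bad": 6.0}
ACTIONS = ("local", "offload")

def step(channel, action):
    """Return (reward, next_channel); reward is tasks done per unit energy."""
    energy = E_LOCAL if action == "local" else E_TX[channel]
    reward = 1.0 / energy  # energy-efficiency proxy
    # Two-state Markov channel: stays in the same state with prob. 0.8.
    next_channel = channel if random.random() < 0.8 else \
        ("bad" if channel == "good" else "good")
    return reward, next_channel

def train(episodes=3000, alpha=0.1, beta=0.1, gamma=0.9, seed=0):
    random.seed(seed)
    prefs = {(s, a): 0.0 for s in E_TX for a in ACTIONS}  # actor: preferences
    value = {s: 0.0 for s in E_TX}                        # critic: state values
    s = "good"
    for _ in range(episodes):
        # Softmax policy over action preferences for the current state.
        z = [math.exp(prefs[(s, a)]) for a in ACTIONS]
        probs = [x / sum(z) for x in z]
        a = random.choices(ACTIONS, probs)[0]
        r, s2 = step(s, a)
        td = r + gamma * value[s2] - value[s]  # TD error as advantage estimate
        value[s] += beta * td                  # critic update
        for i, act in enumerate(ACTIONS):      # actor: policy-gradient update
            grad = (1.0 if act == a else 0.0) - probs[i]
            prefs[(s, act)] += alpha * td * grad
        s = s2
    return prefs

prefs = train()
# With these assumed costs, offloading should be preferred on a good
# channel and local processing on a bad one.
print(prefs[("good", "offload")] > prefs[("good", "local")])
print(prefs[("bad", "local")] > prefs[("bad", "offload")])
```

The key property mirrored here is the one the abstract relies on: the node never uses the channel's transition probabilities, it learns the policy purely from sampled rewards, which is why a model-free method such as A3C fits the problem.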
Authors: ZHANG Mingjie; ZHU Jiang (Chongqing Key Laboratory of Mobile Communications Technology, Engineering Research Center of Mobile Communications of the Ministry of Education, School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China)
Source: Journal of Signal Processing (CSCD, Peking University Core Journal), 2022, No. 3, pp. 609-618 (10 pages)
Funding: National Natural Science Foundation of China (61271260); Science and Technology Research Project of the Chongqing Municipal Education Commission (KJ1400416)
Keywords: wireless sensor network; mobile edge computing; Markov decision process; reinforcement learning
