基于连续动作空间深度强化学习的多数据融合室内定位方法

Multi-data fusionaided indoor localization based on continuous action space deep reinforcement learning

下载PDF

导出

摘要基于智能手机的室内定位在研究和工业领域都引起了相当大的关注。然而在复杂的定位环境中,定位的准确性和鲁棒性仍然是具有挑战性的问题。考虑到行人航位推算(PDR,pedestrian dead reckoning)算法被广泛配备在最近的智能手机上,提出了一种基于双延迟深度确定性策略梯度(TD3,twin delayed deep deterministic policy gradient)的室内定位融合方法,该方法集成了Wi-Fi信息和PDR数据,将PDR的定位过程建模为马尔可夫过程并引入了智能体的连续动作空间。最后,与3个最先进的深度Q网络(DQN,deep Q network)室内定位方法进行实验。实验结果表明,该方法能够显著减少定位误差,提高定位准确性。 Significant attention has been paid to indoor localization using smartphones in both research and industry.However,the accuracy and robustness of localization remain challenging issues,particularly in complex indoor environments.In light of the prevalent incorporation of pedestrian dead reckoning(PDR)devices in contemporary smartphones,an advanced indoor localization fusion method,anchored in the twin delayed deep deterministic policy gradient(TD3)framework,was proposed.In this approach,a seamless integration of Wi-Fi information and PDR data was achieved.The localization process of PDR was modeled as a Markov process,and a comprehensive continuous action space was introduced for the agent.To evaluate the performance of the proposed method,experiments were conducted and this approach was compared with three state-of-the-art deep Q network(DQN)based indoor localization methods.The experimental results demonstrate that the proposed method significantly reduces localization errors and enhances overall localization accuracy.

作者陈雪晨易嘉旋王霭祥邓晓衡 CHEN Xuechen;YI Jiaxuan;WANG Aixiang;DENG Xiaoheng(School of Electronic Information,Central South University,Changsha 410004,China;School of Computer Science and Engineering,Central South University,Changsha 410083,China)

机构地区中南大学电子信息学院中南大学计算机学院

出处《物联网学报》 2024年第1期40-48,共9页 Chinese Journal on Internet of Things

基金国家自然科学基金项目(No.62172441) 四川省重点研发计划(No.2023YFG0120)。

关键词 WI-FI 行人航位推算室内定位双延迟深度确定性策略梯度深度强化学习 Wi-Fi pedestrian dead reckoning indoor localization twin delayed deep deterministic policy gradient deep reinforcement learning

分类号 TN915.08 [电子电信—通信与信息系统]

引文网络
相关文献

1赵婷婷,王莹,孙威,陈亚瑞,王嫄,杨巨成.潜在空间中的策略搜索强化学习方法[J].计算机科学与探索,2024,18(4):1032-1046.
2陈潇,秦宁宁,宋书林.双源信号下多元尺度融合室内位置测算方法[J].仪器仪表学报,2024,45(1):311-320.
3章天吉,林文文,张岳君,项薇,战韬阳.使用连续动作的近端策略优化算法求解有限产能批量问题[J].机械设计与研究,2024,40(1):20-25.
4李嘉智,刘宁,节笑晗,王靖骁,赵辉.基于人体运动识别约束的室内定位方法[J].电讯技术,2024,64(4):606-611.
5李俊,肖笛,温想,赵雅洁.基于DQN算法的支线集装箱船航线规划与配载协同优化方法[J].交通信息与安全,2023,41(6):132-141.
6Ki‐Il Kim,Aswani Kumar Cherukuri,Xue Jun Li,Tanveer Ahmad,Muhammad Rafiq,Shehzad Ashraf Chaudhry.Guest Editorial:Special issue on explainable AI empowered for indoor positioning and indoor navigation[J].CAAI Transactions on Intelligence Technology,2023,8(4):1101-1103.

物联网学报

2024年第1期

浏览历史

内容加载中请稍等...

基于连续动作空间深度强化学习的多数据融合室内定位方法

相关作者

相关机构

相关主题

浏览历史