Funding: Supported by the National Natural Science Foundation of China (62172377, 61872205), the Shandong Provincial Natural Science Foundation (ZR2019MF018), and the Startup Research Foundation for Distinguished Scholars (202112016).
Abstract: Currently, important private data in the Internet of Things (IoT) faces an extremely high risk of leakage. Attackers continuously target terminal devices to obtain critical private data. Although deep reinforcement learning defense strategies have made significant progress in recent years, most defense methods still suffer from inefficient allocation of defense resources and insufficient coordination among defenders. To address these problems, this paper constructs a novel adversarial security scenario and proposes a security game model that integrates defense resource allocation and patrol inspection. For this game model, the paper designs a deep reinforcement learning algorithm named SDSA to compute the security defense strategy. SDSA computes the defender's best resource allocation and patrolling strategy by searching for a policy over a multi-dimensional discrete action space, and it enables multiple defense agents to cooperate efficiently by training a multi-agent Dueling Double Deep Q-Network (D3QN) with prioritized experience replay. Finally, experimental results show that, compared with the MADDPG and OptGradFP methods, the defense strategy learned by SDSA provides defenders with feasible and effective protection against attacks.
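The abstract names three standard building blocks: a dueling network head, a Double DQN target, and prioritized experience replay. The sketch below illustrates only their textbook forms in plain Python, as a reading aid. It is not the SDSA implementation, which the abstract does not specify. All function names, the exponent `alpha`, and the offset `eps` are hypothetical choices made for illustration.

```python
import random


def dueling_q(value, advantages):
    """Dueling aggregation: Q(s, a) = V(s) + A(s, a) - mean_a A(s, a)."""
    mean_adv = sum(advantages) / len(advantages)
    return [value + a - mean_adv for a in advantages]


def double_dqn_target(reward, gamma, next_q_online, next_q_target, done):
    """Double DQN target: the online network picks the next action,
    the target network evaluates it."""
    if done:
        return reward
    best = max(range(len(next_q_online)), key=lambda i: next_q_online[i])
    return reward + gamma * next_q_target[best]


def per_probabilities(td_errors, alpha=0.6, eps=1e-6):
    """Proportional prioritization: P(i) = p_i^alpha / sum_k p_k^alpha,
    with priority p_i = |TD error_i| + eps (alpha and eps are assumed values)."""
    priorities = [(abs(e) + eps) ** alpha for e in td_errors]
    total = sum(priorities)
    return [p / total for p in priorities]


def sample_indices(td_errors, batch_size, rng=random):
    """Draw replay-buffer indices with probability proportional to priority."""
    probs = per_probabilities(td_errors)
    return rng.choices(range(len(td_errors)), weights=probs, k=batch_size)
```

In a multi-agent setting, each defense agent would typically hold its own copy of these components, and a multi-dimensional discrete action can be handled by giving each action dimension its own advantage list; whether SDSA does either is not stated in the abstract.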