期刊文献+
共找到5篇文章
< 1 >
每页显示 20 50 100
User Station Security Protection Method Based on Random Domain Name Detection and Active Defense
1
作者 Hongyan Yin Xiaokang Ren +2 位作者 Jinyu Liu Shuo Zhang Wenkun Liu 《Journal of Information Security》 2023年第1期39-51,共13页
The power monitoring system is the most important production management system in the power industry. As an important part of the power monitoring system, the user station that lacks grid binding will become an import... The power monitoring system is the most important production management system in the power industry. As an important part of the power monitoring system, the user station that lacks grid binding will become an important target of network attacks. In order to perceive the network attack events on the user station side in time, a method combining real-time detection and active defense of random domain names on the user station side was proposed. Capsule network (CapsNet) combined with long short-term memory network (LSTM) was used to classify the domain names extracted from the traffic data. When a random domain name is detected, it sent instructions to routers and switched to update their security policies through the remote terminal protocol (Telnet), or shut down the service interfaces of routers and switched to block network attacks. The experimental results showed that the use of CapsNet combined with LSTM classification algorithm can achieve 99.16% accuracy and 98% recall rate in random domain name detection. Through the Telnet protocol, routers and switches can be linked to make active defense without interrupting services. 展开更多
关键词 User Station Random domain Name Detection Capsule Network Active Defense Long Short Term Memory
下载PDF
Sim-to-Real: A Performance Comparison of PPO, TD3, and SAC Reinforcement Learning Algorithms for Quadruped Walking Gait Generation
2
作者 James W. Mock Suresh S. Muknahallipatna 《Journal of Intelligent Learning Systems and Applications》 2024年第2期23-43,共21页
The performance of the state-of-the-art Deep Reinforcement algorithms such as Proximal Policy Optimization, Twin Delayed Deep Deterministic Policy Gradient, and Soft Actor-Critic for generating a quadruped walking gai... The performance of the state-of-the-art Deep Reinforcement algorithms such as Proximal Policy Optimization, Twin Delayed Deep Deterministic Policy Gradient, and Soft Actor-Critic for generating a quadruped walking gait in a virtual environment was presented in previous research work titled “A Comparison of PPO, TD3, and SAC Reinforcement Algorithms for Quadruped Walking Gait Generation”. We demonstrated that the Soft Actor-Critic Reinforcement algorithm had the best performance generating the walking gait for a quadruped in certain instances of sensor configurations in the virtual environment. In this work, we present the performance analysis of the state-of-the-art Deep Reinforcement algorithms above for quadruped walking gait generation in a physical environment. The performance is determined in the physical environment by transfer learning augmented by real-time reinforcement learning for gait generation on a physical quadruped. The performance is analyzed on a quadruped equipped with a range of sensors such as position tracking using a stereo camera, contact sensing of each of the robot legs through force resistive sensors, and proprioceptive information of the robot body and legs using nine inertial measurement units. The performance comparison is presented using the metrics associated with the walking gait: average forward velocity (m/s), average forward velocity variance, average lateral velocity (m/s), average lateral velocity variance, and quaternion root mean square deviation. The strengths and weaknesses of each algorithm for the given task on the physical quadruped are discussed. 展开更多
关键词 Reinforcement Learning Reality Gap Position Tracking Action Spaces domain randomization
下载PDF
Deep-reinforcement-learning-based robot motion strategies for grabbing objects from human hands
3
作者 Zeyuan CAI Zhiquan FENG +2 位作者 Liran ZHOU Xiaohui YANG Tao XU 《Virtual Reality & Intelligent Hardware》 EI 2023年第5期407-421,共15页
Background Robot grasping encompasses a wide range of research areas;however, most studies have been focused on the grasping of only stationary objects in a scene;only a few studies on how to grasp objects from a user... Background Robot grasping encompasses a wide range of research areas;however, most studies have been focused on the grasping of only stationary objects in a scene;only a few studies on how to grasp objects from a user's hand have been conducted. In this paper, a robot grasping algorithm based on deep reinforcement learning (RGRL) is proposed. Methods The RGRL takes the relative positions of the robot and the object in a user's hand as input and outputs the best action of the robot in the current state. Thus, the proposed algorithm realizes the functions of autonomous path planning and grasping objects safely from the hands of users. A new method for improving the safety of human-robot cooperation is explored. To solve the problems of a low utilization rate and slow convergence of reinforcement learning algorithms, the RGRL is first trained in a simulation scene, and then, the model para-meters are applied to a real scene. To reduce the difference between the simulated and real scenes, domain randomization is applied to randomly change the positions and angles of objects in the simulated scenes at regular intervals, thereby improving the diversity of the training samples and robustness of the algorithm. Results The RGRL's effectiveness and accuracy are verified by evaluating it on both simulated and real scenes, and the results show that the RGRL can achieve an accuracy of more than 80% in both cases. Conclusions RGRL is a robot grasping algorithm that employs domain randomization and deep reinforcement learning for effective grasping in simulated and real scenes. However, it lacks flexibility in adapting to different grasping poses, prompting future research in achieving safe grasping for diverse user postures. 展开更多
关键词 Robot grasping Deep reinforcement learning domain randomization Human-to-robot handovers Human-machine collaboration
下载PDF
A Comparison of PPO, TD3 and SAC Reinforcement Algorithms for Quadruped Walking Gait Generation
4
作者 James W. Mock Suresh S. Muknahallipatna 《Journal of Intelligent Learning Systems and Applications》 2023年第1期36-56,共21页
Deep reinforcement learning (deep RL) has the potential to replace classic robotic controllers. State-of-the-art Deep Reinforcement algorithms such as Proximal Policy Optimization, Twin Delayed Deep Deterministic Poli... Deep reinforcement learning (deep RL) has the potential to replace classic robotic controllers. State-of-the-art Deep Reinforcement algorithms such as Proximal Policy Optimization, Twin Delayed Deep Deterministic Policy Gradient and Soft Actor-Critic Reinforcement Algorithms, to mention a few, have been investigated for training robots to walk. However, conflicting performance results of these algorithms have been reported in the literature. In this work, we present the performance analysis of the above three state-of-the-art Deep Reinforcement algorithms for a constant velocity walking task on a quadruped. The performance is analyzed by simulating the walking task of a quadruped equipped with a range of sensors present on a physical quadruped robot. Simulations of the three algorithms across a range of sensor inputs and with domain randomization are performed. The strengths and weaknesses of each algorithm for the given task are discussed. We also identify a set of sensors that contribute to the best performance of each Deep Reinforcement algorithm. 展开更多
关键词 Reinforcement Learning Machine Learning Markov Decision Process domain randomization
下载PDF
Numerical Solution of Partial Differential Equations in Random Domains:An Application to Wind Engineering
5
作者 Claudio Canuto Davide Fransos 《Communications in Computational Physics》 SCIE 2009年第2期515-531,共17页
An application of recent uncertainty quantification techniques to Wind Engineering is presented.In particular,the study of the effects of small geometric changes in the Sunshine Skyway Bridge deck on its aerodynamic b... An application of recent uncertainty quantification techniques to Wind Engineering is presented.In particular,the study of the effects of small geometric changes in the Sunshine Skyway Bridge deck on its aerodynamic behavior is addressed.This results in the numerical solution of a proper PDE posed in a domain affected by randomness,which is handled through a mapping approach.A non-intrusive Polynomial Chaos expansion allows to transform the stochastic problem into a deterministic one,in which a commercial code is used as a black-box for the solution of a number of Reynolds-Averaged Navier-Stokes simulations.The use of proper Gauss-Patterson nested quadrature formulas with respect to a Truncated Weibull probability density function permits to limit the number of these computationally expensive simulations,though maintaining a sufficient accuracy.Polynomial Chaos approximations,statistical moments and probability density functions of time-independent quantities of interest for the engineering applications are obtained. 展开更多
关键词 Uncertainty quantification stochastic partial differential equations random domains mapping approach polynomial chaos wind engineering
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部