The performance of the state-of-the-art Deep Reinforcement algorithms such as Proximal Policy Optimization, Twin Delayed Deep Deterministic Policy Gradient, and Soft Actor-Critic for generating a quadruped walking gai...The performance of the state-of-the-art Deep Reinforcement algorithms such as Proximal Policy Optimization, Twin Delayed Deep Deterministic Policy Gradient, and Soft Actor-Critic for generating a quadruped walking gait in a virtual environment was presented in previous research work titled “A Comparison of PPO, TD3, and SAC Reinforcement Algorithms for Quadruped Walking Gait Generation”. We demonstrated that the Soft Actor-Critic Reinforcement algorithm had the best performance generating the walking gait for a quadruped in certain instances of sensor configurations in the virtual environment. In this work, we present the performance analysis of the state-of-the-art Deep Reinforcement algorithms above for quadruped walking gait generation in a physical environment. The performance is determined in the physical environment by transfer learning augmented by real-time reinforcement learning for gait generation on a physical quadruped. The performance is analyzed on a quadruped equipped with a range of sensors such as position tracking using a stereo camera, contact sensing of each of the robot legs through force resistive sensors, and proprioceptive information of the robot body and legs using nine inertial measurement units. The performance comparison is presented using the metrics associated with the walking gait: average forward velocity (m/s), average forward velocity variance, average lateral velocity (m/s), average lateral velocity variance, and quaternion root mean square deviation. The strengths and weaknesses of each algorithm for the given task on the physical quadruped are discussed.展开更多
In this study, we investigate the nonlinear cou- pling magneto-electric (ME) effect of a giant magnetostric- tive/piezoelectric composite cylinder. The nonlinear consti- tutive relations of the ME material are taken...In this study, we investigate the nonlinear cou- pling magneto-electric (ME) effect of a giant magnetostric- tive/piezoelectric composite cylinder. The nonlinear consti- tutive relations of the ME material are taken into account, and the influences of the nonlinear material properties on the ME effect are investigated for the static and dynamic cases, respectively. The influences of different constraint conditions on the ME effect are discussed. In the dynamic case considering nonlinear material properties, the double frequency ME response (The response frequency is twice the applied magnetic frequency) is obtained and discussed, which can be used to explain the experiment phenomenon in which the input signal with frequency f is converted to the output signal with 2f in ME laminated structures. Some calculations on nonlinear ME effect are conducted. The obtained results indicate that the nonlinear material properties affect not only the magnitude of the ME effect in the static case but also the ME response frequency in the dynamic case展开更多
Magnetorheological suspension based on 20 (v/V) % CoPc-iron composite particles dispersed in sili-cone oil have been obtained, which exhibited dynamic shear stress up to 2000Pa upon application of external magnetic fi...Magnetorheological suspension based on 20 (v/V) % CoPc-iron composite particles dispersed in sili-cone oil have been obtained, which exhibited dynamic shear stress up to 2000Pa upon application of external magnetic field at 1300 Oe. The response is faster than 0. 15 -with superior reversibility of changes in viscosity induced by external magnetic field at above 12. 5℃. Further, it was found that the MR fluid is in possession of long-term stability a-gainst sedimentation.展开更多
In order to study the critical load position that causes cavities beneath the continuously reinforced concrete pavement( CRCP) slab under vehicle loading, the elliptical load is translated into the square load based...In order to study the critical load position that causes cavities beneath the continuously reinforced concrete pavement( CRCP) slab under vehicle loading, the elliptical load is translated into the square load based on the equivalence principle.The CRCP slab is analyzed to determine the cavity position beneath the slab under vehicle loading. The influences of cavity size on the CRCP slab's stress and vertical displacement are investigated. The study results showthat the formation of the cavity is unavoidable under traffic loading, and the cavity is located at the edge of the longitudinal crack and the slab corner.The cavity size exerts an obvious influence on the largest horizontal tensile stress and vertical displacement. The slab corner is the critical load position of the CRCP slab. The results can be used to assist the design of CRCP in avoiding cavities beneath slabs subject to vehicle loading.展开更多
Large-scale indoor 3D reconstruction with multiple robots faces challenges in core enabling technologies.This work contributes to a framework addressing localization,coordination,and vision processing for multi-agent ...Large-scale indoor 3D reconstruction with multiple robots faces challenges in core enabling technologies.This work contributes to a framework addressing localization,coordination,and vision processing for multi-agent reconstruction.A system architecture fusing visible light positioning,multi-agent path finding via reinforcement learning,and 360°camera techniques for 3D reconstruction is proposed.Our visible light positioning algorithm leverages existing lighting for centimeter-level localization without additional infrastructure.Meanwhile,a decentralized reinforcement learning approach is developed to solve the multi-agent path finding problem,with communications among agents optimized.Our 3D reconstruction pipeline utilizes equirectangular projection from 360°cameras to facilitate depth-independent reconstruction from posed monocular images using neural networks.Experimental validation demonstrates centimeter-level indoor navigation and 3D scene reconstruction capabilities of our framework.The challenges and limitations stemming from the above enabling technologies are discussed at the end of each corresponding section.In summary,this research advances fundamental techniques for multi-robot indoor 3D modeling,contributing to automated,data-driven applications through coordinated robot navigation,perception,and modeling.展开更多
文摘The performance of the state-of-the-art Deep Reinforcement algorithms such as Proximal Policy Optimization, Twin Delayed Deep Deterministic Policy Gradient, and Soft Actor-Critic for generating a quadruped walking gait in a virtual environment was presented in previous research work titled “A Comparison of PPO, TD3, and SAC Reinforcement Algorithms for Quadruped Walking Gait Generation”. We demonstrated that the Soft Actor-Critic Reinforcement algorithm had the best performance generating the walking gait for a quadruped in certain instances of sensor configurations in the virtual environment. In this work, we present the performance analysis of the state-of-the-art Deep Reinforcement algorithms above for quadruped walking gait generation in a physical environment. The performance is determined in the physical environment by transfer learning augmented by real-time reinforcement learning for gait generation on a physical quadruped. The performance is analyzed on a quadruped equipped with a range of sensors such as position tracking using a stereo camera, contact sensing of each of the robot legs through force resistive sensors, and proprioceptive information of the robot body and legs using nine inertial measurement units. The performance comparison is presented using the metrics associated with the walking gait: average forward velocity (m/s), average forward velocity variance, average lateral velocity (m/s), average lateral velocity variance, and quaternion root mean square deviation. The strengths and weaknesses of each algorithm for the given task on the physical quadruped are discussed.
基金supported by the National Natural Science Foundation of China (11072093)the State Key Program of National Natural Science of China (11032006)the Fundamental Research Funds for the Central Universities (lzujbky-2012-k05)
文摘In this study, we investigate the nonlinear cou- pling magneto-electric (ME) effect of a giant magnetostric- tive/piezoelectric composite cylinder. The nonlinear consti- tutive relations of the ME material are taken into account, and the influences of the nonlinear material properties on the ME effect are investigated for the static and dynamic cases, respectively. The influences of different constraint conditions on the ME effect are discussed. In the dynamic case considering nonlinear material properties, the double frequency ME response (The response frequency is twice the applied magnetic frequency) is obtained and discussed, which can be used to explain the experiment phenomenon in which the input signal with frequency f is converted to the output signal with 2f in ME laminated structures. Some calculations on nonlinear ME effect are conducted. The obtained results indicate that the nonlinear material properties affect not only the magnitude of the ME effect in the static case but also the ME response frequency in the dynamic case
基金Chinese National Natural Science Foundation (No. 29674021) Hubei Natural Science Foundation(NO. 97J086).
文摘Magnetorheological suspension based on 20 (v/V) % CoPc-iron composite particles dispersed in sili-cone oil have been obtained, which exhibited dynamic shear stress up to 2000Pa upon application of external magnetic field at 1300 Oe. The response is faster than 0. 15 -with superior reversibility of changes in viscosity induced by external magnetic field at above 12. 5℃. Further, it was found that the MR fluid is in possession of long-term stability a-gainst sedimentation.
基金The Science Foundation of Ministry of Transport of the People's Republic of China(No.200731822301-7)
文摘In order to study the critical load position that causes cavities beneath the continuously reinforced concrete pavement( CRCP) slab under vehicle loading, the elliptical load is translated into the square load based on the equivalence principle.The CRCP slab is analyzed to determine the cavity position beneath the slab under vehicle loading. The influences of cavity size on the CRCP slab's stress and vertical displacement are investigated. The study results showthat the formation of the cavity is unavoidable under traffic loading, and the cavity is located at the edge of the longitudinal crack and the slab corner.The cavity size exerts an obvious influence on the largest horizontal tensile stress and vertical displacement. The slab corner is the critical load position of the CRCP slab. The results can be used to assist the design of CRCP in avoiding cavities beneath slabs subject to vehicle loading.
文摘针对四旋翼无人机的悬停控制及轨迹跟踪问题,利用近端策略优化算法来控制四旋翼飞行器,通过强化学习训练神经网络,将状态直接映射到四个旋翼,是一种用于在未知动态参数和干扰下控制任何线性或非线性系统的技术。基于回报塑形技术(The reward shaping of RL),提出了一种新颖的奖励函数,相比传统的PID算法,可以使无人机飞行更迅速且平稳。实验表明,四旋翼无人机可以以高精度高平稳的性能从三维中的定点悬停及轨迹跟踪,精度高达97.2%;文中的位置控制器具有泛化性和鲁棒性。
基金supported by Bright Dream Robotics and the HKUSTBDR Joint Research Institute Funding Scheme under Project HBJRI-FTP-005(Automated 3D Reconstruction using Robot-mounted 360-Degree Camera with Visible Light Positioning Technology for Building Information Modelling Applications,OKT22EG06).
文摘Large-scale indoor 3D reconstruction with multiple robots faces challenges in core enabling technologies.This work contributes to a framework addressing localization,coordination,and vision processing for multi-agent reconstruction.A system architecture fusing visible light positioning,multi-agent path finding via reinforcement learning,and 360°camera techniques for 3D reconstruction is proposed.Our visible light positioning algorithm leverages existing lighting for centimeter-level localization without additional infrastructure.Meanwhile,a decentralized reinforcement learning approach is developed to solve the multi-agent path finding problem,with communications among agents optimized.Our 3D reconstruction pipeline utilizes equirectangular projection from 360°cameras to facilitate depth-independent reconstruction from posed monocular images using neural networks.Experimental validation demonstrates centimeter-level indoor navigation and 3D scene reconstruction capabilities of our framework.The challenges and limitations stemming from the above enabling technologies are discussed at the end of each corresponding section.In summary,this research advances fundamental techniques for multi-robot indoor 3D modeling,contributing to automated,data-driven applications through coordinated robot navigation,perception,and modeling.