期刊文献+
共找到71,806篇文章
< 1 2 250 >
每页显示 20 50 100
Elliptical encirclement control capable of reinforcing performances for UAVs around a dynamic target
1
作者 Fei Zhang Xingling Shao +1 位作者 Yi Xia Wendong Zhang 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2024年第2期104-119,共16页
Most researches associated with target encircling control are focused on moving along a circular orbit under an ideal environment free from external disturbances.However,elliptical encirclement with a time-varying obs... Most researches associated with target encircling control are focused on moving along a circular orbit under an ideal environment free from external disturbances.However,elliptical encirclement with a time-varying observation radius,may permit a more flexible and high-efficacy enclosing solution,whilst the non-orthogonal property between axial and tangential speed components,non-ignorable environmental perturbations,and strict assignment requirements empower elliptical encircling control to be more challenging,and the relevant investigations are still open.Following this line,an appointed-time elliptical encircling control rule capable of reinforcing circumnavigation performances is developed to enable Unmanned Aerial Vehicles(UAVs)to move along a specified elliptical path within a predetermined reaching time.The remarkable merits of the designed strategy are that the relative distance controlling error can be guaranteed to evolve within specified regions with a designer-specified convergence behavior.Meanwhile,wind perturbations can be online counteracted based on an unknown system dynamics estimator(USDE)with only one regulating parameter and high computational efficiency.Lyapunov tool demonstrates that all involved error variables are ultimately limited,and simulations are implemented to confirm the usability of the suggested control algorithm. 展开更多
关键词 Elliptical encirclement reinforced performances Wind perturbations UAVS
下载PDF
Field implementation of enzyme-induced carbonate precipitation technology for reinforcing a bedding layer beneath an underground cable duct 被引量:3
2
作者 Kai Xu Ming Huang +2 位作者 Jiajie Zhen Chaoshui Xu Mingjuan Cui 《Journal of Rock Mechanics and Geotechnical Engineering》 SCIE CSCD 2023年第4期1011-1022,共12页
A suitable bearing capacity of foundation is critical for the safety of civil structures.Sometimes foundation reinforcement is necessary and an effective and environmentally friendly method would be the preferred choi... A suitable bearing capacity of foundation is critical for the safety of civil structures.Sometimes foundation reinforcement is necessary and an effective and environmentally friendly method would be the preferred choice.In this study,the potential application of enzyme-induced carbonate precipitation(EICP)was investigated for reinforcing a 0.6 m bedding layer on top of clay to improve the bearing capacity of the foundation underneath an underground cable duct.Laboratory experiments were conducted to determine the optimal operational parameters for the extraction of crude urease liquid and optimal grain size range of sea sands to be used to construct the bedding layer.Field tests were planned based on orthogonal experimental design to study the factors that would significantly affect the biocementation effect on site.The dynamic deformation modulus,calcium carbonate content and longterm ground stress variations were used to evaluate the bio-cementation effect and the long-term performance of the EICP-treated bedding layer.The laboratory test results showed that the optimal duration for the extraction of crude urease liquid is 1 h and the optimal usage of soybean husk powder in urease extraction solution is 100 g/L.The calcium carbonate production rate decreases significantly when the concentration of cementation solution exceeds 0.5 mol/L.The results of site trial showed that the number of EICP treatments has the most significant impact on the effectiveness of EICP treatment and the highest dynamic deformation modulus(Evd)of EICP-treated bedding layer reached 50.55 MPa.The area with better bio-cementation effect was found to take higher ground stress which validates that the EICP treatment could improve the bearing capacity of foundation by reinforcing the bedding layer.The field trial described and the analysis introduced in this paper can provide a practical basis for applying EICP technology to the reinforcement of bedding layer in poor ground conditions. 展开更多
关键词 Enzyme-induced carbonate precipitation (EICP) Plant-based urease Underground cable duct Foundation reinforcement
下载PDF
UAV-Assisted Dynamic Avatar Task Migration for Vehicular Metaverse Services: A Multi-Agent Deep Reinforcement Learning Approach 被引量:1
3
作者 Jiawen Kang Junlong Chen +6 位作者 Minrui Xu Zehui Xiong Yutao Jiao Luchao Han Dusit Niyato Yongju Tong Shengli Xie 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第2期430-445,共16页
Avatars, as promising digital representations and service assistants of users in Metaverses, can enable drivers and passengers to immerse themselves in 3D virtual services and spaces of UAV-assisted vehicular Metavers... Avatars, as promising digital representations and service assistants of users in Metaverses, can enable drivers and passengers to immerse themselves in 3D virtual services and spaces of UAV-assisted vehicular Metaverses. However, avatar tasks include a multitude of human-to-avatar and avatar-to-avatar interactive applications, e.g., augmented reality navigation,which consumes intensive computing resources. It is inefficient and impractical for vehicles to process avatar tasks locally. Fortunately, migrating avatar tasks to the nearest roadside units(RSU)or unmanned aerial vehicles(UAV) for execution is a promising solution to decrease computation overhead and reduce task processing latency, while the high mobility of vehicles brings challenges for vehicles to independently perform avatar migration decisions depending on current and future vehicle status. To address these challenges, in this paper, we propose a novel avatar task migration system based on multi-agent deep reinforcement learning(MADRL) to execute immersive vehicular avatar tasks dynamically. Specifically, we first formulate the problem of avatar task migration from vehicles to RSUs/UAVs as a partially observable Markov decision process that can be solved by MADRL algorithms. We then design the multi-agent proximal policy optimization(MAPPO) approach as the MADRL algorithm for the avatar task migration problem. To overcome slow convergence resulting from the curse of dimensionality and non-stationary issues caused by shared parameters in MAPPO, we further propose a transformer-based MAPPO approach via sequential decision-making models for the efficient representation of relationships among agents. Finally, to motivate terrestrial or non-terrestrial edge servers(e.g., RSUs or UAVs) to share computation resources and ensure traceability of the sharing records, we apply smart contracts and blockchain technologies to achieve secure sharing management. Numerical results demonstrate that the proposed approach outperforms the MAPPO approach by around 2% and effectively reduces approximately 20% of the latency of avatar task execution in UAV-assisted vehicular Metaverses. 展开更多
关键词 AVATAR blockchain metaverses multi-agent deep reinforcement learning transformer UAVS
下载PDF
Assessment of Deviation in Quality of Steel Reinforcing Bars Used in Some Building Sites in Cameroon
4
作者 Patrick Che Bame Bell Emmanuel Yamb Billong Ndigui 《World Journal of Engineering and Technology》 2023年第4期917-931,共15页
The present work evaluated the deviations in the quality of steel reinforcing bars in terms of markings, diameter, yield strength and ductility in order to facilitate the drawing up of a yield strength value for the C... The present work evaluated the deviations in the quality of steel reinforcing bars in terms of markings, diameter, yield strength and ductility in order to facilitate the drawing up of a yield strength value for the Cameroon National Annex to Eurocode 2. The methodology of the work started with the collection of steel samples from various active building project sites in four different towns viz: Bamenda, Douala, Maroua and Yaoundé and testing their tensile strength and elongation using a Universal Testing Machine and also carrying out the bending test. Results show that bars without marked manufacturer’s name fell all the tests. Other results show that 52% of all the steel had yield stresses below 400 Mpa and the highest deviation in the yield strengths was 22.50%. The study recommends that properly marked grade 500 steel bars should be adopted in the Cameroon national annex to Eurocode 2. 展开更多
关键词 Eurocode 2 National Annex reinforcement Steel DEVIATIONS Yield Strengths
下载PDF
Recent Progress in Reinforcement Learning and Adaptive Dynamic Programming for Advanced Control Applications 被引量:1
5
作者 Ding Wang Ning Gao +2 位作者 Derong Liu Jinna Li Frank L.Lewis 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第1期18-36,共19页
Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and ... Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are displayed, where the main results towards discrete-time systems and continuous-time systems are surveyed, respectively.Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environment is discussed, respectively, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environment attract enormous attention. The ADP architecture is revisited under the perspective of data-driven and RL frameworks,showing how they promote ADP formulation significantly.Finally, several typical control applications with respect to RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has d emonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence. 展开更多
关键词 Adaptive dynamic programming(ADP) advanced control complex environment data-driven control event-triggered design intelligent control neural networks nonlinear systems optimal control reinforcement learning(RL)
下载PDF
Reinforcement Learning in Process Industries:Review and Perspective
6
作者 Oguzhan Dogru Junyao Xie +6 位作者 Om Prakash Ranjith Chiplunkar Jansen Soesanto Hongtian Chen Kirubakaran Velswamy Fadi Ibrahim Biao Huang 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第2期283-300,共18页
This survey paper provides a review and perspective on intermediate and advanced reinforcement learning(RL)techniques in process industries. It offers a holistic approach by covering all levels of the process control ... This survey paper provides a review and perspective on intermediate and advanced reinforcement learning(RL)techniques in process industries. It offers a holistic approach by covering all levels of the process control hierarchy. The survey paper presents a comprehensive overview of RL algorithms,including fundamental concepts like Markov decision processes and different approaches to RL, such as value-based, policy-based, and actor-critic methods, while also discussing the relationship between classical control and RL. It further reviews the wide-ranging applications of RL in process industries, such as soft sensors, low-level control, high-level control, distributed process control, fault detection and fault tolerant control, optimization,planning, scheduling, and supply chain. The survey paper discusses the limitations and advantages, trends and new applications, and opportunities and future prospects for RL in process industries. Moreover, it highlights the need for a holistic approach in complex systems due to the growing importance of digitalization in the process industries. 展开更多
关键词 Process control process systems engineering reinforcement learning
下载PDF
Role Dynamic Allocation of Human-Robot Cooperation Based on Reinforcement Learning in an Installation of Curtain Wall
7
作者 Zhiguang Liu Shilin Wang +2 位作者 Jian Zhao Jianhong Hao Fei Yu 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第1期473-487,共15页
A real-time adaptive roles allocation method based on reinforcement learning is proposed to improve humanrobot cooperation performance for a curtain wall installation task.This method breaks the traditional idea that ... A real-time adaptive roles allocation method based on reinforcement learning is proposed to improve humanrobot cooperation performance for a curtain wall installation task.This method breaks the traditional idea that the robot is regarded as the follower or only adjusts the leader and the follower in cooperation.In this paper,a self-learning method is proposed which can dynamically adapt and continuously adjust the initiative weight of the robot according to the change of the task.Firstly,the physical human-robot cooperation model,including the role factor is built.Then,a reinforcement learningmodel that can adjust the role factor in real time is established,and a reward and actionmodel is designed.The role factor can be adjusted continuously according to the comprehensive performance of the human-robot interaction force and the robot’s Jerk during the repeated installation.Finally,the roles adjustment rule established above continuously improves the comprehensive performance.Experiments of the dynamic roles allocation and the effect of the performance weighting coefficient on the result have been verified.The results show that the proposed method can realize the role adaptation and achieve the dual optimization goal of reducing the sum of the cooperator force and the robot’s Jerk. 展开更多
关键词 Human-robot cooperation roles allocation reinforcement learning
下载PDF
Toward Trustworthy Decision-Making for Autonomous Vehicles:A Robust Reinforcement Learning Approach with Safety Guarantees
8
作者 Xiangkun He Wenhui Huang Chen Lv 《Engineering》 SCIE EI CAS CSCD 2024年第2期77-89,共13页
While autonomous vehicles are vital components of intelligent transportation systems,ensuring the trustworthiness of decision-making remains a substantial challenge in realizing autonomous driving.Therefore,we present... While autonomous vehicles are vital components of intelligent transportation systems,ensuring the trustworthiness of decision-making remains a substantial challenge in realizing autonomous driving.Therefore,we present a novel robust reinforcement learning approach with safety guarantees to attain trustworthy decision-making for autonomous vehicles.The proposed technique ensures decision trustworthiness in terms of policy robustness and collision safety.Specifically,an adversary model is learned online to simulate the worst-case uncertainty by approximating the optimal adversarial perturbations on the observed states and environmental dynamics.In addition,an adversarial robust actor-critic algorithm is developed to enable the agent to learn robust policies against perturbations in observations and dynamics.Moreover,we devise a safety mask to guarantee the collision safety of the autonomous driving agent during both the training and testing processes using an interpretable knowledge model known as the Responsibility-Sensitive Safety Model.Finally,the proposed approach is evaluated through both simulations and experiments.These results indicate that the autonomous driving agent can make trustworthy decisions and drastically reduce the number of collisions through robust safety policies. 展开更多
关键词 Autonomous vehicle DECISION-MAKING reinforcement learning Adversarial attack Safety guarantee
下载PDF
Reinforced tissue matrix to strengthen the abdominal wall following reversal of temporary ostomies or to treat incisional hernias
9
作者 Spencer P Lake Corey R Deeken Amit K Agarwal 《World Journal of Gastrointestinal Surgery》 SCIE 2024年第3期823-832,共10页
BACKGROUND Abdominal wall deficiencies or weakness are a common complication of tem-porary ostomies,and incisional hernias frequently develop after colostomy or ileostomy takedown.The use of synthetic meshes to reinfo... BACKGROUND Abdominal wall deficiencies or weakness are a common complication of tem-porary ostomies,and incisional hernias frequently develop after colostomy or ileostomy takedown.The use of synthetic meshes to reinforce the abdominal wall has reduced hernia occurrence.Biologic meshes have also been used to enhance healing,particularly in contaminated conditions.Reinforced tissue matrices(R-TMs),which include a biologic scaffold of native extracellular matrix and a syn-thetic component for added strength/durability,are designed to take advantage of aspects of both synthetic and biologic materials.To date,RTMs have not been reported to reinforce the abdominal wall following stoma reversal.METHODS Twenty-eight patients were selected with a parastomal and/or incisional hernia who had received a temporary ileostomy or colostomy for fecal diversion after rectal cancer treatment or trauma.Following hernia repair and proximal stoma closure,RTM(OviTex®1S permanent or OviTex®LPR)was placed to reinforce the abdominal wall using a laparoscopic,robotic,or open surgical approach.Post-operative follow-up was performed at 1 month and 1 year.Hernia recurrence was determined by physical examination and,when necessary,via computed tomo-graphy scan.Secondary endpoints included length of hospital stay,time to return to work,and hospital readmissions.Evaluated complications of the wound/repair site included presence of surgical site infection,seroma,hematoma,wound dehiscence,or fistula formation.RESULTS The observational study cohort included 16 male and 12 female patients with average age of 58.5 years±16.3 years and average body mass index of 26.2 kg/m^(2)±4.1 kg/m^(2).Patients presented with a parastomal hernia(75.0%),in-cisional hernia(14.3%),or combined parastomal/incisional hernia(10.7%).Using a laparoscopic(53.6%),robotic(35.7%),or open(10.7%)technique,RTMs(OviTex®LPR:82.1%,OviTex®1S:17.9%)were placed using sublay(82.1%)or intraperitoneal onlay(IPOM;17.9%)mesh positioning.At 1-month and 1-year follow-ups,there were no hernia recurrences(0%).Average hospital stays were 2.1 d±1.2 d and return to work occurred at 8.3 post-operative days±3.0 post-operative days.Three patients(10.7%)were readmitted before the 1-month follow up due to mesh infection and/or gastrointestinal issues.Fistula and mesh infection were observed in two patients each(7.1%),leading to partial mesh removal in one patient(3.6%).There were no complications between 1 month and 1 year(0%).CONCLUSION RTMs were used successfully to treat parastomal and incisional hernias at ileostomy reversal,with no hernia recurrences and favorable outcomes after 1-month and 1-year. 展开更多
关键词 reinforced tissue matrix reinforced forestomach matrix ILEOSTOMY COLOSTOMY Ostomy takedown Incisional hernia Abdominal wall
下载PDF
Multi-circular formation control with reinforced transient profiles for nonholonomic vehicles:A path-following framework
10
作者 Jintao Zhang Xingling Shao +1 位作者 Wendong Zhang Zongyu Zuo 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2024年第1期278-287,共10页
This article investigates a multi-circular path-following formation control with reinforced transient profiles for nonholonomic vehicles connected by a digraph.A multi-circular formation controller endowed with the fe... This article investigates a multi-circular path-following formation control with reinforced transient profiles for nonholonomic vehicles connected by a digraph.A multi-circular formation controller endowed with the feature of spatial-temporal decoupling is devised for a group of vehicles guided by a virtual leader evolving along an implicit path,which allows for a circumnavigation on multiple circles with an anticipant angular spacing.In addition,notice that it typically imposes a stringent time constraint on time-sensitive enclosing scenarios,hence an improved prescribed performance control(IPPC)using novel tighter behavior boundaries is presented to enhance transient capabilities with an ensured appointed-time convergence free from any overshoots.The significant merits are that coordinated circumnavigation along different circles can be realized via executing geometric and dynamic assignments independently with modified transient profiles.Furthermore,all variables existing in the entire system are analyzed to be convergent.Simulation and experimental results are provided to validate the utility of suggested solution. 展开更多
关键词 Multi-circular formation reinforced transient profiles Nonholonomic vehicles Path following
下载PDF
Electrochemical Study of the Corrosion Inhibitory Capacity of Calcined Attapulgite in Reinforced Concrete Medium
11
作者 Malang Bodian Kinda Hannawi +3 位作者 Dame Keinde Modou Fall Aveline Darquennes Prince William Agbodjan 《Advances in Materials Physics and Chemistry》 CAS 2024年第5期76-94,共19页
The durability of reinforced concrete structures is greatly influenced by the corrosion of the reinforcement. In addition to air pollution related to the repair of corroded structures, chloride ions are the main facto... The durability of reinforced concrete structures is greatly influenced by the corrosion of the reinforcement. In addition to air pollution related to the repair of corroded structures, chloride ions are the main factors of corrosion of reinforced concrete structures. This study aims to valorize a clay inhibitor against reinforcement corrosion in reinforced concrete. This clay (Attapulgite) was incorporated into reinforced concretes at different percentages of substitution of calcined attapulgite (0%, 5% and 10%) to cement in the formulation. The corrosion inhibitory power of attapulgite is evaluated in reinforced concretes subjected to the action of chloride ions at different intervals in the NaCl solution (1 day, 21 days and 45 days) by electrochemical methods (zero current chronopotentiometry, polarization curves and electrochemical impedance spectroscopy). This study showed that in the presence of chloride ions, the composition based on 10% attapulgite has an appreciable inhibitory effect with an average inhibitory efficiency of 82%. 展开更多
关键词 ATTAPULGITE Electrochemical Methods INHIBITOR reinforced Concrete
下载PDF
Reinforcement learning based adaptive control for uncertain mechanical systems with asymptotic tracking
12
作者 Xiang-long Liang Zhi-kai Yao +1 位作者 Yao-wen Ge Jian-yong Yao 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2024年第4期19-28,共10页
This paper mainly focuses on the development of a learning-based controller for a class of uncertain mechanical systems modeled by the Euler-Lagrange formulation.The considered system can depict the behavior of a larg... This paper mainly focuses on the development of a learning-based controller for a class of uncertain mechanical systems modeled by the Euler-Lagrange formulation.The considered system can depict the behavior of a large class of engineering systems,such as vehicular systems,robot manipulators and satellites.All these systems are often characterized by highly nonlinear characteristics,heavy modeling uncertainties and unknown perturbations,therefore,accurate-model-based nonlinear control approaches become unavailable.Motivated by the challenge,a reinforcement learning(RL)adaptive control methodology based on the actor-critic framework is investigated to compensate the uncertain mechanical dynamics.The approximation inaccuracies caused by RL and the exogenous unknown disturbances are circumvented via a continuous robust integral of the sign of the error(RISE)control approach.Different from a classical RISE control law,a tanh(·)function is utilized instead of a sign(·)function to acquire a more smooth control signal.The developed controller requires very little prior knowledge of the dynamic model,is robust to unknown dynamics and exogenous disturbances,and can achieve asymptotic output tracking.Eventually,co-simulations through ADAMS and MATLAB/Simulink on a three degrees-of-freedom(3-DOF)manipulator and experiments on a real-time electromechanical servo system are performed to verify the performance of the proposed approach. 展开更多
关键词 Adaptive control reinforcement learning Uncertain mechanical systems Asymptotic tracking
下载PDF
Enhancing Image Description Generation through Deep Reinforcement Learning:Fusing Multiple Visual Features and Reward Mechanisms
13
作者 Yan Li Qiyuan Wang Kaidi Jia 《Computers, Materials & Continua》 SCIE EI 2024年第2期2469-2489,共21页
Image description task is the intersection of computer vision and natural language processing,and it has important prospects,including helping computers understand images and obtaining information for the visually imp... Image description task is the intersection of computer vision and natural language processing,and it has important prospects,including helping computers understand images and obtaining information for the visually impaired.This study presents an innovative approach employing deep reinforcement learning to enhance the accuracy of natural language descriptions of images.Our method focuses on refining the reward function in deep reinforcement learning,facilitating the generation of precise descriptions by aligning visual and textual features more closely.Our approach comprises three key architectures.Firstly,it utilizes Residual Network 101(ResNet-101)and Faster Region-based Convolutional Neural Network(Faster R-CNN)to extract average and local image features,respectively,followed by the implementation of a dual attention mechanism for intricate feature fusion.Secondly,the Transformer model is engaged to derive contextual semantic features from textual data.Finally,the generation of descriptive text is executed through a two-layer long short-term memory network(LSTM),directed by the value and reward functions.Compared with the image description method that relies on deep learning,the score of Bilingual Evaluation Understudy(BLEU-1)is 0.762,which is 1.6%higher,and the score of BLEU-4 is 0.299.Consensus-based Image Description Evaluation(CIDEr)scored 0.998,Recall-Oriented Understudy for Gisting Evaluation(ROUGE)scored 0.552,the latter improved by 0.36%.These results not only attest to the viability of our approach but also highlight its superiority in the realm of image description.Future research can explore the integration of our method with other artificial intelligence(AI)domains,such as emotional AI,to create more nuanced and context-aware systems. 展开更多
关键词 Image description deep reinforcement learning attention mechanism
下载PDF
Stability behavior of the Lanxi ancient flood control levee after reinforcement with upside-down hanging wells and grouting curtain
14
作者 QIN Zipeng TIAN Yan +4 位作者 GAO Siyuan ZHOU Jianfen HE Xiaohui HE Weizhong GAO Jingquan 《Journal of Mountain Science》 SCIE CSCD 2024年第1期84-99,共16页
The stability of the ancient flood control levees is mainly influenced by water level fluctuations, groundwater concentration and rainfalls. This paper takes the Lanxi ancient levee as a research object to study the e... The stability of the ancient flood control levees is mainly influenced by water level fluctuations, groundwater concentration and rainfalls. This paper takes the Lanxi ancient levee as a research object to study the evolution laws of its seepage, displacement and stability before and after reinforcement with the upside-down hanging wells and grouting curtain through numerical simulation methods combined with experiments and observations. The study results indicate that the filled soil is less affected by water level fluctuations and groundwater concentration after reinforcement. A high groundwater level is detrimental to the levee's long-term stability, and the drainage issues need to be fully considered. The deformation of the reinforced levee is effectively controlled since the fill deformation is mainly borne by the upside-down hanging wells. The safety factors of the levee before reinforcement vary significantly with the water level. The minimum value of the safety factors is 0.886 during the water level decreasing period, indicating a very high risk of the instability. While it reached 1.478 after reinforcement, the stability of the ancient levee is improved by a large margin. 展开更多
关键词 Stability analysis Multiple factors Antiseepage reinforcement Upside-down hanging well Grouting curtain Ancient levee
下载PDF
Policy Network-Based Dual-Agent Deep Reinforcement Learning for Multi-Resource Task Offloading in Multi-Access Edge Cloud Networks
15
作者 Feng Chuan Zhang Xu +2 位作者 Han Pengchao Ma Tianchun Gong Xiaoxue 《China Communications》 SCIE CSCD 2024年第4期53-73,共21页
The Multi-access Edge Cloud(MEC) networks extend cloud computing services and capabilities to the edge of the networks. By bringing computation and storage capabilities closer to end-users and connected devices, MEC n... The Multi-access Edge Cloud(MEC) networks extend cloud computing services and capabilities to the edge of the networks. By bringing computation and storage capabilities closer to end-users and connected devices, MEC networks can support a wide range of applications. MEC networks can also leverage various types of resources, including computation resources, network resources, radio resources,and location-based resources, to provide multidimensional resources for intelligent applications in 5/6G.However, tasks generated by users often consist of multiple subtasks that require different types of resources. It is a challenging problem to offload multiresource task requests to the edge cloud aiming at maximizing benefits due to the heterogeneity of resources provided by devices. To address this issue,we mathematically model the task requests with multiple subtasks. Then, the problem of task offloading of multi-resource task requests is proved to be NP-hard. Furthermore, we propose a novel Dual-Agent Deep Reinforcement Learning algorithm with Node First and Link features(NF_L_DA_DRL) based on the policy network, to optimize the benefits generated by offloading multi-resource task requests in MEC networks. Finally, simulation results show that the proposed algorithm can effectively improve the benefit of task offloading with higher resource utilization compared with baseline algorithms. 展开更多
关键词 benefit maximization deep reinforcement learning multi-access edge cloud task offloading
下载PDF
Quafu-RL:The cloud quantum computers based quantum reinforcement learning
16
作者 靳羽欣 许宏泽 +29 位作者 王正安 庄伟峰 黄凯旋 时运豪 马卫国 李天铭 陈驰通 许凯 冯玉龙 刘培 陈墨 李尚书 杨智鹏 钱辰 马运恒 肖骁 钱鹏 顾炎武 柴绪丹 普亚南 张翼鹏 魏世杰 曾进峰 李行 龙桂鲁 金贻荣 于海峰 范桁 刘东 胡孟军 《Chinese Physics B》 SCIE EI CAS CSCD 2024年第5期29-34,共6页
With the rapid advancement of quantum computing,hybrid quantum–classical machine learning has shown numerous potential applications at the current stage,with expectations of being achievable in the noisy intermediate... With the rapid advancement of quantum computing,hybrid quantum–classical machine learning has shown numerous potential applications at the current stage,with expectations of being achievable in the noisy intermediate-scale quantum(NISQ)era.Quantum reinforcement learning,as an indispensable study,has recently demonstrated its ability to solve standard benchmark environments with formally provable theoretical advantages over classical counterparts.However,despite the progress of quantum processors and the emergence of quantum computing clouds,implementing quantum reinforcement learning algorithms utilizing parameterized quantum circuits(PQCs)on NISQ devices remains infrequent.In this work,we take the first step towards executing benchmark quantum reinforcement problems on real devices equipped with at most 136 qubits on the BAQIS Quafu quantum computing cloud.The experimental results demonstrate that the policy agents can successfully accomplish objectives under modified conditions in both the training and inference phases.Moreover,we design hardware-efficient PQC architectures in the quantum model using a multi-objective evolutionary algorithm and develop a learning algorithm that is adaptable to quantum devices.We hope that the Quafu-RL can be a guiding example to show how to realize machine learning tasks by taking advantage of quantum computers on the quantum cloud platform. 展开更多
关键词 quantum cloud platform quantum reinforcement learning evolutionary quantum architecture search
下载PDF
Deep Reinforcement Learning-Based Task Offloading and Service Migrating Policies in Service Caching-Assisted Mobile Edge Computing
17
作者 Ke Hongchang Wang Hui +1 位作者 Sun Hongbin Halvin Yang 《China Communications》 SCIE CSCD 2024年第4期88-103,共16页
Emerging mobile edge computing(MEC)is considered a feasible solution for offloading the computation-intensive request tasks generated from mobile wireless equipment(MWE)with limited computational resources and energy.... Emerging mobile edge computing(MEC)is considered a feasible solution for offloading the computation-intensive request tasks generated from mobile wireless equipment(MWE)with limited computational resources and energy.Due to the homogeneity of request tasks from one MWE during a longterm time period,it is vital to predeploy the particular service cachings required by the request tasks at the MEC server.In this paper,we model a service caching-assisted MEC framework that takes into account the constraint on the number of service cachings hosted by each edge server and the migration of request tasks from the current edge server to another edge server with service caching required by tasks.Furthermore,we propose a multiagent deep reinforcement learning-based computation offloading and task migrating decision-making scheme(MBOMS)to minimize the long-term average weighted cost.The proposed MBOMS can learn the near-optimal offloading and migrating decision-making policy by centralized training and decentralized execution.Systematic and comprehensive simulation results reveal that our proposed MBOMS can converge well after training and outperforms the other five baseline algorithms. 展开更多
关键词 deep reinforcement learning mobile edge computing service caching service migrating
下载PDF
Cognitive interference decision method for air defense missile fuze based on reinforcement learning
18
作者 Dingkun Huang Xiaopeng Yan +2 位作者 Jian Dai Xinwei Wang Yangtian Liu 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2024年第2期393-404,共12页
To solve the problem of the low interference success rate of air defense missile radio fuzes due to the unified interference form of the traditional fuze interference system,an interference decision method based Q-lea... To solve the problem of the low interference success rate of air defense missile radio fuzes due to the unified interference form of the traditional fuze interference system,an interference decision method based Q-learning algorithm is proposed.First,dividing the distance between the missile and the target into multiple states to increase the quantity of state spaces.Second,a multidimensional motion space is utilized,and the search range of which changes with the distance of the projectile,to select parameters and minimize the amount of ineffective interference parameters.The interference effect is determined by detecting whether the fuze signal disappears.Finally,a weighted reward function is used to determine the reward value based on the range state,output power,and parameter quantity information of the interference form.The effectiveness of the proposed method in selecting the range of motion space parameters and designing the discrimination degree of the reward function has been verified through offline experiments involving full-range missile rendezvous.The optimal interference form for each distance state has been obtained.Compared with the single-interference decision method,the proposed decision method can effectively improve the success rate of interference. 展开更多
关键词 Cognitive radio Interference decision Radio fuze reinforcement learning Interference strategy optimization
下载PDF
Interfacial reinforcement of core-shell HMX@energetic polymer composites featuring enhanced thermal and safety performance
19
作者 Binghui Duan Hongchang Mo +3 位作者 Bojun Tan Xianming Lu Bozhou Wang Ning Liu 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2024年第1期387-399,共13页
The weak interface interaction and solid-solid phase transition have long been a conundrum for 1,3,5,7-tetranitro-1,3,5,7-tetraazacyclooctane(HMX)-based polymer-bonded explosives(PBX).A two-step strategy that involves... The weak interface interaction and solid-solid phase transition have long been a conundrum for 1,3,5,7-tetranitro-1,3,5,7-tetraazacyclooctane(HMX)-based polymer-bonded explosives(PBX).A two-step strategy that involves the pretreatment of HMX to endow—OH groups on the surface via polyalcohol bonding agent modification and in situ coating with nitrate ester-containing polymer,was proposed to address the problem.Two types of energetic polyether—glycidyl azide polymer(GAP)and nitrate modified GAP(GNP)were grafted onto HMX crystal based on isocyanate addition reaction bridged through neutral polymeric bonding agent(NPBA)layer.The morphology and structure of the HMX-based composites were characterized in detail and the core-shell structure was validated.The grafted polymers obviously enhanced the adhesion force between HMX crystals and fluoropolymer(F2314)binder.Due to the interfacial reinforcement among the components,the two HMX-based composites exhibited a remarkable increment of phase transition peak temperature by 10.2°C and 19.6°C with no more than 1.5%shell content,respectively.Furthermore,the impact and friction sensitivity of the composites decreased significantly as a result of the barrier produced by the grafted polymers.These findings will enhance the future prospects for the interface design of energetic composites aiming to solve the weak interface and safety concerns. 展开更多
关键词 HMX crystals Polyalcohol bonding agent Energetic polymer Core-shell structure Interfacial reinforcement
下载PDF
Decoding topological XYZ^(2) codes with reinforcement learning based on attention mechanisms
20
作者 陈庆辉 姬宇欣 +2 位作者 王柯涵 马鸿洋 纪乃华 《Chinese Physics B》 SCIE EI CAS CSCD 2024年第6期262-270,共9页
Quantum error correction, a technique that relies on the principle of redundancy to encode logical information into additional qubits to better protect the system from noise, is necessary to design a viable quantum co... Quantum error correction, a technique that relies on the principle of redundancy to encode logical information into additional qubits to better protect the system from noise, is necessary to design a viable quantum computer. For this new topological stabilizer code-XYZ^(2) code defined on the cellular lattice, it is implemented on a hexagonal lattice of qubits and it encodes the logical qubits with the help of stabilizer measurements of weight six and weight two. However topological stabilizer codes in cellular lattice quantum systems suffer from the detrimental effects of noise due to interaction with the environment. Several decoding approaches have been proposed to address this problem. Here, we propose the use of a state-attention based reinforcement learning decoder to decode XYZ^(2) codes, which enables the decoder to more accurately focus on the information related to the current decoding position, and the error correction accuracy of our reinforcement learning decoder model under the optimisation conditions can reach 83.27% under the depolarizing noise model, and we have measured thresholds of 0.18856 and 0.19043 for XYZ^(2) codes at code spacing of 3–7 and 7–11, respectively. our study provides directions and ideas for applications of decoding schemes combining reinforcement learning attention mechanisms to other topological quantum error-correcting codes. 展开更多
关键词 quantum error correction topological quantum stabilizer code reinforcement learning attention mechanism
下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部