期刊文献+
共找到16篇文章
< 1 >
每页显示 20 50 100
三轴MEMS陀螺仪轨迹跟踪的自耦PID控制方法
1
作者 熊洛 曾喆昭 《传感技术学报》 CAS CSCD 北大核心 2024年第9期1586-1592,共7页
为了有效提高三轴微机械(Micro-Electro-Mechanic System,MEMS)陀螺仪的抗干扰能力和检测精度,基于自耦PID控制理论提出了一种简单的陀螺仪控制方法。该方法将三轴MEMS陀螺仪控制问题分解为三个严格反馈子系统的控制问题,然后将三个子... 为了有效提高三轴微机械(Micro-Electro-Mechanic System,MEMS)陀螺仪的抗干扰能力和检测精度,基于自耦PID控制理论提出了一种简单的陀螺仪控制方法。该方法将三轴MEMS陀螺仪控制问题分解为三个严格反馈子系统的控制问题,然后将三个子系统已知或未知动态、其不确定性和外部有界扰动分别定义为三个总扰动,进而将三个子系统等价映射为三个二阶线性扰动系统,据此分别构建了在总扰动反相激励下的三个受控误差系统,并根据自耦PID控制理论分别设计了三个自耦PD控制器,最后分析了每个子系统的鲁棒稳定性和抗扰动鲁棒性。仿真实验证明了所提控制方法的有效性,其在MEMS陀螺仪控制系统领域具有良好的实际应用前景。 展开更多
关键词 三轴 MEMS 陀螺仪 轨迹跟踪 自耦 PID 控制 自适应速度因子
下载PDF
A Survey of Knowledge Graph Construction Using Machine Learning
2
作者 Zhigang Zhao xiong luo +1 位作者 Maojian Chen Ling Ma 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第4期225-257,共33页
Knowledge graph(KG)serves as a specialized semantic network that encapsulates intricate relationships among real-world entities within a structured framework.This framework facilitates a transformation in information ... Knowledge graph(KG)serves as a specialized semantic network that encapsulates intricate relationships among real-world entities within a structured framework.This framework facilitates a transformation in information retrieval,transitioning it from mere string matching to far more sophisticated entity matching.In this transformative process,the advancement of artificial intelligence and intelligent information services is invigorated.Meanwhile,the role ofmachine learningmethod in the construction of KG is important,and these techniques have already achieved initial success.This article embarks on a comprehensive journey through the last strides in the field of KG via machine learning.With a profound amalgamation of cutting-edge research in machine learning,this article undertakes a systematical exploration of KG construction methods in three distinct phases:entity learning,ontology learning,and knowledge reasoning.Especially,a meticulous dissection of machine learningdriven algorithms is conducted,spotlighting their contributions to critical facets such as entity extraction,relation extraction,entity linking,and link prediction.Moreover,this article also provides an analysis of the unresolved challenges and emerging trajectories that beckon within the expansive application of machine learning-fueled,large-scale KG construction. 展开更多
关键词 Knowledge graph(KG) semantic network relation extraction entity linking knowledge reasoning
下载PDF
Introduction to the Special Issue on Machine Learning-Guided Intelligent Modeling with Its Industrial Applications
3
作者 xiong luo Yongqiang Cheng Zhifang Liao 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第10期7-11,共5页
With the advancement of Artificial Intelligence(AI)technology,traditional industrial systems are undergoing an intelligent transformation,bringing together advanced computing,communication and control technologies,Mac... With the advancement of Artificial Intelligence(AI)technology,traditional industrial systems are undergoing an intelligent transformation,bringing together advanced computing,communication and control technologies,Machine Learning(ML)-based intelligentmodelling has become a newparadigm for solving problems in the industrial domain[1–3].With numerous applications and diverse data types in the industrial domain,algorithmic and data-driven ML techniques can intelligently learn potential correlations between complex data and make efficient decisions while reducing human intervention.However,in real-world application scenarios,existing algorithms may have a variety of limitations,such as small data volumes,small detection targets,low efficiency,and algorithmic gaps in specific application domains[4].Therefore,many new algorithms and strategies have been proposed to address the challenges in industrial applications[5–8]. 展开更多
关键词 INTELLIGENCE bringing INTELLIGENT
下载PDF
A Quantized Kernel Least Mean Square Scheme with Entropy-Guided Learning for Intelligent Data Analysis 被引量:5
4
作者 xiong luo Jing Deng +3 位作者 Ji Liu Weiping Wang Xiaojuan Ban Jenq-Haur Wang 《China Communications》 SCIE CSCD 2017年第7期127-136,共10页
Quantized kernel least mean square(QKLMS) algorithm is an effective nonlinear adaptive online learning algorithm with good performance in constraining the growth of network size through the use of quantization for inp... Quantized kernel least mean square(QKLMS) algorithm is an effective nonlinear adaptive online learning algorithm with good performance in constraining the growth of network size through the use of quantization for input space. It can serve as a powerful tool to perform complex computing for network service and application. With the purpose of compressing the input to further improve learning performance, this article proposes a novel QKLMS with entropy-guided learning, called EQ-KLMS. Under the consecutive square entropy learning framework, the basic idea of entropy-guided learning technique is to measure the uncertainty of the input vectors used for QKLMS, and delete those data with larger uncertainty, which are insignificant or easy to cause learning errors. Then, the dataset is compressed. Consequently, by using square entropy, the learning performance of proposed EQ-KLMS is improved with high precision and low computational cost. The proposed EQ-KLMS is validated using a weather-related dataset, and the results demonstrate the desirable performance of our scheme. 展开更多
关键词 quantized kernel least mean square (QKLMS) consecutive square entropy data analysis
下载PDF
Effective short text classification via the fusion of hybrid features for IoT social data 被引量:3
5
作者 xiong luo Zhijian Yu +2 位作者 Zhigang Zhao Wenbing Zhao Jenq-Haur Wang 《Digital Communications and Networks》 SCIE CSCD 2022年第6期942-954,共13页
Nowadays short texts can be widely found in various social data in relation to the 5G-enabled Internet of Things (IoT). Short text classification is a challenging task due to its sparsity and the lack of context. Prev... Nowadays short texts can be widely found in various social data in relation to the 5G-enabled Internet of Things (IoT). Short text classification is a challenging task due to its sparsity and the lack of context. Previous studies mainly tackle these problems by enhancing the semantic information or the statistical information individually. However, the improvement achieved by a single type of information is limited, while fusing various information may help to improve the classification accuracy more effectively. To fuse various information for short text classification, this article proposes a feature fusion method that integrates the statistical feature and the comprehensive semantic feature together by using the weighting mechanism and deep learning models. In the proposed method, we apply Bidirectional Encoder Representations from Transformers (BERT) to generate word vectors on the sentence level automatically, and then obtain the statistical feature, the local semantic feature and the overall semantic feature using Term Frequency-Inverse Document Frequency (TF-IDF) weighting approach, Convolutional Neural Network (CNN) and Bidirectional Gate Recurrent Unit (BiGRU). Then, the fusion feature is accordingly obtained for classification. Experiments are conducted on five popular short text classification datasets and a 5G-enabled IoT social dataset and the results show that our proposed method effectively improves the classification performance. 展开更多
关键词 Information fusion Short text classi fication BERT Bidirectional encoder representations fr 0om transformers Deep learning Social data
下载PDF
Towards improving detection performance for malware with a correntropy-based deep learning method 被引量:2
6
作者 xiong luo Jianyuan Li +2 位作者 Weiping Wang Yang Gao Wenbing Zhao 《Digital Communications and Networks》 SCIE CSCD 2021年第4期570-579,共10页
With the rapid development of Internet of Things(IoT)technologies,the detection and analysis of malware have become a matter of concern in the industrial application of Cyber-Physical System(CPS)that provides various ... With the rapid development of Internet of Things(IoT)technologies,the detection and analysis of malware have become a matter of concern in the industrial application of Cyber-Physical System(CPS)that provides various services using the IoT paradigm.Currently,many advanced machine learning methods such as deep learning are popular in the research of malware detection and analysis,and some achievements have been made so far.However,there are also some problems.For example,considering the noise and outliers in the existing datasets of malware,some methods are not robust enough.Therefore,the accuracy of malware classification still needs to be improved.Aiming at this issue,we propose a novel method that combines the correntropy and the deep learning model.In our proposed method for malware detection and analysis,given the success of the mixture correntropy as an effective similarity measure in addressing complex datasets with noise,it is therefore incorporated into a popular deep learning model,i.e.,Convolutional Neural Network(CNN),to reconstruct its loss function,with the purpose of further detecting the features of outliers.We present the detailed design process of our method.Furthermore,the proposed method is tested both on a real-world malware dataset and a popular benchmark dataset to verify its learning performance. 展开更多
关键词 Malware detection Mixture correntropy Deep learning Convolutional neural network(CNN)
下载PDF
A Novel Named Entity Recognition Scheme for Steel E-Commerce Platforms Using a Lite BERT 被引量:1
7
作者 Maojian Chen xiong luo +2 位作者 Hailun Shen Ziyang Huang Qiaojuan Peng 《Computer Modeling in Engineering & Sciences》 SCIE EI 2021年第10期47-63,共17页
In the era of big data,E-commerce plays an increasingly important role,and steel E-commerce certainly occupies a positive position.However,it is very difficult to choose satisfactory steel raw materials from diverse s... In the era of big data,E-commerce plays an increasingly important role,and steel E-commerce certainly occupies a positive position.However,it is very difficult to choose satisfactory steel raw materials from diverse steel commodities online on steel E-commerce platforms in the purchase of staffs.In order to improve the efficiency of purchasers searching for commodities on the steel E-commerce platforms,we propose a novel deep learning-based loss function for named entity recognition(NER).Considering the impacts of small sample and imbalanced data,in our NER scheme,the focal loss,the label smoothing,and the cross entropy are incorporated into a lite bidirectional encoder representations from transformers(BERT)model to avoid the over-fitting.Moreover,through the analysis of different classic annotation techniques used to tag data,an ideal one is chosen for the training model in our proposed scheme.Experiments are conducted on Chinese steel E-commerce datasets.The experimental results show that the training time of a lite BERT(ALBERT)-based method is much shorter than that of BERT-based models,while achieving the similar computational performance in terms of metrics precision,recall,and F1 with BERT-based models.Meanwhile,our proposed approach performs much better than that of combining Word2Vec,bidirectional long short-term memory(Bi-LSTM),and conditional random field(CRF)models,in consideration of training time and F1. 展开更多
关键词 Named entity recognition bidirectional encoder representations from transformers steel E-commerce platform annotation technique
下载PDF
Transferable Features from 1D-Convolutional Network for Industrial Malware Classification
8
作者 LiweiWang Jiankun Sun +1 位作者 xiong luo Xi Yang 《Computer Modeling in Engineering & Sciences》 SCIE EI 2022年第2期1003-1016,共14页
With the development of information technology,malware threats to the industrial system have become an emergent issue,since various industrial infrastructures have been deeply integrated into our modern works and live... With the development of information technology,malware threats to the industrial system have become an emergent issue,since various industrial infrastructures have been deeply integrated into our modern works and lives.To identify and classify new malware variants,different types of deep learning models have been widely explored recently.Generally,sufficient data is usually required to achieve a well-trained deep learning classifier with satisfactory generalization ability.However,in current practical applications,an ample supply of data is absent in most specific industrial malware detection scenarios.Transfer learning as an effective approach can be used to alleviate the influence of the small sample size problem.In addition,it can also reuse the knowledge from pretrained models,which is beneficial to the real-time requirement in industrial malware detection.In this paper,we investigate the transferable features learned by a 1D-convolutional network and evaluate our proposed methods on 6 transfer learning tasks.The experiment results show that 1D-convolutional architecture is effective to learn transferable features for malware classification,and indicate that transferring the first 2 layers of our proposed 1D-convolutional network is the most efficient way to reuse the learned features. 展开更多
关键词 Transfer learning malware classification sequence data modeling convolutional network
下载PDF
A Novel Hidden Danger Prediction Method in CloudBased Intelligent Industrial Production Management Using Timeliness Managing Extreme Learning Machine
9
作者 xiong luo Xiaona Yang +3 位作者 Weiping Wang Xiaohui Chang Xinyan Wang Zhigang Zhao 《China Communications》 SCIE CSCD 2016年第7期74-82,共9页
To prevent possible accidents,the study of data-driven analytics to predict hidden dangers in cloud service-based intelligent industrial production management has been the subject of increasing interest recently.A mac... To prevent possible accidents,the study of data-driven analytics to predict hidden dangers in cloud service-based intelligent industrial production management has been the subject of increasing interest recently.A machine learning algorithm that uses timeliness managing extreme learning machine is utilized in this article to achieve the above prediction.Compared with traditional learning algorithms,extreme learning machine(ELM) exhibits high performance because of its unique feature of a high generalization capability at a fast learning speed.Timeliness managing ELM is proposed by incorporating timeliness management scheme into ELM.When using the timeliness managing ELM scheme to predict hidden dangers,newly incremental data could be added prior to the historical data to maximize the contribution of the newly incremental training data,because the incremental data may be able to contribute reasonable weights to represent the current production situation according to practical analysis of accidents in some industrial productions.Experimental results from a coal mine show that the use of timeliness managing ELM can improve the prediction accuracy of hidden dangers with better stability compared with other similar machine learning methods. 展开更多
关键词 prediction incremental learning extreme learning machine cloud service
下载PDF
Magnetic phase transition and continuous spin switching in a high-entropy orthoferrite single crystal
10
作者 Wanting Yang Shuang Zhu +9 位作者 xiong luo Xiaoxuan Ma Chenfei Shi Huan Song Zhiqiang Sun Yefei Guo Yuriy Dedkov Baojuan Kang Jin-Ke Bao Shixun Cao 《Frontiers of physics》 SCIE CSCD 2024年第2期157-167,共11页
Rare-earth orthoferrite REFeO_(3)(where RE is a rare-earth ion)is gaining interest.We created a high-entropy orthoferrite(Tm_(0.2)Nd_(0.2)Dy_(0.2)Y_(0.2)Yb_(0.2))FeO_(3)(HEOR)by doping five RE ions in equimolar ratios... Rare-earth orthoferrite REFeO_(3)(where RE is a rare-earth ion)is gaining interest.We created a high-entropy orthoferrite(Tm_(0.2)Nd_(0.2)Dy_(0.2)Y_(0.2)Yb_(0.2))FeO_(3)(HEOR)by doping five RE ions in equimolar ratios and grew the single crystal by optical floating zone method.It strongly tends to form a single-phase structure stabilized by high configurational entropy.In the low-temperature region(11.6‒14.4 K),the spin reorientation transition(SRT)ofΓ_(2)(F_(x),C_(y),G_(z))‒Γ_(24)‒Γ_(4)(G_(x),A_(y),F_(z))occurs.The weak ferromagnetic(FM)moment,which comes from the Fe sublattices distortion,rotates from the a-to c-axis.The two-step dynamic processes(Γ_(2)‒Γ_(24)‒Γ_(4))are identified by AC susceptibility measurements.SRT in HEOR can be tuned in the range of 50‒60000 Oe,which is an order of magnitude larger than that of orthoferrites in the peer system,making it a candidate for high-field spin sensing.Typical spin-switching(SSW)and continuous spin-switching(CSSW)effects occur under low magnetic fields due to the strong interactions between RE‒Fe sublattices.The CSSW effect is tunable between 20‒50 Oe,and hence,HEOR potentially can be applied to spin modulation devices.Furthermore,because of the strong anisotropy of magnetic entropy change()and refrigeration capacity(RC)based on its high configurational entropy,HEOR is expected to provide a novel approach for refrigeration by altering the orientations of the crystallographic axes(anisotropic configurational entropy). 展开更多
关键词 high-entropy oxide rare-earth orthoferrite spin reorientation transition spin switching magnetocaloric effect
原文传递
Damaged apple detection with a hybrid YOLOv3 algorithm
11
作者 Meng Zhang Huazhao Liang +3 位作者 Zhongju Wang Long Wang Chao Huang xiong luo 《Information Processing in Agriculture》 EI CSCD 2024年第2期163-171,共9页
This paper proposes an improved You Only Look Once(YOLOv3)algorithm for automatically detecting damaged apples to promote the automation of the fruit processing industry.In the proposed method,a clustering method base... This paper proposes an improved You Only Look Once(YOLOv3)algorithm for automatically detecting damaged apples to promote the automation of the fruit processing industry.In the proposed method,a clustering method based on Rao-1 algorithm is introduced to optimize anchor box sizes.The clustering method uses the intersection over the union to form the objective function and the most representative anchor boxes are generated for normal and damaged apple detection.To verify the feasibility and effectiveness of the proposed method,real apple images collected from the Internet are employed.Compared with the generic YOLOv3 and Fast Region-based Convolutional Neural Network(Fast R-CNN)algorithms,the proposed method yields the highest mean average precision value for the test dataset.Therefore,it is practical to apply the proposed method for intelligent apple detection and classification tasks. 展开更多
关键词 Rao algorithm Apple detection CLUSTERING Smart agriculture
原文传递
OsFAD1–OsMYBR22 modulates clustered spikelet through regulating BRD3 in rice
12
作者 Mingxing Cheng Huanran Yuan +5 位作者 Ruihua Wang Fengfeng Fan Fengfeng Si xiong luo Wei Liu Shaoqing Li 《Journal of Integrative Plant Biology》 SCIE CAS CSCD 2024年第11期2325-2328,共4页
Grain number per panicle,a crucial component for rice grain yield,is usually determined by the primary and secondary branches of spikelets.Normally,the tips of the primary and secondary branches of spikelets are occup... Grain number per panicle,a crucial component for rice grain yield,is usually determined by the primary and secondary branches of spikelets.Normally,the tips of the primary and secondary branches of spikelets are occupied by a single grain.Occasionally,this phenotype can be disrupted by two or three complete spikelets or grains clustered on the branch tips,significantly altering the spatial arrangement of spikelet and the structure of the panicle,thus impacting grain yield (Ren et al.,2017,2020;Zhang et al.,2017). 展开更多
关键词 PANICLE arrangement GRAIN
原文传递
Mixed Deep Reinforcement Learning Considering Discrete-continuous Hybrid Action Space for Smart Home Energy Management 被引量:5
13
作者 Chao Huang Hongcai Zhang +2 位作者 Long Wang xiong luo Yonghua Song 《Journal of Modern Power Systems and Clean Energy》 SCIE EI CSCD 2022年第3期743-754,共12页
This paper develops deep reinforcement learning(DRL)algorithms for optimizing the operation of home energy system which consists of photovoltaic(PV)panels,battery energy storage system,and household appliances.Model-f... This paper develops deep reinforcement learning(DRL)algorithms for optimizing the operation of home energy system which consists of photovoltaic(PV)panels,battery energy storage system,and household appliances.Model-free DRL algorithms can efficiently handle the difficulty of energy system modeling and uncertainty of PV generation.However,discretecontinuous hybrid action space of the considered home energy system challenges existing DRL algorithms for either discrete actions or continuous actions.Thus,a mixed deep reinforcement learning(MDRL)algorithm is proposed,which integrates deep Q-learning(DQL)algorithm and deep deterministic policy gradient(DDPG)algorithm.The DQL algorithm deals with discrete actions,while the DDPG algorithm handles continuous actions.The MDRL algorithm learns optimal strategy by trialand-error interactions with the environment.However,unsafe actions,which violate system constraints,can give rise to great cost.To handle such problem,a safe-MDRL algorithm is further proposed.Simulation studies demonstrate that the proposed MDRL algorithm can efficiently handle the challenge from discrete-continuous hybrid action space for home energy management.The proposed MDRL algorithm reduces the operation cost while maintaining the human thermal comfort by comparing with benchmark algorithms on the test dataset.Moreover,the safe-MDRL algorithm greatly reduces the loss of thermal comfort in the learning stage by the proposed MDRL algorithm. 展开更多
关键词 Demand response deep reinforcement learning discrete-continuous action space home energy management safe reinforcement learning
原文传递
Towards Rehabilitation at Home After Total Knee Replacement 被引量:1
14
作者 Wenbing Zhao Shunkun Yang xiong luo 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2021年第6期791-799,共9页
In this paper,we present the design and implementation of an avatar-based interactive system that facilitates rehabilitation for people who have received total knee replacement surgeries.The system empowers patients t... In this paper,we present the design and implementation of an avatar-based interactive system that facilitates rehabilitation for people who have received total knee replacement surgeries.The system empowers patients to carry out exercises prescribed by a clinician at the home settings more effectively.Our system helps improve accountability for both patients and clinicians.The primary sensing modality is the Microsoft Kinect sensor,which is a depth camera that comes with a Software Development Kit(SDK).The SDK provides access to 3-dimensional skeleton joint positions to software developers,which significantly reduces the challenges in developing accurate motion tracking systems,especially for use at home.However,the Kinect sensor is not wellequipped to track foot orientation and its subtle movements.To overcome this issue,we augment the system with a commercial off-the-shelf Inertial Measurement Unit(IMU).The two sensing modalities are integrated where the Kinect serves as the primary sensing modality and the IMU is used for exercises where Kinect fails to produce accurate measurement.In this pilot study,we experiment with four rehabilitation exercises,namely,quad set,side-lying hip abduction,straight raise leg,and ankle pump.The Kinect is used to assess the first three exercises,and the IMU is used to assess the ankle pump exercise. 展开更多
关键词 REHABILITATION physical therapy total knee replacement AVATAR virtual reality repetition count range of motion
原文传递
A Heterogeneous Ensemble of Extreme Learning Machines with Correntropy and Negative Correlation 被引量:2
15
作者 Adnan O.M.Abuassba Yao Zhang +2 位作者 xiong luo Dezheng Zhang Wulamu Aziguli 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2017年第6期691-701,共11页
The Extreme Learning Machine(ELM) is an effective learning algorithm for a Single-Layer Feedforward Network(SLFN). It performs well in managing some problems due to its fast learning speed. However, in practical a... The Extreme Learning Machine(ELM) is an effective learning algorithm for a Single-Layer Feedforward Network(SLFN). It performs well in managing some problems due to its fast learning speed. However, in practical applications, its performance might be affected by the noise in the training data. To tackle the noise issue, we propose a novel heterogeneous ensemble of ELMs in this article. Specifically, the correntropy is used to achieve insensitive performance to outliers, while implementing Negative Correlation Learning(NCL) to enhance diversity among the ensemble. The proposed Heterogeneous Ensemble of ELMs(HE2 LM) for classification has different ELM algorithms including the Regularized ELM(RELM), the Kernel ELM(KELM), and the L2-norm-optimized ELM(ELML2). The ensemble is constructed by training a randomly selected ELM classifier on a subset of the training data selected through random resampling. Then, the class label of unseen data is predicted using a maximum weighted sum approach. After splitting the training data into subsets, the proposed HE2 LM is tested through classification and regression tasks on real-world benchmark datasets and synthetic datasets. Hence, the simulation results show that compared with other algorithms, our proposed method can achieve higher prediction accuracy, better generalization, and less sensitivity to outliers. 展开更多
关键词 Extreme Learning Machine(ELM) ensemble classification correntropy negative correlation
原文传递
LIDAR:learning from imperfect demonstrations with advantage rectification
16
作者 Xiaoqin ZHANG Huimin MA +1 位作者 xiong luo Jian YUAN 《Frontiers of Computer Science》 SCIE EI CSCD 2022年第1期57-66,共10页
In actor-critic reinforcement learning(RL)algorithms,function estimation errors are known to cause ineffective random exploration at the beginning of training,and lead to overestimated value estimates and suboptimal p... In actor-critic reinforcement learning(RL)algorithms,function estimation errors are known to cause ineffective random exploration at the beginning of training,and lead to overestimated value estimates and suboptimal policies.In this paper,we address the problem by executing advantage rectification with imperfect demonstrations,thus reducing the function estimation errors.Pretraining with expert demonstrations has been widely adopted to accelerate the learning process of deep reinforcement learning when simulations are expensive to obtain.However,existing methods,such as behavior cloning,often assume the demonstrations contain other information or labels with regard to performances,such as optimal assumption,which is usually incorrect and useless in the real world.In this paper,we explicitly handle imperfect demonstrations within the actor-critic RL frameworks,and propose a new method called learning from imperfect demonstrations with advantage rectification(LIDAR).LIDAR utilizes a rectified loss function to merely learn from selective demonstrations,which is derived from a minimal assumption that the demonstrating policies have better performances than our current policy.LIDAR learns from contradictions caused by estimation errors,and in turn reduces estimation errors.We apply LIDAR to three popular actor-critic algorithms,DDPG,TD3 and SAC,and experiments show that our method can observably reduce the function estimation errors,effectively leverage demonstrations far from the optimal,and outperform state-of-the-art baselines consistently in all the scenarios. 展开更多
关键词 learning from demonstrations actor-critic reinforcement learning advantage rectification
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部