Parking in a small parking lot within limited space poses a difficult task. It often leads to deviations between the final parking posture and the target posture. These deviations can lead to partial occupancy of adja...Parking in a small parking lot within limited space poses a difficult task. It often leads to deviations between the final parking posture and the target posture. These deviations can lead to partial occupancy of adjacent parking lots, which poses a safety threat to vehicles parked in these parking lots. However, previous studies have not addressed this issue. In this paper, we aim to evaluate the impact of parking deviation of existing vehicles next to the target parking lot(PDEVNTPL) on the automatic ego vehicle(AEV) parking, in terms of safety, comfort, accuracy, and efficiency of parking. A segmented parking training framework(SPTF) based on soft actor-critic(SAC) is proposed to improve parking performance. In the proposed method, the SAC algorithm incorporates strategy entropy into the objective function, to enable the AEV to learn parking strategies based on a more comprehensive understanding of the environment. Additionally, the SPTF simplifies complex parking tasks to maintain the high performance of deep reinforcement learning(DRL). The experimental results reveal that the PDEVNTPL has a detrimental influence on the AEV parking in terms of safety, accuracy, and comfort, leading to reductions of more than 27%, 54%, and 26%respectively. However, the SAC-based SPTF effectively mitigates this impact, resulting in a considerable increase in the parking success rate from 71% to 93%. Furthermore, the heading angle deviation is significantly reduced from 2.25 degrees to 0.43degrees.展开更多
Autonomous driving has witnessed rapid advancement;however,ensuring safe and efficient driving in intricate scenarios remains a critical challenge.In particular,traffic roundabouts bring a set of challenges to autonom...Autonomous driving has witnessed rapid advancement;however,ensuring safe and efficient driving in intricate scenarios remains a critical challenge.In particular,traffic roundabouts bring a set of challenges to autonomous driving due to the unpredictable entry and exit of vehicles,susceptibility to traffic flow bottlenecks,and imperfect data in perceiving environmental information,rendering them a vital issue in the practical application of autonomous driving.To address the traffic challenges,this work focused on complex roundabouts with multi-lane and proposed a Perception EnhancedDeepDeterministic Policy Gradient(PE-DDPG)for AutonomousDriving in the Roundabouts.Specifically,themodel incorporates an enhanced variational autoencoder featuring an integrated spatial attention mechanism alongside the Deep Deterministic Policy Gradient framework,enhancing the vehicle’s capability to comprehend complex roundabout environments and make decisions.Furthermore,the PE-DDPG model combines a dynamic path optimization strategy for roundabout scenarios,effectively mitigating traffic bottlenecks and augmenting throughput efficiency.Extensive experiments were conducted with the collaborative simulation platform of CARLA and SUMO,and the experimental results show that the proposed PE-DDPG outperforms the baseline methods in terms of the convergence capacity of the training process,the smoothness of driving and the traffic efficiency with diverse traffic flow patterns and penetration rates of autonomous vehicles(AVs).Generally,the proposed PE-DDPGmodel could be employed for autonomous driving in complex scenarios with imperfect data.展开更多
Actor-Critic是一种强化学习方法,通过与环境在线试错交互收集样本来学习策略,是求解序贯感知决策问题的有效手段.但是,这种在线交互的主动学习范式在一些复杂真实环境中收集样本时会带来成本和安全问题离线强化学习作为一种基于数据驱...Actor-Critic是一种强化学习方法,通过与环境在线试错交互收集样本来学习策略,是求解序贯感知决策问题的有效手段.但是,这种在线交互的主动学习范式在一些复杂真实环境中收集样本时会带来成本和安全问题离线强化学习作为一种基于数据驱动的强化学习范式,强调从静态样本数据集中学习策略,与环境无探索交互,为机器人、自动驾驶、健康护理等真实世界部署应用提供了可行的解决方案,是近年来的研究热点.目前,离线强化学习方法存在学习策略和行为策略之间的分布偏移挑战,针对这个挑战,通常采用策略约束或值函数正则化来限制访问数据集分布之外(Out-Of-Distribution,OOD)的动作,从而导致学习性能过于保守,阻碍了值函数网络的泛化和学习策略的性能提升.为此,本文利用不确定性估计和OOD采样来平衡值函数学习的泛化性和保守性,提出一种基于不确定性估计的离线确定型Actor-Critic方法(Offline Deterministic Actor-Critic based on UncertaintyEstimation,ODACUE).首先,针对确定型策略,给出一种Q值函数的不确定性估计算子定义,理论证明了该算子学到的Q值函数是最优Q值函数的一种悲观估计.然后,将不确定性估计算子应用于确定型Actor-Critic框架中,通过对不确定性估计算子进行凸组合构造Critic学习的目标函数.最后,D4RL基准数据集任务上的实验结果表明:相较于对比算法,ODACUE在11个不同质量等级数据集任务中的总体性能提升最低达9.56%,最高达64.92%.此外,参数分析和消融实验进一步验证了ODACUE的稳定性和泛化能力.展开更多
In this study,a novel residential virtual power plant(RVPP)scheduling method that leverages a gate recurrent unit(GRU)-integrated deep reinforcement learning(DRL)algorithm is proposed.In the proposed scheme,the GRU-in...In this study,a novel residential virtual power plant(RVPP)scheduling method that leverages a gate recurrent unit(GRU)-integrated deep reinforcement learning(DRL)algorithm is proposed.In the proposed scheme,the GRU-integrated DRL algorithm guides the RVPP to participate effectively in both the day-ahead and real-time markets,lowering the electricity purchase costs and consumption risks for end-users.The Lagrangian relaxation technique is introduced to transform the constrained Markov decision process(CMDP)into an unconstrained optimization problem,which guarantees that the constraints are strictly satisfied without determining the penalty coefficients.Furthermore,to enhance the scalability of the constrained soft actor-critic(CSAC)-based RVPP scheduling approach,a fully distributed scheduling architecture was designed to enable plug-and-play in the residential distributed energy resources(RDER).Case studies performed on the constructed RVPP scenario validated the performance of the proposed methodology in enhancing the responsiveness of the RDER to power tariffs,balancing the supply and demand of the power grid,and ensuring customer comfort.展开更多
Background:Combined knee valgus and tibial internal rotation(VL+IR)moments have been shown to stress the anterior cruciate ligament(ACL)in several in vitro cadaveric studies.To utilize this knowledge for non-contact A...Background:Combined knee valgus and tibial internal rotation(VL+IR)moments have been shown to stress the anterior cruciate ligament(ACL)in several in vitro cadaveric studies.To utilize this knowledge for non-contact ACL injury prevention in sports,it is necessary to elucidate how the ground reaction force(GRF)acting point(center of pressure(CoP))in the stance foot produces combined knee VL+IR moments in risky maneuvers,such as cuttings.However,the effects of the GRF acting point on the development of the combined knee VL+IR moment in cutting are still unknown.Methods:We first established the deterministic mechanical condition that the CoP position relative to the tibial rotational axis differentiates the GRF vector’s directional probability for developing the combined knee VL+IR moment,and theoretically predicted that when the CoP is posterior to the tibial rotational axis,the GRF vector is more likely to produce the combined knee VL+IR moment than when the CoP is anterior to the tibial rotational axis.Then,we tested a stochastic aspect of our theory in a lab-controlled in vivo experiment.Fourteen females performed 60˚cutting under forefoot/rearfoot strike conditions(10 trials each).The positions of lower limb markers and GRF data were measured,and the knee moment due to GRF vector was calculated.The trials were divided into anterior-and posterior-CoP groups depending on the CoP position relative to the tibial rotational axis at each 10 ms interval from 0 to 100 ms after foot strike,and the occurrence rate of the combined knee VL+IR moment was compared between trial groups.Results:The posterior-CoP group showed significantly higher occurrence rates of the combined knee VL+IR moment(maximum of 82.8%)at every time point than those of the anterior-CoP trials,as theoretically predicted by the deterministic mechanical condition.Conclusion:The rearfoot strikes inducing the posterior CoP should be avoided to reduce the risk of non-contact ACL injury associated with the combined knee VL+IR stress.展开更多
Glaucoma is a group of ocular atrophy diseases that cause progressive vision loss by affecting the optic nerve.Because of its asymptomatic nature,glaucoma has become the leading cause of human blindness worldwide.In t...Glaucoma is a group of ocular atrophy diseases that cause progressive vision loss by affecting the optic nerve.Because of its asymptomatic nature,glaucoma has become the leading cause of human blindness worldwide.In this paper,a novel computer-aided diagnosis(CAD)approach for glaucomatous retinal image classification has been introduced.It extracts graph-based texture features from structurally improved fundus images using discrete wavelet-transformation(DWT)and deterministic tree-walk(DTW)procedures.Retinal images are considered from both public repositories and eye hospitals.Images are enhanced with image-specific luminance and gradient transitions for both contrast and texture improvement.The enhanced images are mapped into undirected graphs using DTW trajectories formed by the image’s wavelet coefficients.Graph-based features are extracted fromthese graphs to capture image texture patterns.Machine learning(ML)classifiers use these features to label retinal images.This approach has attained an accuracy range of 93.5%to 100%,82.1%to 99.3%,95.4%to 100%,83.3%to 96.6%,77.7%to 88.8%,and 91.4%to 100%on the ACRIMA,ORIGA,RIM-ONE,Drishti,HRF,and HOSPITAL datasets,respectively.The major strength of this approach is texture pattern identification using various topological graphs.It has achieved optimal performance with SVM and RF classifiers using biorthogonal DWT combinations on both public and patients’fundus datasets.The classification performance of the DWT-DTW approach is on par with the contemporary state-of-the-art methods,which can be helpful for ophthalmologists in glaucoma screening.展开更多
This paper presents a subspace identification method for closed-loop systems with unknown deterministic disturbances.To deal with the unknown deterministic disturbances,two strategies are implemented to construct the ...This paper presents a subspace identification method for closed-loop systems with unknown deterministic disturbances.To deal with the unknown deterministic disturbances,two strategies are implemented to construct the row space that can be used to approximately represent the unknown deterministic disturbances using the trigonometric functions or Bernstein polynomials depending on whether the disturbance frequencies are known.For closed-loop identification,CCF-N4SID is extended to the case with unknown deterministic disturbances using the oblique projection.In addition,a proper Bernstein polynomial order can be determined using the Akaike information criterion(AIC)or the Bayesian information criterion(BIC).Numerical simulation results demonstrate the effectiveness of the proposed identification method for both periodic and aperiodic deterministic disturbances.展开更多
In previous works, the theoretical and experimental deterministic scalar kinematic structures, the theoretical and experimental deterministic vector kinematic structures, the theoretical and experimental deterministic...In previous works, the theoretical and experimental deterministic scalar kinematic structures, the theoretical and experimental deterministic vector kinematic structures, the theoretical and experimental deterministic scalar dynamic structures, and the theoretical and experimental deterministic vector dynamic structures have been developed to compute the exact solution for deterministic chaos of the exponential pulsons and oscillons that is governed by the nonstationary three-dimensional Navier-Stokes equations. To explore properties of the kinetic energy, rectangular, diagonal, and triangular summations of a matrix of the kinetic energy and general terms of various sums have been used in the current paper to develop quantization of the kinetic energy of deterministic chaos. Nested structures of a cumulative energy pulson, an energy pulson of propagation, an internal energy oscillon, a diagonal energy oscillon, and an external energy oscillon have been established. In turn, the energy pulsons and oscillons include group pulsons of propagation, internal group oscillons, diagonal group oscillons, and external group oscillons. Sequentially, the group pulsons and oscillons contain wave pulsons of propagation, internal wave oscillons, diagonal wave oscillons, and external wave oscillons. Consecutively, the wave pulsons and oscillons are composed of elementary pulsons of propagation, internal elementary oscillons, diagonal elementary oscillons, and external elementary oscillons. Topology, periodicity, and integral properties of the exponential pulsons and oscillons have been studied using the novel method of the inhomogeneous Fourier expansions via eigenfunctions in coordinates and time. Symbolic computations of the exact expansions have been performed using the experimental and theoretical programming in Maple. Results of the symbolic computations have been justified by probe visualizations.展开更多
Deterministic optimization methods are combined with the Pareto front concept to solve multi-criterion design problems. The algorithm and the numerical implementation are applied to aerodynamic designs. Evolutionary a...Deterministic optimization methods are combined with the Pareto front concept to solve multi-criterion design problems. The algorithm and the numerical implementation are applied to aerodynamic designs. Evolutionary algorithms (EAs) and the Pareto front concept are used to solve practical design problems in industry for its robustness in capturing convex, concave, discrete or discontinuous Pareto fronts of multi-objective optimization problems. However, the process is time-consuming. Therefore, deterministic optimization methods are introduced to capture the Pareto front, and the types of the captured Pareto front are explained. Numerical experiments show that the deterministic optimization method is a good alternative to EAs for capturing any convex and some concave Pareto fronts in multi-criterion aerodynamic optimization problems due to its efficiency.展开更多
基金supported by National Natural Science Foundation of China(52222215, 52272420, 52072051)。
文摘Parking in a small parking lot within limited space poses a difficult task. It often leads to deviations between the final parking posture and the target posture. These deviations can lead to partial occupancy of adjacent parking lots, which poses a safety threat to vehicles parked in these parking lots. However, previous studies have not addressed this issue. In this paper, we aim to evaluate the impact of parking deviation of existing vehicles next to the target parking lot(PDEVNTPL) on the automatic ego vehicle(AEV) parking, in terms of safety, comfort, accuracy, and efficiency of parking. A segmented parking training framework(SPTF) based on soft actor-critic(SAC) is proposed to improve parking performance. In the proposed method, the SAC algorithm incorporates strategy entropy into the objective function, to enable the AEV to learn parking strategies based on a more comprehensive understanding of the environment. Additionally, the SPTF simplifies complex parking tasks to maintain the high performance of deep reinforcement learning(DRL). The experimental results reveal that the PDEVNTPL has a detrimental influence on the AEV parking in terms of safety, accuracy, and comfort, leading to reductions of more than 27%, 54%, and 26%respectively. However, the SAC-based SPTF effectively mitigates this impact, resulting in a considerable increase in the parking success rate from 71% to 93%. Furthermore, the heading angle deviation is significantly reduced from 2.25 degrees to 0.43degrees.
基金supported in part by the projects of the National Natural Science Foundation of China(62376059,41971340)Fujian Provincial Department of Science and Technology(2023XQ008,2023I0024,2021Y4019),Fujian Provincial Department of Finance(GY-Z230007,GYZ23012)Fujian Key Laboratory of Automotive Electronics and Electric Drive(KF-19-22001).
文摘Autonomous driving has witnessed rapid advancement;however,ensuring safe and efficient driving in intricate scenarios remains a critical challenge.In particular,traffic roundabouts bring a set of challenges to autonomous driving due to the unpredictable entry and exit of vehicles,susceptibility to traffic flow bottlenecks,and imperfect data in perceiving environmental information,rendering them a vital issue in the practical application of autonomous driving.To address the traffic challenges,this work focused on complex roundabouts with multi-lane and proposed a Perception EnhancedDeepDeterministic Policy Gradient(PE-DDPG)for AutonomousDriving in the Roundabouts.Specifically,themodel incorporates an enhanced variational autoencoder featuring an integrated spatial attention mechanism alongside the Deep Deterministic Policy Gradient framework,enhancing the vehicle’s capability to comprehend complex roundabout environments and make decisions.Furthermore,the PE-DDPG model combines a dynamic path optimization strategy for roundabout scenarios,effectively mitigating traffic bottlenecks and augmenting throughput efficiency.Extensive experiments were conducted with the collaborative simulation platform of CARLA and SUMO,and the experimental results show that the proposed PE-DDPG outperforms the baseline methods in terms of the convergence capacity of the training process,the smoothness of driving and the traffic efficiency with diverse traffic flow patterns and penetration rates of autonomous vehicles(AVs).Generally,the proposed PE-DDPGmodel could be employed for autonomous driving in complex scenarios with imperfect data.
文摘Actor-Critic是一种强化学习方法,通过与环境在线试错交互收集样本来学习策略,是求解序贯感知决策问题的有效手段.但是,这种在线交互的主动学习范式在一些复杂真实环境中收集样本时会带来成本和安全问题离线强化学习作为一种基于数据驱动的强化学习范式,强调从静态样本数据集中学习策略,与环境无探索交互,为机器人、自动驾驶、健康护理等真实世界部署应用提供了可行的解决方案,是近年来的研究热点.目前,离线强化学习方法存在学习策略和行为策略之间的分布偏移挑战,针对这个挑战,通常采用策略约束或值函数正则化来限制访问数据集分布之外(Out-Of-Distribution,OOD)的动作,从而导致学习性能过于保守,阻碍了值函数网络的泛化和学习策略的性能提升.为此,本文利用不确定性估计和OOD采样来平衡值函数学习的泛化性和保守性,提出一种基于不确定性估计的离线确定型Actor-Critic方法(Offline Deterministic Actor-Critic based on UncertaintyEstimation,ODACUE).首先,针对确定型策略,给出一种Q值函数的不确定性估计算子定义,理论证明了该算子学到的Q值函数是最优Q值函数的一种悲观估计.然后,将不确定性估计算子应用于确定型Actor-Critic框架中,通过对不确定性估计算子进行凸组合构造Critic学习的目标函数.最后,D4RL基准数据集任务上的实验结果表明:相较于对比算法,ODACUE在11个不同质量等级数据集任务中的总体性能提升最低达9.56%,最高达64.92%.此外,参数分析和消融实验进一步验证了ODACUE的稳定性和泛化能力.
基金supported by the Sichuan Science and Technology Program(grant number 2022YFG0123).
文摘In this study,a novel residential virtual power plant(RVPP)scheduling method that leverages a gate recurrent unit(GRU)-integrated deep reinforcement learning(DRL)algorithm is proposed.In the proposed scheme,the GRU-integrated DRL algorithm guides the RVPP to participate effectively in both the day-ahead and real-time markets,lowering the electricity purchase costs and consumption risks for end-users.The Lagrangian relaxation technique is introduced to transform the constrained Markov decision process(CMDP)into an unconstrained optimization problem,which guarantees that the constraints are strictly satisfied without determining the penalty coefficients.Furthermore,to enhance the scalability of the constrained soft actor-critic(CSAC)-based RVPP scheduling approach,a fully distributed scheduling architecture was designed to enable plug-and-play in the residential distributed energy resources(RDER).Case studies performed on the constructed RVPP scenario validated the performance of the proposed methodology in enhancing the responsiveness of the RDER to power tariffs,balancing the supply and demand of the power grid,and ensuring customer comfort.
基金supported by the Grant-in-Aid for Young Scientists(B)Project(Grant No.24700716)funded by the Ministry of Education,Culture,Sports,Science and Technology,Japan.
文摘Background:Combined knee valgus and tibial internal rotation(VL+IR)moments have been shown to stress the anterior cruciate ligament(ACL)in several in vitro cadaveric studies.To utilize this knowledge for non-contact ACL injury prevention in sports,it is necessary to elucidate how the ground reaction force(GRF)acting point(center of pressure(CoP))in the stance foot produces combined knee VL+IR moments in risky maneuvers,such as cuttings.However,the effects of the GRF acting point on the development of the combined knee VL+IR moment in cutting are still unknown.Methods:We first established the deterministic mechanical condition that the CoP position relative to the tibial rotational axis differentiates the GRF vector’s directional probability for developing the combined knee VL+IR moment,and theoretically predicted that when the CoP is posterior to the tibial rotational axis,the GRF vector is more likely to produce the combined knee VL+IR moment than when the CoP is anterior to the tibial rotational axis.Then,we tested a stochastic aspect of our theory in a lab-controlled in vivo experiment.Fourteen females performed 60˚cutting under forefoot/rearfoot strike conditions(10 trials each).The positions of lower limb markers and GRF data were measured,and the knee moment due to GRF vector was calculated.The trials were divided into anterior-and posterior-CoP groups depending on the CoP position relative to the tibial rotational axis at each 10 ms interval from 0 to 100 ms after foot strike,and the occurrence rate of the combined knee VL+IR moment was compared between trial groups.Results:The posterior-CoP group showed significantly higher occurrence rates of the combined knee VL+IR moment(maximum of 82.8%)at every time point than those of the anterior-CoP trials,as theoretically predicted by the deterministic mechanical condition.Conclusion:The rearfoot strikes inducing the posterior CoP should be avoided to reduce the risk of non-contact ACL injury associated with the combined knee VL+IR stress.
文摘Glaucoma is a group of ocular atrophy diseases that cause progressive vision loss by affecting the optic nerve.Because of its asymptomatic nature,glaucoma has become the leading cause of human blindness worldwide.In this paper,a novel computer-aided diagnosis(CAD)approach for glaucomatous retinal image classification has been introduced.It extracts graph-based texture features from structurally improved fundus images using discrete wavelet-transformation(DWT)and deterministic tree-walk(DTW)procedures.Retinal images are considered from both public repositories and eye hospitals.Images are enhanced with image-specific luminance and gradient transitions for both contrast and texture improvement.The enhanced images are mapped into undirected graphs using DTW trajectories formed by the image’s wavelet coefficients.Graph-based features are extracted fromthese graphs to capture image texture patterns.Machine learning(ML)classifiers use these features to label retinal images.This approach has attained an accuracy range of 93.5%to 100%,82.1%to 99.3%,95.4%to 100%,83.3%to 96.6%,77.7%to 88.8%,and 91.4%to 100%on the ACRIMA,ORIGA,RIM-ONE,Drishti,HRF,and HOSPITAL datasets,respectively.The major strength of this approach is texture pattern identification using various topological graphs.It has achieved optimal performance with SVM and RF classifiers using biorthogonal DWT combinations on both public and patients’fundus datasets.The classification performance of the DWT-DTW approach is on par with the contemporary state-of-the-art methods,which can be helpful for ophthalmologists in glaucoma screening.
基金partially supported by National Key Research and Development Program of China(2019YFC1510902)National Natural Science Foundation of China(62073104)+1 种基金Natural Science Foundation of Heilongjiang Province(LH2022F024)China Postdoctoral Science Foundation(2022M710965)。
文摘This paper presents a subspace identification method for closed-loop systems with unknown deterministic disturbances.To deal with the unknown deterministic disturbances,two strategies are implemented to construct the row space that can be used to approximately represent the unknown deterministic disturbances using the trigonometric functions or Bernstein polynomials depending on whether the disturbance frequencies are known.For closed-loop identification,CCF-N4SID is extended to the case with unknown deterministic disturbances using the oblique projection.In addition,a proper Bernstein polynomial order can be determined using the Akaike information criterion(AIC)or the Bayesian information criterion(BIC).Numerical simulation results demonstrate the effectiveness of the proposed identification method for both periodic and aperiodic deterministic disturbances.
文摘In previous works, the theoretical and experimental deterministic scalar kinematic structures, the theoretical and experimental deterministic vector kinematic structures, the theoretical and experimental deterministic scalar dynamic structures, and the theoretical and experimental deterministic vector dynamic structures have been developed to compute the exact solution for deterministic chaos of the exponential pulsons and oscillons that is governed by the nonstationary three-dimensional Navier-Stokes equations. To explore properties of the kinetic energy, rectangular, diagonal, and triangular summations of a matrix of the kinetic energy and general terms of various sums have been used in the current paper to develop quantization of the kinetic energy of deterministic chaos. Nested structures of a cumulative energy pulson, an energy pulson of propagation, an internal energy oscillon, a diagonal energy oscillon, and an external energy oscillon have been established. In turn, the energy pulsons and oscillons include group pulsons of propagation, internal group oscillons, diagonal group oscillons, and external group oscillons. Sequentially, the group pulsons and oscillons contain wave pulsons of propagation, internal wave oscillons, diagonal wave oscillons, and external wave oscillons. Consecutively, the wave pulsons and oscillons are composed of elementary pulsons of propagation, internal elementary oscillons, diagonal elementary oscillons, and external elementary oscillons. Topology, periodicity, and integral properties of the exponential pulsons and oscillons have been studied using the novel method of the inhomogeneous Fourier expansions via eigenfunctions in coordinates and time. Symbolic computations of the exact expansions have been performed using the experimental and theoretical programming in Maple. Results of the symbolic computations have been justified by probe visualizations.
文摘Deterministic optimization methods are combined with the Pareto front concept to solve multi-criterion design problems. The algorithm and the numerical implementation are applied to aerodynamic designs. Evolutionary algorithms (EAs) and the Pareto front concept are used to solve practical design problems in industry for its robustness in capturing convex, concave, discrete or discontinuous Pareto fronts of multi-objective optimization problems. However, the process is time-consuming. Therefore, deterministic optimization methods are introduced to capture the Pareto front, and the types of the captured Pareto front are explained. Numerical experiments show that the deterministic optimization method is a good alternative to EAs for capturing any convex and some concave Pareto fronts in multi-criterion aerodynamic optimization problems due to its efficiency.