The behavior of Chinese patients seeking help for erectile dysfunction (ED) has not been described in detail. This was an observational study conducted using an outpatient clinic-based questionnaire survey of ED pat...The behavior of Chinese patients seeking help for erectile dysfunction (ED) has not been described in detail. This was an observational study conducted using an outpatient clinic-based questionnaire survey of ED patients. From 2008 to 2009, physicians in 10 medical centers in China enrolled 2693 men (aged 25-70years) diagnosed with ED. The diagnosis was based on the International Index of Erectile Function 5 (IIEF-5) Questionnaire. The men completed a survey that asked questions about demographics, marital status, education level and household income as well as help-seeking behavior and awareness of medical therapy. The mean age of the 2693 men was 43.4 5.3years; 73% were 〈50-years-old and 49% had a high household income. The mean time between noticing ED and taking the first treatment was 4.3 2.1months. Of the 2577 respondents, physicians (54%) and the internet (52%) were most frequently consulted sources for information about ED. Young ED patients preferred using the internet and older patients preferred consulting with physicians. Western medicine (19%) and traditional Chinese medicine (16%) were most frequently used for treatment. Young ED patients preferred to first search the internet for information, whereas older patients first asked physicians for help. Side effects of treatment were the greatest concern, especially for older patients. Physicians and the internet are frequently consulted for ED information and therapy. On the basis of these survey results, we believe that physicians in China should enhance health education about ED, especially via the internet.展开更多
With the maturation of autonomous driving technology, the use of autonomous vehicles in a socially acceptable manner has become a growing demand of the public. Human-like autonomous driving is expected due to the impa...With the maturation of autonomous driving technology, the use of autonomous vehicles in a socially acceptable manner has become a growing demand of the public. Human-like autonomous driving is expected due to the impact of the differences between autonomous vehicles and human drivers on safety.Although human-like decision-making has become a research hotspot, a unified theory has not yet been formed, and there are significant differences in the implementation and performance of existing methods. This paper provides a comprehensive overview of human-like decision-making for autonomous vehicles. The following issues are discussed: 1) The intelligence level of most autonomous driving decision-making algorithms;2) The driving datasets and simulation platforms for testing and verifying human-like decision-making;3) The evaluation metrics of human-likeness;personalized driving;the application of decisionmaking in real traffic scenarios;and 4) The potential research direction of human-like driving. These research results are significant for creating interpretable human-like driving models and applying them in dynamic traffic scenarios. In the future, the combination of intuitive logical reasoning and hierarchical structure will be an important topic for further research. It is expected to meet the needs of human-like driving.展开更多
Behavior-based autonomous systems rely on human intelligence to resolve multi-mission conflicts by designing mission priority rules and nonlinear controllers.In this work,a novel twolayer reinforcement learning behavi...Behavior-based autonomous systems rely on human intelligence to resolve multi-mission conflicts by designing mission priority rules and nonlinear controllers.In this work,a novel twolayer reinforcement learning behavioral control(RLBC)method is proposed to reduce such dependence by trial-and-error learning.Specifically,in the upper layer,a reinforcement learning mission supervisor(RLMS)is designed to learn the optimal mission priority.Compared with existing mission supervisors,the RLMS improves the dynamic performance of mission priority adjustment by maximizing cumulative rewards and reducing hardware storage demand when using neural networks.In the lower layer,a reinforcement learning controller(RLC)is designed to learn the optimal control policy.Compared with existing behavioral controllers,the RLC reduces the control cost of mission priority adjustment by balancing control performance and consumption.All error signals are proved to be semi-globally uniformly ultimately bounded(SGUUB).Simulation results show that the number of mission priority adjustment and the control cost are significantly reduced compared to some existing mission supervisors and behavioral controllers,respectively.展开更多
This paper is to explore the problems of intelligent connected vehicles(ICVs)autonomous driving decision-making under a 5G-V2X structured road environment.Through literature review and interviews with autonomous drivi...This paper is to explore the problems of intelligent connected vehicles(ICVs)autonomous driving decision-making under a 5G-V2X structured road environment.Through literature review and interviews with autonomous driving practitioners,this paper firstly puts forward a logical framework for designing a cerebrum-like autonomous driving system.Secondly,situated on this framework,it builds a hierarchical finite state machine(HFSM)model as well as a TOPSIS-GRA algorithm for making ICV autonomous driving decisions by employing a data fusion approach between the entropy weight method(EWM)and analytic hierarchy process method(AHP)and by employing a model fusion approach between the technique for order preference by similarity to an ideal solution(TOPSIS)and grey relational analysis(GRA).The HFSM model is composed of two layers:the global FSM model and the local FSM model.The decision of the former acts as partial input information of the latter and the result of the latter is sent forward to the local pathplanning module,meanwhile pulsating feedback to the former as real-time refresh data.To identify different traffic scenarios in a cerebrum-like way,the global FSM model is designed as 7 driving behavior states and 17 driving characteristic events,and the local FSM model is designed as 16 states and 8 characteristic events.In respect to designing a cerebrum-like algorithm for state transition,this paper firstly fuses AHP weight and EWM weight at their output layer to generate a synthetic weight coefficient for each characteristic event;then,it further fuses TOPSIS method and GRA method at the model building layer to obtain the implementable order of state transition.To verify the feasibility,reliability,and safety of theHFSMmodel aswell as its TOPSISGRA state transition algorithm,this paper elaborates on a series of simulative experiments conducted on the PreScan8.50 platform.The results display that the accuracy of obstacle detection gets 98%,lane line prediction is beyond 70 m,the speed of collision avoidance is higher than 45 km/h,the distance of collision avoidance is less than 5 m,path planning time for obstacle avoidance is averagely less than 50 ms,and brake deceleration is controlled under 6 m/s2.These technical indexes support that the driving states set and characteristic events set for the HFSM model as well as its TOPSIS-GRA algorithm may bring about cerebrum-like decision-making effectiveness for ICV autonomous driving under 5G-V2X intelligent road infrastructure.展开更多
Given the difficulty in hand coding task schemes, an intellectualized architecture of the autonomous micro mobile robot based behavior for fault repair was presented. Integrating the reinforcement learning and the...Given the difficulty in hand coding task schemes, an intellectualized architecture of the autonomous micro mobile robot based behavior for fault repair was presented. Integrating the reinforcement learning and the group behavior evolution simulating the human's learning and evolution, the autonomous micro mobile robot will automatically generate the suited actions satisfied the environment. However, the designer only devises some basic behaviors, which decreases the workload of the designer and cognitive deficiency of the robot to the environment. The results of simulation have shown that the architecture endows micro robot with the ability of learning, adaptation and robustness, also with the ability of accomplishing the given task.展开更多
This study takes Gannan Tibetan Autonomous Prefecture as the place of case study and tourists as research objects. From the perspectives of geographical distribution of source tourist markets, Tourist activity behavio...This study takes Gannan Tibetan Autonomous Prefecture as the place of case study and tourists as research objects. From the perspectives of geographical distribution of source tourist markets, Tourist activity behavioral and spatial patterns of Tourists, this study looks into the geographical structure of the source tourists and spatial patterns by geography. The analysis of 341 questionnaires on tourists shows that:(1) The tourism cycle of Gannan is in the development phase, competing with adjacent Aba, and greatly impacted by the substitution effect and shadow effect of Aba.(2) The spatial distribution of tourist sources is concentrated, indicating that Gannan is a regional tourism destination. The temporal distance of tourists is mainly concentrated within the 6-hour traffi c circle.(3) Gannan Tibetan Autonomous Prefecture has already become the composite tourist destination dominated by leisure vacation. The minority folkcustom and special landscape are the most attractive tourism resources. Due to the impact of man-land harmonious lifestyle in the tourist areas, the environmental attitude of tourists is improved, and the transportation and shopping are the most vulnerable links in tourism service in Gannan Tibetan Autonomous Prefecture.(4) The spatial behavior of tourists in Gannan is mainly of single-destination style(52%), Transit leg and circle tour style(7%) as well as circle tour style(41%). The spatial distribution of tourist fl ow in Gannan shows a signifi cant feature "more in the north, less in the south and dependent on National Road". Tourism resources, transport facilities, regional competition and lack of route connecting different ecological units are important causes of the spatial distribution of self-help tourists.展开更多
The present study aimed to investigate senior high school students to explore the relationships among their English achievement goal orientations,learning anxiety,and autonomous learning behavior.748 first-year senior...The present study aimed to investigate senior high school students to explore the relationships among their English achievement goal orientations,learning anxiety,and autonomous learning behavior.748 first-year senior high school students in Guizhou Province,China were selected as participants.A comprehensive questionnaire measuring the above variables was designed to collect the data.The Structural Equation Modeling(SEM)was used to analyze the data.The results showed that the model had good fit to the sample.The students’mastery goals and performance-approach goals positively contributed to their autonomous learning behavior,whereas their performance-avoidance goals were negatively associated with their autonomous learning behavior.The students’mastery goals effectively reduced their learning anxiety,but their performance-approach goals and performance-avoidance goals engendered learning anxiety.The students’learning anxiety and their autonomous learning behavior were negatively correlated.展开更多
This study examined the differences and primary factors from the impact of autonomous motivation and controlled motivation on the self-management behavior of hemodialysis patients.Anonymous,self-describing questionnai...This study examined the differences and primary factors from the impact of autonomous motivation and controlled motivation on the self-management behavior of hemodialysis patients.Anonymous,self-describing questionnaires were used for research on nine different dialysis facilities of 413 people who regularly visit.From using the primary factor results of multiple regression analysis,that took autonomous motivation and controlled motivation as the dependent variable,a path diagram was created that led to each motivation.The acknowledgement of autonomy support facilitated whether it was autonomous motivation or controlled motivation(The standardized coefficient was 0.385,0.346,p<0.0001).Positive evaluation coping skills were a primary factor that promoted autonomous motivation,while trait anxiety,disorders of social activities,and lack of motivation were primary factors that promoted controlled motivation.In order to raise the autonomous motivation to promote self-management behavior in patients with hemodialysis treatment,situations that easily cause amotivation and anxiety,as well as tendencies for depression should be assessed.Also the encouragement to attain positive evaluation coping skills to support patient autonomy appears to be effective.展开更多
In the near future, active safety systems will take more control over the vehicle driving, even up to introducing fully autonomous vehicles. Nowadays, it is expected that the active safety systems will aid avoiding co...In the near future, active safety systems will take more control over the vehicle driving, even up to introducing fully autonomous vehicles. Nowadays, it is expected that the active safety systems will aid avoiding collisions much more efficiently than human drivers. These systems can protect not only the passengers, but also other road users. To mitigate collision, certain maneuvers (e.g., sudden braking, lane change, etc.) need to be done in a reasonably quick time. However, this may lead to low-g energy pulses. The latter fact, may cause unexpected and, in some cases, unwanted occupant body motion resulting even in OOP (out of position) postures. New patterns of occupant reactions in such cases are, to some extent, confirmed experimentally [1-3]. This paper evaluates the limits of standard ATDs (anthropometric test devices) and chosen human models in well established maneuver scenarios. Obtained results are compared with experimental data available in the literature. Drawbacks identify new challenges for the near future simulation based safety engineering. One scenario with combined conditions of emergency braking during lane change has been used as an example of OOP posture after maneuver.展开更多
In order to deeply analyze the differences in the acceptance of autonomous driving technology among different gender groups,a multiple indicators and multiple causes model was constructed by integrating a technology a...In order to deeply analyze the differences in the acceptance of autonomous driving technology among different gender groups,a multiple indicators and multiple causes model was constructed by integrating a technology acceptance model and theory of planned behavior to comprehensively reveal the gender differences in the influence mechanisms of subjective and objective factors.The analysis is based on data collected from Chinese urban residents.Among objective factors,age has a significant negative impact on women's perceived behavior control and a significant positive impact on perceived ease of use.Education has a significant positive impact on men's perceived behavior control,and has a strong positive impact on women's perceived usefulness(PU).For men,income and education are found to have strong positive impacts on perceived behavior control.Among subjective factors,perceived ease of use(PEU)has the greatest influence on women's behavior intention,and it is the only influential factor for women's intention to use autonomous driving technology,with an influence coefficient of 0.72.The influencing path of men's intention to use autonomous driving technology is more complex.It is not only directly affected by the significant and positive joint effects of attitude and PU,but also indirectly affected by perceived behavior controls,subjective norms,and PEU.展开更多
自动驾驶车辆决策系统直接影响车辆综合行驶性能,是实现自动驾驶技术需要解决的关键难题之一。基于深度强化学习算法DDPG(deep deterministic policy gradient),针对此问题提出了一种端到端驾驶行为决策模型。首先,结合驾驶员模型选取...自动驾驶车辆决策系统直接影响车辆综合行驶性能,是实现自动驾驶技术需要解决的关键难题之一。基于深度强化学习算法DDPG(deep deterministic policy gradient),针对此问题提出了一种端到端驾驶行为决策模型。首先,结合驾驶员模型选取自车、道路、干扰车辆等共64维度状态空间信息作为输入数据集对决策模型进行训练,决策模型输出合理的驾驶行为以及控制量,为解决训练测试中的奖励和控制量突变问题,改进DDPG决策模型对决策控制效果进行优化,并在TORCS(the open racing car simulator)平台进行仿真实验验证。结果表明:所提出的决策模型可以根据车辆和环境实时状态信息输出合理的驾驶行为以及控制量,与DDPG模型相比,改进的模型具有更好的控制精度,且车辆横向速度显著减小,车辆舒适性以及车辆稳定性明显改善。展开更多
文摘The behavior of Chinese patients seeking help for erectile dysfunction (ED) has not been described in detail. This was an observational study conducted using an outpatient clinic-based questionnaire survey of ED patients. From 2008 to 2009, physicians in 10 medical centers in China enrolled 2693 men (aged 25-70years) diagnosed with ED. The diagnosis was based on the International Index of Erectile Function 5 (IIEF-5) Questionnaire. The men completed a survey that asked questions about demographics, marital status, education level and household income as well as help-seeking behavior and awareness of medical therapy. The mean age of the 2693 men was 43.4 5.3years; 73% were 〈50-years-old and 49% had a high household income. The mean time between noticing ED and taking the first treatment was 4.3 2.1months. Of the 2577 respondents, physicians (54%) and the internet (52%) were most frequently consulted sources for information about ED. Young ED patients preferred using the internet and older patients preferred consulting with physicians. Western medicine (19%) and traditional Chinese medicine (16%) were most frequently used for treatment. Young ED patients preferred to first search the internet for information, whereas older patients first asked physicians for help. Side effects of treatment were the greatest concern, especially for older patients. Physicians and the internet are frequently consulted for ED information and therapy. On the basis of these survey results, we believe that physicians in China should enhance health education about ED, especially via the internet.
基金supported by the National Key R&D Program of China (2022YFB2502900)the National Natural Science Foundation of China (62088102, 61790563)。
文摘With the maturation of autonomous driving technology, the use of autonomous vehicles in a socially acceptable manner has become a growing demand of the public. Human-like autonomous driving is expected due to the impact of the differences between autonomous vehicles and human drivers on safety.Although human-like decision-making has become a research hotspot, a unified theory has not yet been formed, and there are significant differences in the implementation and performance of existing methods. This paper provides a comprehensive overview of human-like decision-making for autonomous vehicles. The following issues are discussed: 1) The intelligence level of most autonomous driving decision-making algorithms;2) The driving datasets and simulation platforms for testing and verifying human-like decision-making;3) The evaluation metrics of human-likeness;personalized driving;the application of decisionmaking in real traffic scenarios;and 4) The potential research direction of human-like driving. These research results are significant for creating interpretable human-like driving models and applying them in dynamic traffic scenarios. In the future, the combination of intuitive logical reasoning and hierarchical structure will be an important topic for further research. It is expected to meet the needs of human-like driving.
基金the National Natural Science Foundation of China(61603094)。
文摘Behavior-based autonomous systems rely on human intelligence to resolve multi-mission conflicts by designing mission priority rules and nonlinear controllers.In this work,a novel twolayer reinforcement learning behavioral control(RLBC)method is proposed to reduce such dependence by trial-and-error learning.Specifically,in the upper layer,a reinforcement learning mission supervisor(RLMS)is designed to learn the optimal mission priority.Compared with existing mission supervisors,the RLMS improves the dynamic performance of mission priority adjustment by maximizing cumulative rewards and reducing hardware storage demand when using neural networks.In the lower layer,a reinforcement learning controller(RLC)is designed to learn the optimal control policy.Compared with existing behavioral controllers,the RLC reduces the control cost of mission priority adjustment by balancing control performance and consumption.All error signals are proved to be semi-globally uniformly ultimately bounded(SGUUB).Simulation results show that the number of mission priority adjustment and the control cost are significantly reduced compared to some existing mission supervisors and behavioral controllers,respectively.
基金funded by Chongqing Science and Technology Bureau (No.cstc2021jsyj-yzysbAX0008)Chongqing University of Arts and Sciences (No.P2021JG13)2021 Humanities and Social Sciences Program of Chongqing Education Commission (No.21SKGH227).
文摘This paper is to explore the problems of intelligent connected vehicles(ICVs)autonomous driving decision-making under a 5G-V2X structured road environment.Through literature review and interviews with autonomous driving practitioners,this paper firstly puts forward a logical framework for designing a cerebrum-like autonomous driving system.Secondly,situated on this framework,it builds a hierarchical finite state machine(HFSM)model as well as a TOPSIS-GRA algorithm for making ICV autonomous driving decisions by employing a data fusion approach between the entropy weight method(EWM)and analytic hierarchy process method(AHP)and by employing a model fusion approach between the technique for order preference by similarity to an ideal solution(TOPSIS)and grey relational analysis(GRA).The HFSM model is composed of two layers:the global FSM model and the local FSM model.The decision of the former acts as partial input information of the latter and the result of the latter is sent forward to the local pathplanning module,meanwhile pulsating feedback to the former as real-time refresh data.To identify different traffic scenarios in a cerebrum-like way,the global FSM model is designed as 7 driving behavior states and 17 driving characteristic events,and the local FSM model is designed as 16 states and 8 characteristic events.In respect to designing a cerebrum-like algorithm for state transition,this paper firstly fuses AHP weight and EWM weight at their output layer to generate a synthetic weight coefficient for each characteristic event;then,it further fuses TOPSIS method and GRA method at the model building layer to obtain the implementable order of state transition.To verify the feasibility,reliability,and safety of theHFSMmodel aswell as its TOPSISGRA state transition algorithm,this paper elaborates on a series of simulative experiments conducted on the PreScan8.50 platform.The results display that the accuracy of obstacle detection gets 98%,lane line prediction is beyond 70 m,the speed of collision avoidance is higher than 45 km/h,the distance of collision avoidance is less than 5 m,path planning time for obstacle avoidance is averagely less than 50 ms,and brake deceleration is controlled under 6 m/s2.These technical indexes support that the driving states set and characteristic events set for the HFSM model as well as its TOPSIS-GRA algorithm may bring about cerebrum-like decision-making effectiveness for ICV autonomous driving under 5G-V2X intelligent road infrastructure.
文摘Given the difficulty in hand coding task schemes, an intellectualized architecture of the autonomous micro mobile robot based behavior for fault repair was presented. Integrating the reinforcement learning and the group behavior evolution simulating the human's learning and evolution, the autonomous micro mobile robot will automatically generate the suited actions satisfied the environment. However, the designer only devises some basic behaviors, which decreases the workload of the designer and cognitive deficiency of the robot to the environment. The results of simulation have shown that the architecture endows micro robot with the ability of learning, adaptation and robustness, also with the ability of accomplishing the given task.
文摘This study takes Gannan Tibetan Autonomous Prefecture as the place of case study and tourists as research objects. From the perspectives of geographical distribution of source tourist markets, Tourist activity behavioral and spatial patterns of Tourists, this study looks into the geographical structure of the source tourists and spatial patterns by geography. The analysis of 341 questionnaires on tourists shows that:(1) The tourism cycle of Gannan is in the development phase, competing with adjacent Aba, and greatly impacted by the substitution effect and shadow effect of Aba.(2) The spatial distribution of tourist sources is concentrated, indicating that Gannan is a regional tourism destination. The temporal distance of tourists is mainly concentrated within the 6-hour traffi c circle.(3) Gannan Tibetan Autonomous Prefecture has already become the composite tourist destination dominated by leisure vacation. The minority folkcustom and special landscape are the most attractive tourism resources. Due to the impact of man-land harmonious lifestyle in the tourist areas, the environmental attitude of tourists is improved, and the transportation and shopping are the most vulnerable links in tourism service in Gannan Tibetan Autonomous Prefecture.(4) The spatial behavior of tourists in Gannan is mainly of single-destination style(52%), Transit leg and circle tour style(7%) as well as circle tour style(41%). The spatial distribution of tourist fl ow in Gannan shows a signifi cant feature "more in the north, less in the south and dependent on National Road". Tourism resources, transport facilities, regional competition and lack of route connecting different ecological units are important causes of the spatial distribution of self-help tourists.
文摘The present study aimed to investigate senior high school students to explore the relationships among their English achievement goal orientations,learning anxiety,and autonomous learning behavior.748 first-year senior high school students in Guizhou Province,China were selected as participants.A comprehensive questionnaire measuring the above variables was designed to collect the data.The Structural Equation Modeling(SEM)was used to analyze the data.The results showed that the model had good fit to the sample.The students’mastery goals and performance-approach goals positively contributed to their autonomous learning behavior,whereas their performance-avoidance goals were negatively associated with their autonomous learning behavior.The students’mastery goals effectively reduced their learning anxiety,but their performance-approach goals and performance-avoidance goals engendered learning anxiety.The students’learning anxiety and their autonomous learning behavior were negatively correlated.
文摘This study examined the differences and primary factors from the impact of autonomous motivation and controlled motivation on the self-management behavior of hemodialysis patients.Anonymous,self-describing questionnaires were used for research on nine different dialysis facilities of 413 people who regularly visit.From using the primary factor results of multiple regression analysis,that took autonomous motivation and controlled motivation as the dependent variable,a path diagram was created that led to each motivation.The acknowledgement of autonomy support facilitated whether it was autonomous motivation or controlled motivation(The standardized coefficient was 0.385,0.346,p<0.0001).Positive evaluation coping skills were a primary factor that promoted autonomous motivation,while trait anxiety,disorders of social activities,and lack of motivation were primary factors that promoted controlled motivation.In order to raise the autonomous motivation to promote self-management behavior in patients with hemodialysis treatment,situations that easily cause amotivation and anxiety,as well as tendencies for depression should be assessed.Also the encouragement to attain positive evaluation coping skills to support patient autonomy appears to be effective.
文摘In the near future, active safety systems will take more control over the vehicle driving, even up to introducing fully autonomous vehicles. Nowadays, it is expected that the active safety systems will aid avoiding collisions much more efficiently than human drivers. These systems can protect not only the passengers, but also other road users. To mitigate collision, certain maneuvers (e.g., sudden braking, lane change, etc.) need to be done in a reasonably quick time. However, this may lead to low-g energy pulses. The latter fact, may cause unexpected and, in some cases, unwanted occupant body motion resulting even in OOP (out of position) postures. New patterns of occupant reactions in such cases are, to some extent, confirmed experimentally [1-3]. This paper evaluates the limits of standard ATDs (anthropometric test devices) and chosen human models in well established maneuver scenarios. Obtained results are compared with experimental data available in the literature. Drawbacks identify new challenges for the near future simulation based safety engineering. One scenario with combined conditions of emergency braking during lane change has been used as an example of OOP posture after maneuver.
基金The National Key Research and Development Program of China(No.2018YFB1601304)the National Natural Science Foundation of China(No.71871107)Philosophy and Social Science Foundation Project of Universities in Jiangsu Province(No.2020SJA2059).
文摘In order to deeply analyze the differences in the acceptance of autonomous driving technology among different gender groups,a multiple indicators and multiple causes model was constructed by integrating a technology acceptance model and theory of planned behavior to comprehensively reveal the gender differences in the influence mechanisms of subjective and objective factors.The analysis is based on data collected from Chinese urban residents.Among objective factors,age has a significant negative impact on women's perceived behavior control and a significant positive impact on perceived ease of use.Education has a significant positive impact on men's perceived behavior control,and has a strong positive impact on women's perceived usefulness(PU).For men,income and education are found to have strong positive impacts on perceived behavior control.Among subjective factors,perceived ease of use(PEU)has the greatest influence on women's behavior intention,and it is the only influential factor for women's intention to use autonomous driving technology,with an influence coefficient of 0.72.The influencing path of men's intention to use autonomous driving technology is more complex.It is not only directly affected by the significant and positive joint effects of attitude and PU,but also indirectly affected by perceived behavior controls,subjective norms,and PEU.
文摘自动驾驶车辆决策系统直接影响车辆综合行驶性能,是实现自动驾驶技术需要解决的关键难题之一。基于深度强化学习算法DDPG(deep deterministic policy gradient),针对此问题提出了一种端到端驾驶行为决策模型。首先,结合驾驶员模型选取自车、道路、干扰车辆等共64维度状态空间信息作为输入数据集对决策模型进行训练,决策模型输出合理的驾驶行为以及控制量,为解决训练测试中的奖励和控制量突变问题,改进DDPG决策模型对决策控制效果进行优化,并在TORCS(the open racing car simulator)平台进行仿真实验验证。结果表明:所提出的决策模型可以根据车辆和环境实时状态信息输出合理的驾驶行为以及控制量,与DDPG模型相比,改进的模型具有更好的控制精度,且车辆横向速度显著减小,车辆舒适性以及车辆稳定性明显改善。