This paper introduced the design of the hybrid powertrain of the Fuel Cell City Bus demonstrated in 2008 Beijing Olympic Games. The configuration of the hybrid fuel cell powertrain was introduced. The safety of hydrog...This paper introduced the design of the hybrid powertrain of the Fuel Cell City Bus demonstrated in 2008 Beijing Olympic Games. The configuration of the hybrid fuel cell powertrain was introduced. The safety of hydrogen storage and delivery system, the hydrogen leakage alarm system were developed. The real-time distributed control and diagnosis system based on the Time Trigger Controller Area Network (TTCAN) with 10 ms basic control period was developed. The concept and implementation of processor (or controller) monitor and process (or task) monitor technique based on the TYCAN were applied in this paper. The fault tolerant control algorithm of the fuel cell engine and the battery man- agement system were considered. The demonstration experience verified that the fault tolerant control was very important for the fuel cell city bus.展开更多
The core task of tracking control is to make the controlled plant track a desired trajectory.The traditional performance index used in previous studies cannot eliminate completely the tracking error as the number of t...The core task of tracking control is to make the controlled plant track a desired trajectory.The traditional performance index used in previous studies cannot eliminate completely the tracking error as the number of time steps increases.In this paper,a new cost function is introduced to develop the value-iteration-based adaptive critic framework to solve the tracking control problem.Unlike the regulator problem,the iterative value function of tracking control problem cannot be regarded as a Lyapunov function.A novel stability analysis method is developed to guarantee that the tracking error converges to zero.The discounted iterative scheme under the new cost function for the special case of linear systems is elaborated.Finally,the tracking performance of the present scheme is demonstrated by numerical results and compared with those of the traditional approaches.展开更多
The optimization of the control strategy of a plug-in hybrid electric bus(PHEB) for the repeatedly driven bus route is a key technique to improve the fuel economy. The widely used rule-based(RB) control strategy is la...The optimization of the control strategy of a plug-in hybrid electric bus(PHEB) for the repeatedly driven bus route is a key technique to improve the fuel economy. The widely used rule-based(RB) control strategy is lacking in the global optimization property, while the global optimization algorithms have an unacceptable computation complexity for real-time application. Therefore, a novel hybrid dynamic programming-rule based(DPRB) algorithm is brought forward to solve the global energy optimization problem in a real-time controller of PHEB. Firstly, a control grid is built up for a given typical city bus route, according to the station locations and discrete levels of battery state of charge(SOC). Moreover, the decision variables for the energy optimization at each point of the control grid might be deduced from an off-line dynamic programming(DP) with the historical running information of the driving cycle. Meanwhile, the genetic algorithm(GA) is adopted to replace the quantization process of DP permissible control set to reduce the computation burden. Secondly, with the optimized decision variables as control parameters according to the position and battery SOC of a PHEB, a RB control is used as an implementable controller for the energy management. Simulation results demonstrate that the proposed DPRB might distribute electric energy more reasonably throughout the bus route, compared with the optimized RB. The proposed hybrid algorithm might give a practicable solution, which is a tradeoff between the applicability of RB and the global optimization property of DP.展开更多
Purpose–The purpose of this paper is to develop a dynamic control method to improve bus schedule adherence under connected bus system.Design/methodology/approach–The authors developed a dynamic programming model tha...Purpose–The purpose of this paper is to develop a dynamic control method to improve bus schedule adherence under connected bus system.Design/methodology/approach–The authors developed a dynamic programming model that optimally schedules the bus operating speed at road sections and multiple signal timing plans at intersections to improve bus schedule adherence.First,the bus route was partitioned into three types of sections:stop,road and intersection.Then,transit agencies can control buses in real time based on all collected information;i.e.control bus operating speed on road sections and adjust the signal timing plans through signal controllers to improve the schedule adherence in connected bus environment.Finally,bus punctuality at the downstream stop and the saturation degree deviations of intersections were selected as the evaluation criteria in optimizing signal control plans and bus speeds jointly.Findings–An illustrative case study by using a bus rapid transit line in Jinan city was performed to verify the proposed model.It revealed that based on the proposed strategy,the objective value could be reduced by 73.7%,which indicated that the punctuality was highly improved but not to incur excessive congestion for other vehicular traffic.Originality/value–In this paper,the authors applied speed guidance and the adjustment of the signal control plans for multiple cycles in advance to improve the scheduled stability;furthermore,the proposed control strategy can reduce the effect on private traffics to the utmost extend.展开更多
Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and ...Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are displayed, where the main results towards discrete-time systems and continuous-time systems are surveyed, respectively.Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environment is discussed, respectively, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environment attract enormous attention. The ADP architecture is revisited under the perspective of data-driven and RL frameworks,showing how they promote ADP formulation significantly.Finally, several typical control applications with respect to RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has d emonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence.展开更多
Aimed at infinite horizon optimal control problems of discrete time-varying nonlinear systems,in this paper,a new iterative adaptive dynamic programming algorithm,which is the discrete-time time-varying policy iterati...Aimed at infinite horizon optimal control problems of discrete time-varying nonlinear systems,in this paper,a new iterative adaptive dynamic programming algorithm,which is the discrete-time time-varying policy iteration(DTTV)algorithm,is developed.The iterative control law is designed to update the iterative value function which approximates the index function of optimal performance.The admissibility of the iterative control law is analyzed.The results show that the iterative value function is non-increasingly convergent to the Bellman-equation optimal solution.To implement the algorithm,neural networks are employed and a new implementation structure is established,which avoids solving the generalized Bellman equation in each iteration.Finally,the optimal control laws for torsional pendulum and inverted pendulum systems are obtained by using the DTTV policy iteration algorithm,where the mass and pendulum bar length are permitted to be time-varying parameters.The effectiveness of the developed method is illustrated by numerical results and comparisons.展开更多
We develop an online adaptive dynamic programming (ADP) based optimal control scheme for continuous-time chaotic systems. The idea is to use the ADP algorithm to obtain the optimal control input that makes the perfo...We develop an online adaptive dynamic programming (ADP) based optimal control scheme for continuous-time chaotic systems. The idea is to use the ADP algorithm to obtain the optimal control input that makes the performance index function reach an optimum. The expression of the performance index function for the chaotic system is first presented. The online ADP algorithm is presented to achieve optimal control. In the ADP structure, neural networks are used to construct a critic network and an action network, which can obtain an approximate performance index function and the control input, respectively. It is proven that the critic parameter error dynamics and the closed-loop chaotic systems are uniformly ultimately bounded exponentially. Our simulation results illustrate the performance of the established optimal control method.展开更多
文摘This paper introduced the design of the hybrid powertrain of the Fuel Cell City Bus demonstrated in 2008 Beijing Olympic Games. The configuration of the hybrid fuel cell powertrain was introduced. The safety of hydrogen storage and delivery system, the hydrogen leakage alarm system were developed. The real-time distributed control and diagnosis system based on the Time Trigger Controller Area Network (TTCAN) with 10 ms basic control period was developed. The concept and implementation of processor (or controller) monitor and process (or task) monitor technique based on the TYCAN were applied in this paper. The fault tolerant control algorithm of the fuel cell engine and the battery man- agement system were considered. The demonstration experience verified that the fault tolerant control was very important for the fuel cell city bus.
基金This work was supported in part by Beijing Natural Science Foundation(JQ19013)the National Key Research and Development Program of China(2021ZD0112302)the National Natural Science Foundation of China(61773373).
文摘The core task of tracking control is to make the controlled plant track a desired trajectory.The traditional performance index used in previous studies cannot eliminate completely the tracking error as the number of time steps increases.In this paper,a new cost function is introduced to develop the value-iteration-based adaptive critic framework to solve the tracking control problem.Unlike the regulator problem,the iterative value function of tracking control problem cannot be regarded as a Lyapunov function.A novel stability analysis method is developed to guarantee that the tracking error converges to zero.The discounted iterative scheme under the new cost function for the special case of linear systems is elaborated.Finally,the tracking performance of the present scheme is demonstrated by numerical results and compared with those of the traditional approaches.
基金supported by the National Natural Science Foundation of China(Grant No.51275557,5142505)the National Science-Technology Support Plan Projects of China(Grant No.2013BAG14B01)
文摘The optimization of the control strategy of a plug-in hybrid electric bus(PHEB) for the repeatedly driven bus route is a key technique to improve the fuel economy. The widely used rule-based(RB) control strategy is lacking in the global optimization property, while the global optimization algorithms have an unacceptable computation complexity for real-time application. Therefore, a novel hybrid dynamic programming-rule based(DPRB) algorithm is brought forward to solve the global energy optimization problem in a real-time controller of PHEB. Firstly, a control grid is built up for a given typical city bus route, according to the station locations and discrete levels of battery state of charge(SOC). Moreover, the decision variables for the energy optimization at each point of the control grid might be deduced from an off-line dynamic programming(DP) with the historical running information of the driving cycle. Meanwhile, the genetic algorithm(GA) is adopted to replace the quantization process of DP permissible control set to reduce the computation burden. Secondly, with the optimized decision variables as control parameters according to the position and battery SOC of a PHEB, a RB control is used as an implementable controller for the energy management. Simulation results demonstrate that the proposed DPRB might distribute electric energy more reasonably throughout the bus route, compared with the optimized RB. The proposed hybrid algorithm might give a practicable solution, which is a tradeoff between the applicability of RB and the global optimization property of DP.
基金supported by the National Natural Science Foundation of China(No.71771062)Natural Science Foundation of Zhejiang Province(LY18G030021)China Postdoctoral Science Foundation(NO.2019M661214).
文摘Purpose–The purpose of this paper is to develop a dynamic control method to improve bus schedule adherence under connected bus system.Design/methodology/approach–The authors developed a dynamic programming model that optimally schedules the bus operating speed at road sections and multiple signal timing plans at intersections to improve bus schedule adherence.First,the bus route was partitioned into three types of sections:stop,road and intersection.Then,transit agencies can control buses in real time based on all collected information;i.e.control bus operating speed on road sections and adjust the signal timing plans through signal controllers to improve the schedule adherence in connected bus environment.Finally,bus punctuality at the downstream stop and the saturation degree deviations of intersections were selected as the evaluation criteria in optimizing signal control plans and bus speeds jointly.Findings–An illustrative case study by using a bus rapid transit line in Jinan city was performed to verify the proposed model.It revealed that based on the proposed strategy,the objective value could be reduced by 73.7%,which indicated that the punctuality was highly improved but not to incur excessive congestion for other vehicular traffic.Originality/value–In this paper,the authors applied speed guidance and the adjustment of the signal control plans for multiple cycles in advance to improve the scheduled stability;furthermore,the proposed control strategy can reduce the effect on private traffics to the utmost extend.
基金supported in part by the National Natural Science Foundation of China(62222301, 62073085, 62073158, 61890930-5, 62021003)the National Key Research and Development Program of China (2021ZD0112302, 2021ZD0112301, 2018YFC1900800-5)Beijing Natural Science Foundation (JQ19013)。
文摘Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are displayed, where the main results towards discrete-time systems and continuous-time systems are surveyed, respectively.Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environment is discussed, respectively, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environment attract enormous attention. The ADP architecture is revisited under the perspective of data-driven and RL frameworks,showing how they promote ADP formulation significantly.Finally, several typical control applications with respect to RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has d emonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence.
基金supported in part by Fundamental Research Funds for the Central Universities(2022JBZX024)in part by the National Natural Science Foundation of China(61872037,61273167)。
文摘Aimed at infinite horizon optimal control problems of discrete time-varying nonlinear systems,in this paper,a new iterative adaptive dynamic programming algorithm,which is the discrete-time time-varying policy iteration(DTTV)algorithm,is developed.The iterative control law is designed to update the iterative value function which approximates the index function of optimal performance.The admissibility of the iterative control law is analyzed.The results show that the iterative value function is non-increasingly convergent to the Bellman-equation optimal solution.To implement the algorithm,neural networks are employed and a new implementation structure is established,which avoids solving the generalized Bellman equation in each iteration.Finally,the optimal control laws for torsional pendulum and inverted pendulum systems are obtained by using the DTTV policy iteration algorithm,where the mass and pendulum bar length are permitted to be time-varying parameters.The effectiveness of the developed method is illustrated by numerical results and comparisons.
基金Project supported by the Open Research Project from the SKLMCCS(Grant No.20120106)the Fundamental Research Funds for the Central Universities of China(Grant No.FRF-TP-13-018A)+2 种基金the Postdoctoral Science Foundation of China(Grant No.2013M530527)the National Natural Science Foundation of China(Grant Nos.61304079 and 61374105)the Natural Science Foundation of Beijing,China(Grant No.4132078 and 4143065)
文摘We develop an online adaptive dynamic programming (ADP) based optimal control scheme for continuous-time chaotic systems. The idea is to use the ADP algorithm to obtain the optimal control input that makes the performance index function reach an optimum. The expression of the performance index function for the chaotic system is first presented. The online ADP algorithm is presented to achieve optimal control. In the ADP structure, neural networks are used to construct a critic network and an action network, which can obtain an approximate performance index function and the control input, respectively. It is proven that the critic parameter error dynamics and the closed-loop chaotic systems are uniformly ultimately bounded exponentially. Our simulation results illustrate the performance of the established optimal control method.
基金supported in part by National Natural Science Foundation of China(61533017,61273140,61304079,61374105,61379099,61233001)Fundamental Research Funds for the Central Universities(FRF-TP-15-056A3)the Open Research Project from SKLMCCS(20150104)