Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and ...Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are displayed, where the main results towards discrete-time systems and continuous-time systems are surveyed, respectively.Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environment is discussed, respectively, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environment attract enormous attention. The ADP architecture is revisited under the perspective of data-driven and RL frameworks,showing how they promote ADP formulation significantly.Finally, several typical control applications with respect to RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has d emonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence.展开更多
A continuum opinion dynamic model is presented based on two rules. The first one considers the mobilities of the individuals, the second one supposes that the individuals update their opinions independently. The resul...A continuum opinion dynamic model is presented based on two rules. The first one considers the mobilities of the individuals, the second one supposes that the individuals update their opinions independently. The results of the model indicate that the bounded confidence εc, separating consensus and incoherent states, of a scale-free network is much smaller than the one of a lattice. If the system can reach the consensus state, the sum of all individuals' opinion change Oc(t) quickly decreases in an exponential form, while if it reaches the incoherent state finally, Oc(t) decreases slowly and has the punctuated equilibrium characteristic.展开更多
We present a simple rule which could generate scale-free networks with very large clustering coefficient and very small average distance. These networks, called the multistage random growing networks (MRGNs), are co...We present a simple rule which could generate scale-free networks with very large clustering coefficient and very small average distance. These networks, called the multistage random growing networks (MRGNs), are constructed by a two-stage adding process for each new node. The analytic results of the power-law exponent = 3 and the clustering coefficient C = 0.81 are obtained, which agree with the simulation results approximately. In addition, we find that the average distance of the networks increases logarithmically with the network size, which is consistent with the theoretical predictions. Since many real-world networks are both scale-free and small-world, the MRGNs may perform well in mimicking reality.展开更多
Social propagation denotes the spread phenomena directly correlated to the human world and society, which includes but is not limited to the diffusion of human epidemics, human-made malicious viruses, fake news, socia...Social propagation denotes the spread phenomena directly correlated to the human world and society, which includes but is not limited to the diffusion of human epidemics, human-made malicious viruses, fake news, social innovation, viral marketing, etc. Simulation and optimization are two major themes in social propagation, where network-based simulation helps to analyze and understand the social contagion, and problem-oriented optimization is devoted to contain or improve the infection results. Though there have been many models and optimization techniques, the matter of concern is that the increasing complexity and scales of propagation processes continuously refresh the former conclusions. Recently, evolutionary computation(EC) shows its potential in alleviating the concerns by introducing an evolving and developing perspective. With this insight, this paper intends to develop a comprehensive view of how EC takes effect in social propagation. Taxonomy is provided for classifying the propagation problems, and the applications of EC in solving these problems are reviewed. Furthermore, some open issues of social propagation and the potential applications of EC are discussed.This paper contributes to recognizing the problems in application-oriented EC design and paves the way for the development of evolving propagation dynamics.展开更多
基金supported in part by the National Natural Science Foundation of China(62222301, 62073085, 62073158, 61890930-5, 62021003)the National Key Research and Development Program of China (2021ZD0112302, 2021ZD0112301, 2018YFC1900800-5)Beijing Natural Science Foundation (JQ19013)。
文摘Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are displayed, where the main results towards discrete-time systems and continuous-time systems are surveyed, respectively.Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environment is discussed, respectively, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environment attract enormous attention. The ADP architecture is revisited under the perspective of data-driven and RL frameworks,showing how they promote ADP formulation significantly.Finally, several typical control applications with respect to RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has d emonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence.
基金Supported by the National Basic Research Programme of China under Grant No 2006CB705500, the National Natural Science Foundation of China under Grant Nos 10635040, 10532060, 70571074 and 10472116, the Special Research Funds for Theoretical Physics Frontier Problems (A0524701), the President Fund of Chinese Academy of Sciences, the Specialized Research Fund for the Doctoral Programme of Higher Education of China, and the Research Fund of the Education Department of Liaoning Province (20060140). The authors thank Dr Ming Zhao for her comments and suggestions.
文摘A continuum opinion dynamic model is presented based on two rules. The first one considers the mobilities of the individuals, the second one supposes that the individuals update their opinions independently. The results of the model indicate that the bounded confidence εc, separating consensus and incoherent states, of a scale-free network is much smaller than the one of a lattice. If the system can reach the consensus state, the sum of all individuals' opinion change Oc(t) quickly decreases in an exponential form, while if it reaches the incoherent state finally, Oc(t) decreases slowly and has the punctuated equilibrium characteristic.
基金Supported by the National Natural Science Foundation of China under Grant Nos 70431001 and 70271046.
文摘We present a simple rule which could generate scale-free networks with very large clustering coefficient and very small average distance. These networks, called the multistage random growing networks (MRGNs), are constructed by a two-stage adding process for each new node. The analytic results of the power-law exponent = 3 and the clustering coefficient C = 0.81 are obtained, which agree with the simulation results approximately. In addition, we find that the average distance of the networks increases logarithmically with the network size, which is consistent with the theoretical predictions. Since many real-world networks are both scale-free and small-world, the MRGNs may perform well in mimicking reality.
基金by National Key Research and Development Project,Ministry of Science and Technology,China(No.2018AAA0101300)National Natural Science Foundation of China(Nos.61976093 and 61873097)+1 种基金Guangdong-Hong Kong Joint Innovative Platform of Big Data and Computational Intelligence(No.2018B050502006)Guangdong Natural Science Foundation Research Team(No.2018B030312003).
文摘Social propagation denotes the spread phenomena directly correlated to the human world and society, which includes but is not limited to the diffusion of human epidemics, human-made malicious viruses, fake news, social innovation, viral marketing, etc. Simulation and optimization are two major themes in social propagation, where network-based simulation helps to analyze and understand the social contagion, and problem-oriented optimization is devoted to contain or improve the infection results. Though there have been many models and optimization techniques, the matter of concern is that the increasing complexity and scales of propagation processes continuously refresh the former conclusions. Recently, evolutionary computation(EC) shows its potential in alleviating the concerns by introducing an evolving and developing perspective. With this insight, this paper intends to develop a comprehensive view of how EC takes effect in social propagation. Taxonomy is provided for classifying the propagation problems, and the applications of EC in solving these problems are reviewed. Furthermore, some open issues of social propagation and the potential applications of EC are discussed.This paper contributes to recognizing the problems in application-oriented EC design and paves the way for the development of evolving propagation dynamics.