Dear Editor,This letter explores optimal formation control for a network of unmanned surface vessels(USVs).By designing an individual objective function for each USV,the optimal formation problem is transformed into a...Dear Editor,This letter explores optimal formation control for a network of unmanned surface vessels(USVs).By designing an individual objective function for each USV,the optimal formation problem is transformed into a noncooperative game.Under this game theoretic framework,the optimal formation is achieved by seeking the Nash equilibrium of the regularized game.A modular structure consisting of a distributed Nash equilibrium seeker and a regulator is proposed.展开更多
In this paper we study a bilinear optimal control problem for a diffusive Lotka-Volterra competition model with chemo-repulsion in a bounded domain of ℝ^(ℕ),N=2,3.This model describes the competition of two species in...In this paper we study a bilinear optimal control problem for a diffusive Lotka-Volterra competition model with chemo-repulsion in a bounded domain of ℝ^(ℕ),N=2,3.This model describes the competition of two species in which one of them avoid encounters with rivals through a chemo-repulsion mechanism.We prove the existence and uniqueness of weak-strong solutions,and then we analyze the existence of a global optimal solution for a related bilinear optimal control problem,where the control is acting on the chemical signal.Posteriorly,we derive first-order optimality conditions for local optimal solutions using the Lagrange multipliers theory.Finally,we propose a discrete approximation scheme of the optimality system based on the gradient method,which is validated with some computational experiments.展开更多
We present an optimal and robust quantum control method for efficient population transfer in asymmetric double quantum-dot molecules.We derive a long-duration control scheme that allows for highly efficient population...We present an optimal and robust quantum control method for efficient population transfer in asymmetric double quantum-dot molecules.We derive a long-duration control scheme that allows for highly efficient population transfer by accurately controlling the amplitude of a narrow-bandwidth pulse.To overcome fluctuations in control field parameters,we employ a frequency-domain quantum optimal control theory method to optimize the spectral phase of a single pulse with broad bandwidth while preserving the spectral amplitude.It is shown that this spectral-phase-only optimization approach can successfully identify robust and optimal control fields,leading to efficient population transfer to the target state while concurrently suppressing population transfer to undesired states.The method demonstrates resilience to fluctuations in control field parameters,making it a promising approach for reliable and efficient population transfer in practical applications.展开更多
This paper presents a novel sequential inverse optimal control(SIOC)method for discrete-time systems,which calculates the unknown weight vectors of the cost function in real time using the input and output of an optim...This paper presents a novel sequential inverse optimal control(SIOC)method for discrete-time systems,which calculates the unknown weight vectors of the cost function in real time using the input and output of an optimally controlled discrete-time system.The proposed method overcomes the limitations of previous approaches by eliminating the need for the invertible Jacobian assumption.It calculates the possible-solution spaces and their intersections sequentially until the dimension of the intersection space decreases to one.The remaining one-dimensional vector of the possible-solution space’s intersection represents the SIOC solution.The paper presents clear conditions for convergence and addresses the issue of noisy data by clarifying the conditions for the singular values of the matrices that relate to the possible-solution space.The effectiveness of the proposed method is demonstrated through simulation results.展开更多
The small and scattered enterprise pattern in the county economy has formed numerous sporadic pollution sources, hindering the centralized treatment of the water environment, increasing the cost and difficulty of trea...The small and scattered enterprise pattern in the county economy has formed numerous sporadic pollution sources, hindering the centralized treatment of the water environment, increasing the cost and difficulty of treatment. How enterprises can make reasonable decisions on their water environment behavior based on the external environment and their own factors is of great significance for scientifically and effectively designing water environment regulation mechanisms. Based on optimal control theory, this study investigates the design of contractual mechanisms for water environmental regulation for small and medium-sized enterprises. The enterprise is regarded as an independent economic entity that can adopt optimal control strategies to maximize its own interests. Based on the participation of multiple subjects including the government, enterprises, and the public, an optimal control strategy model for enterprises under contractual water environmental regulation is constructed using optimal control theory, and a method for calculating the amount of unit pollutant penalties is derived. The water pollutant treatment cost data of a paper company is selected to conduct empirical numerical analysis on the model. The results show that the increase in the probability of government regulation and public participation, as well as the decrease in local government protection for enterprises, can achieve the same regulatory effect while reducing the number of administrative penalties per unit. Finally, the implementation process of contractual water environmental regulation for small and medium-sized enterprises is designed.展开更多
Building emission reduction is an important way to achieve China’s carbon peaking and carbon neutrality goals.Aiming at the problem of low carbon economic operation of a photovoltaic energy storage building system,a ...Building emission reduction is an important way to achieve China’s carbon peaking and carbon neutrality goals.Aiming at the problem of low carbon economic operation of a photovoltaic energy storage building system,a multi-time scale optimal scheduling strategy based on model predictive control(MPC)is proposed under the consideration of load optimization.First,load optimization is achieved by controlling the charging time of electric vehicles as well as adjusting the air conditioning operation temperature,and the photovoltaic energy storage building system model is constructed to propose a day-ahead scheduling strategy with the lowest daily operation cost.Second,considering inter-day to intra-day source-load prediction error,an intraday rolling optimal scheduling strategy based on MPC is proposed that dynamically corrects the day-ahead dispatch results to stabilize system power fluctuations and promote photovoltaic consumption.Finally,taking an office building on a summer work day as an example,the effectiveness of the proposed scheduling strategy is verified.The results of the example show that the strategy reduces the total operating cost of the photovoltaic energy storage building system by 17.11%,improves the carbon emission reduction by 7.99%,and the photovoltaic consumption rate reaches 98.57%,improving the system’s low-carbon and economic performance.展开更多
In this paper, the matrix Riccati equation is considered. There is no general way for solving the matrix Riccati equation despite the many fields to which it applies. While scalar Riccati equation has been studied tho...In this paper, the matrix Riccati equation is considered. There is no general way for solving the matrix Riccati equation despite the many fields to which it applies. While scalar Riccati equation has been studied thoroughly, matrix Riccati equation of which scalar Riccati equations is a particular case, is much less investigated. This article proposes a change of variable that allows to find explicit solution of the Matrix Riccati equation. We then apply this solution to Optimal Control.展开更多
In this paper,a new optimal adaptive backstepping control approach for nonlinear systems under deception attacks via reinforcement learning is presented in this paper.The existence of nonlinear terms in the studied sy...In this paper,a new optimal adaptive backstepping control approach for nonlinear systems under deception attacks via reinforcement learning is presented in this paper.The existence of nonlinear terms in the studied system makes it very difficult to design the optimal controller using traditional methods.To achieve optimal control,RL algorithm based on critic–actor architecture is considered for the nonlinear system.Due to the significant security risks of network transmission,the system is vulnerable to deception attacks,which can make all the system state unavailable.By using the attacked states to design coordinate transformation,the harm brought by unknown deception attacks has been overcome.The presented control strategy can ensure that all signals in the closed-loop system are semi-globally ultimately bounded.Finally,the simulation experiment is shown to prove the effectiveness of the strategy.展开更多
In this paper we study optimal advertising problems that model the introduction of a new product into the market in the presence of carryover effects of the advertisement and with memory effects in the level of goodwi...In this paper we study optimal advertising problems that model the introduction of a new product into the market in the presence of carryover effects of the advertisement and with memory effects in the level of goodwill. In particular, we let the dynamics of the product goodwill to depend on the past, and also on past advertising efforts. We treat the problem by means of the stochastic Pontryagin maximum principle, that here is considered for a class of problems where in the state equation either the state or the control depend on the past. Moreover the control acts on the martingale term and the space of controls U can be chosen to be non-convex but now the space of controls U can be chosen to be non-convex. The maximum principle is thus formulated using a first-order adjoint Backward Stochastic Differential Equations (BSDEs), which can be explicitly computed due to the specific characteristics of the model, and a second-order adjoint relation.展开更多
Dear Editor,This letter focuses on the distributed optimal containment control of continuous-time multi-agent systems(CTMASs)with respect to the minimum-energy performance index over fixed topology.To achieve this,we ...Dear Editor,This letter focuses on the distributed optimal containment control of continuous-time multi-agent systems(CTMASs)with respect to the minimum-energy performance index over fixed topology.To achieve this,we firstly investigate the optimal containment control problem using the inverse optimal control method,where all states of followers asymptotically converge to the convex hull spanned by the leaders while some quadratic performance indexes get minimized.A sufficient condition for existence of the distributed optimal containment control protocol is derived.By introducing the parametric algebraic Riccati equation(PARE),it is strictly proved that the global performance index can be used to approximate the standard minimumenergy performance index as the parameters tends to infinity.In consequence,the standard minimum-energy cooperative containment control can be solved by local steady state feedback protocols.展开更多
Dear Editor,In this letter,the multi-objective optimal control problem of nonlinear discrete-time systems is investigated.A data-driven policy gradient algorithm is proposed in which the action-state value function is...Dear Editor,In this letter,the multi-objective optimal control problem of nonlinear discrete-time systems is investigated.A data-driven policy gradient algorithm is proposed in which the action-state value function is used to evaluate the policy.In the policy improvement process,the policy gradient based method is employed.展开更多
DC-DC converter-based multi-bus DC microgrids(MGs) in series have received much attention, where the conflict between voltage recovery and current balancing has been a hot topic. The lack of models that accurately por...DC-DC converter-based multi-bus DC microgrids(MGs) in series have received much attention, where the conflict between voltage recovery and current balancing has been a hot topic. The lack of models that accurately portray the electrical characteristics of actual MGs while is controller design-friendly has kept the issue active. To this end, this paper establishes a large-signal model containing the comprehensive dynamical behavior of the DC MGs based on the theory of high-order fully actuated systems, and proposes distributed optimal control based on this. The proposed secondary control method can achieve the two goals of voltage recovery and current sharing for multi-bus DC MGs. Additionally, the simple structure of the proposed approach is similar to one based on droop control, which allows this control technique to be easily implemented in a variety of modern microgrids with different configurations. In contrast to existing studies, the process of controller design in this paper is closely tied to the actual dynamics of the MGs. It is a prominent feature that enables engineers to customize the performance metrics of the system. In addition, the analysis of the stability of the closed-loop DC microgrid system, as well as the optimality and consensus of current sharing are given. Finally, a scaled-down solar and battery-based microgrid prototype with maximum power point tracking controller is developed in the laboratory to experimentally test the efficacy of the proposed control method.展开更多
In this article, the transmission dynamics of a Hand-Foot-Mouth disease model with treatment and vaccination interventions are studied. We calculated the basic reproduction number and proved the global stability of di...In this article, the transmission dynamics of a Hand-Foot-Mouth disease model with treatment and vaccination interventions are studied. We calculated the basic reproduction number and proved the global stability of disease-free equilibrium when R0 R0 > 1. Meanwhile, we obtained the optimal control strategies minimizing the cost of intervention and minimizing the infected person. We also give some numerical simulations to verify our theoretical results.展开更多
This paper studies a single degree of freedom system under free vibration and controlled by a general semiactive damping.A general integral of squared error is considered as the performance index.A one-time switching ...This paper studies a single degree of freedom system under free vibration and controlled by a general semiactive damping.A general integral of squared error is considered as the performance index.A one-time switching damping controller is proposed and optimized.The pontryagin maximum principle is used to prove that no other form of semi-active damping can provide the better performance than the proposed one-time switching damping.展开更多
In this paper, we propose the nonconforming virtual element method (NCVEM) discretization for the pointwise control constraint optimal control problem governed by elliptic equations. Based on the NCVEM approximation o...In this paper, we propose the nonconforming virtual element method (NCVEM) discretization for the pointwise control constraint optimal control problem governed by elliptic equations. Based on the NCVEM approximation of state equation and the variational discretization of control variables, we construct a virtual element discrete scheme. For the state, adjoint state and control variable, we obtain the corresponding prior estimate in H<sup>1</sup> and L<sup>2</sup> norms. Finally, some numerical experiments are carried out to support the theoretical results.展开更多
The paper addresses the decentralized optimal control and stabilization problems for interconnected systems subject to asymmetric information.Compared with previous work,a closed-loop optimal solution to the control p...The paper addresses the decentralized optimal control and stabilization problems for interconnected systems subject to asymmetric information.Compared with previous work,a closed-loop optimal solution to the control problem and sufficient and necessary conditions for the stabilization problem of the interconnected systems are given for the first time.The main challenge lies in three aspects:Firstly,the asymmetric information results in coupling between control and estimation and failure of the separation principle.Secondly,two extra unknown variables are generated by asymmetric information(different information filtration)when solving forward-backward stochastic difference equations.Thirdly,the existence of additive noise makes the study of mean-square boundedness an obstacle.The adopted technique is proving and assuming the linear form of controllers and establishing the equivalence between the two systems with and without additive noise.A dual-motor parallel drive system is presented to demonstrate the validity of the proposed algorithm.展开更多
BACKGROUND The controlling nutritional status(CONUT)score effectively reflects a patient’s nutritional status,which is closely related to cancer prognosis.This study invest-igated the relationship between the CONUT s...BACKGROUND The controlling nutritional status(CONUT)score effectively reflects a patient’s nutritional status,which is closely related to cancer prognosis.This study invest-igated the relationship between the CONUT score and prognosis after radical surgery for colorectal cancer,and compared the predictive ability of the CONUT score with other indexes.AIM To analyze the predictive performance of the CONUT score for the survival rate of colorectal cancer patients who underwent potentially curative resection.METHODS This retrospective analysis included 217 patients with newly diagnosed colorectal.The CONUT score was calculated based on the serum albumin level,total lymphocyte count,and total cholesterol level.The cutoff value of the CONUT score for predicting prognosis was 4 according to the Youden Index by the receiver operating characteristic curve.The associations between the CONUT score and the prognosis were performed using Kaplan-Meier curves and Cox regression analysis.RESULTS Using the cutoff value of the CONUT score,patients were stratified into CONUT low(n=189)and CONUT high groups(n=28).The CONUT high group had worse overall survival(OS)(P=0.013)and relapse-free survival(RFS)(P=0.015).The predictive performance of CONUT was superior to the modified Glasgow prognostic score,the prognostic nutritional index,and the neutrophil-to-lymphocyte ratio.Meanwhile,the predictive performances of CONUT+tumor node metastasis(TNM)stage for 3-year OS[area under the receiver operating characteristics curve(AUC)=0.803]and 3-year RFS(AUC=0.752)were no less than skeletal muscle mass index(SMI)+TNM stage.The CONUT score was negatively correlated with SMI(P<0.01).CONCLUSION As a nutritional indicator,the CONUT score could predict long-term outcomes after radical surgery for colorectal cancer,and its predictive ability was superior to other indexes.The correlation between the CONUT score and skeletal muscle may be one of the factors that play a predictive role.展开更多
Motivated by recent advances made in the study of dividend control and risk management problems involving the U.S.bankruptcy code,in this paper we follow[44]to revisit the De Finetti dividend control problem under the...Motivated by recent advances made in the study of dividend control and risk management problems involving the U.S.bankruptcy code,in this paper we follow[44]to revisit the De Finetti dividend control problem under the reorganization process and the regulator's intervention documented in U.S.Chapter 11 bankruptcy.We do this by further accommodating the fixed transaction costs on dividends to imitate the real-world procedure of dividend payments.Incorporating the fixed transaction costs transforms the targeting optimal dividend problem into an impulse control problem rather than a singular control problem,and hence computations and proofs that are distinct from[44]are needed.To account for the financial stress that is due to the more subtle concept of Chapter 11 bankruptcy,the surplus process after dividends is driven by a piece-wise spectrally negative Lévy process with endogenous regime switching.Some explicit expressions of the expected net present values under a double barrier dividend strategy,new to the literature,are established in terms of scale functions.With the help of these expressions,we are able to characterize the optimal strategy among the set of admissible double barrier dividend strategies.When the tail of the Lévy measure is log-convex,this optimal double barrier dividend strategy is then verified as the optimal dividend strategy,solving our optimal impulse control problem.展开更多
This paper studies a novel distributed optimization problem that aims to minimize the sum of the non-convex objective functionals of the multi-agent network under privacy protection, which means that the local objecti...This paper studies a novel distributed optimization problem that aims to minimize the sum of the non-convex objective functionals of the multi-agent network under privacy protection, which means that the local objective of each agent is unknown to others. The above problem involves complexity simultaneously in the time and space aspects. Yet existing works about distributed optimization mainly consider privacy protection in the space aspect where the decision variable is a vector with finite dimensions. In contrast, when the time aspect is considered in this paper, the decision variable is a continuous function concerning time. Hence, the minimization of the overall functional belongs to the calculus of variations. Traditional works usually aim to seek the optimal decision function. Due to privacy protection and non-convexity, the Euler-Lagrange equation of the proposed problem is a complicated partial differential equation.Hence, we seek the optimal decision derivative function rather than the decision function. This manner can be regarded as seeking the control input for an optimal control problem, for which we propose a centralized reinforcement learning(RL) framework. In the space aspect, we further present a distributed reinforcement learning framework to deal with the impact of privacy protection. Finally, rigorous theoretical analysis and simulation validate the effectiveness of our framework.展开更多
Aimed at infinite horizon optimal control problems of discrete time-varying nonlinear systems,in this paper,a new iterative adaptive dynamic programming algorithm,which is the discrete-time time-varying policy iterati...Aimed at infinite horizon optimal control problems of discrete time-varying nonlinear systems,in this paper,a new iterative adaptive dynamic programming algorithm,which is the discrete-time time-varying policy iteration(DTTV)algorithm,is developed.The iterative control law is designed to update the iterative value function which approximates the index function of optimal performance.The admissibility of the iterative control law is analyzed.The results show that the iterative value function is non-increasingly convergent to the Bellman-equation optimal solution.To implement the algorithm,neural networks are employed and a new implementation structure is established,which avoids solving the generalized Bellman equation in each iteration.Finally,the optimal control laws for torsional pendulum and inverted pendulum systems are obtained by using the DTTV policy iteration algorithm,where the mass and pendulum bar length are permitted to be time-varying parameters.The effectiveness of the developed method is illustrated by numerical results and comparisons.展开更多
基金supported by the National Key R&D Program of China(2022ZD0119604)the National Natural Science Foundation of China(NSFC),(62222308,62173181,62221004)+1 种基金the Natural Science Foundation of Jiangsu Province(BK20220139)the Young Elite Scientists Sponsorship Program by CAST(2021QNRC001)。
文摘Dear Editor,This letter explores optimal formation control for a network of unmanned surface vessels(USVs).By designing an individual objective function for each USV,the optimal formation problem is transformed into a noncooperative game.Under this game theoretic framework,the optimal formation is achieved by seeking the Nash equilibrium of the regularized game.A modular structure consisting of a distributed Nash equilibrium seeker and a regulator is proposed.
基金supported by Vicerrectoría de Investigación y Extensión of Universidad Industrial de Santander,Colombia,project 3704.
文摘In this paper we study a bilinear optimal control problem for a diffusive Lotka-Volterra competition model with chemo-repulsion in a bounded domain of ℝ^(ℕ),N=2,3.This model describes the competition of two species in which one of them avoid encounters with rivals through a chemo-repulsion mechanism.We prove the existence and uniqueness of weak-strong solutions,and then we analyze the existence of a global optimal solution for a related bilinear optimal control problem,where the control is acting on the chemical signal.Posteriorly,we derive first-order optimality conditions for local optimal solutions using the Lagrange multipliers theory.Finally,we propose a discrete approximation scheme of the optimality system based on the gradient method,which is validated with some computational experiments.
基金This work was supported by the National Natural Science Foundations of China(Grant Nos.12275033,61973317,and 12274470)the Natural Science Foundation of Hunan Province for Distinguished Young Scholars(Grant No.2022JJ10070)+1 种基金the Natural Science Foundation of Hunan Province(Grant No.2022JJ30582)the Scientific Research Fund of Hunan Provincial Education Department(Grant No.20A025).
文摘We present an optimal and robust quantum control method for efficient population transfer in asymmetric double quantum-dot molecules.We derive a long-duration control scheme that allows for highly efficient population transfer by accurately controlling the amplitude of a narrow-bandwidth pulse.To overcome fluctuations in control field parameters,we employ a frequency-domain quantum optimal control theory method to optimize the spectral phase of a single pulse with broad bandwidth while preserving the spectral amplitude.It is shown that this spectral-phase-only optimization approach can successfully identify robust and optimal control fields,leading to efficient population transfer to the target state while concurrently suppressing population transfer to undesired states.The method demonstrates resilience to fluctuations in control field parameters,making it a promising approach for reliable and efficient population transfer in practical applications.
文摘This paper presents a novel sequential inverse optimal control(SIOC)method for discrete-time systems,which calculates the unknown weight vectors of the cost function in real time using the input and output of an optimally controlled discrete-time system.The proposed method overcomes the limitations of previous approaches by eliminating the need for the invertible Jacobian assumption.It calculates the possible-solution spaces and their intersections sequentially until the dimension of the intersection space decreases to one.The remaining one-dimensional vector of the possible-solution space’s intersection represents the SIOC solution.The paper presents clear conditions for convergence and addresses the issue of noisy data by clarifying the conditions for the singular values of the matrices that relate to the possible-solution space.The effectiveness of the proposed method is demonstrated through simulation results.
文摘The small and scattered enterprise pattern in the county economy has formed numerous sporadic pollution sources, hindering the centralized treatment of the water environment, increasing the cost and difficulty of treatment. How enterprises can make reasonable decisions on their water environment behavior based on the external environment and their own factors is of great significance for scientifically and effectively designing water environment regulation mechanisms. Based on optimal control theory, this study investigates the design of contractual mechanisms for water environmental regulation for small and medium-sized enterprises. The enterprise is regarded as an independent economic entity that can adopt optimal control strategies to maximize its own interests. Based on the participation of multiple subjects including the government, enterprises, and the public, an optimal control strategy model for enterprises under contractual water environmental regulation is constructed using optimal control theory, and a method for calculating the amount of unit pollutant penalties is derived. The water pollutant treatment cost data of a paper company is selected to conduct empirical numerical analysis on the model. The results show that the increase in the probability of government regulation and public participation, as well as the decrease in local government protection for enterprises, can achieve the same regulatory effect while reducing the number of administrative penalties per unit. Finally, the implementation process of contractual water environmental regulation for small and medium-sized enterprises is designed.
文摘Building emission reduction is an important way to achieve China’s carbon peaking and carbon neutrality goals.Aiming at the problem of low carbon economic operation of a photovoltaic energy storage building system,a multi-time scale optimal scheduling strategy based on model predictive control(MPC)is proposed under the consideration of load optimization.First,load optimization is achieved by controlling the charging time of electric vehicles as well as adjusting the air conditioning operation temperature,and the photovoltaic energy storage building system model is constructed to propose a day-ahead scheduling strategy with the lowest daily operation cost.Second,considering inter-day to intra-day source-load prediction error,an intraday rolling optimal scheduling strategy based on MPC is proposed that dynamically corrects the day-ahead dispatch results to stabilize system power fluctuations and promote photovoltaic consumption.Finally,taking an office building on a summer work day as an example,the effectiveness of the proposed scheduling strategy is verified.The results of the example show that the strategy reduces the total operating cost of the photovoltaic energy storage building system by 17.11%,improves the carbon emission reduction by 7.99%,and the photovoltaic consumption rate reaches 98.57%,improving the system’s low-carbon and economic performance.
文摘In this paper, the matrix Riccati equation is considered. There is no general way for solving the matrix Riccati equation despite the many fields to which it applies. While scalar Riccati equation has been studied thoroughly, matrix Riccati equation of which scalar Riccati equations is a particular case, is much less investigated. This article proposes a change of variable that allows to find explicit solution of the Matrix Riccati equation. We then apply this solution to Optimal Control.
基金supported in part by the National Key R&D Program of China under Grants 2021YFE0206100in part by the National Natural Science Foundation of China under Grant 62073321+2 种基金in part by National Defense Basic Scientific Research Program JCKY2019203C029in part by the Science and Technology Development Fund,Macao SAR under Grants FDCT-22-009-MISE,0060/2021/A2 and 0015/2020/AMJin part by the financial support from the National Defense Basic Scientific Research Project(JCKY2020130C025).
文摘In this paper,a new optimal adaptive backstepping control approach for nonlinear systems under deception attacks via reinforcement learning is presented in this paper.The existence of nonlinear terms in the studied system makes it very difficult to design the optimal controller using traditional methods.To achieve optimal control,RL algorithm based on critic–actor architecture is considered for the nonlinear system.Due to the significant security risks of network transmission,the system is vulnerable to deception attacks,which can make all the system state unavailable.By using the attacked states to design coordinate transformation,the harm brought by unknown deception attacks has been overcome.The presented control strategy can ensure that all signals in the closed-loop system are semi-globally ultimately bounded.Finally,the simulation experiment is shown to prove the effectiveness of the strategy.
文摘In this paper we study optimal advertising problems that model the introduction of a new product into the market in the presence of carryover effects of the advertisement and with memory effects in the level of goodwill. In particular, we let the dynamics of the product goodwill to depend on the past, and also on past advertising efforts. We treat the problem by means of the stochastic Pontryagin maximum principle, that here is considered for a class of problems where in the state equation either the state or the control depend on the past. Moreover the control acts on the martingale term and the space of controls U can be chosen to be non-convex but now the space of controls U can be chosen to be non-convex. The maximum principle is thus formulated using a first-order adjoint Backward Stochastic Differential Equations (BSDEs), which can be explicitly computed due to the specific characteristics of the model, and a second-order adjoint relation.
基金supported by the National Nat-ural Science Foundation of China(61873215,62103342)the Natural Science Foundation of Sichuan Province(2022NSFSC0470,2022NSFSC0892).
文摘Dear Editor,This letter focuses on the distributed optimal containment control of continuous-time multi-agent systems(CTMASs)with respect to the minimum-energy performance index over fixed topology.To achieve this,we firstly investigate the optimal containment control problem using the inverse optimal control method,where all states of followers asymptotically converge to the convex hull spanned by the leaders while some quadratic performance indexes get minimized.A sufficient condition for existence of the distributed optimal containment control protocol is derived.By introducing the parametric algebraic Riccati equation(PARE),it is strictly proved that the global performance index can be used to approximate the standard minimumenergy performance index as the parameters tends to infinity.In consequence,the standard minimum-energy cooperative containment control can be solved by local steady state feedback protocols.
基金the National Natural Science Foundation of China(61922063,62273255,62150026)in part by the Shanghai International Science and Technology Cooperation Project(21550760900,22510712000)+1 种基金the Shanghai Municipal Science and Technology Major Project(2021SHZDZX0100)the Fundamental Research Funds for the Central Universities。
文摘Dear Editor,In this letter,the multi-objective optimal control problem of nonlinear discrete-time systems is investigated.A data-driven policy gradient algorithm is proposed in which the action-state value function is used to evaluate the policy.In the policy improvement process,the policy gradient based method is employed.
基金supported in part by the National Natural Science Foundation of China(62173255, 62188101)Shenzhen Key Laboratory of Control Theory and Intelligent Systems,(ZDSYS20220330161800001)。
文摘DC-DC converter-based multi-bus DC microgrids(MGs) in series have received much attention, where the conflict between voltage recovery and current balancing has been a hot topic. The lack of models that accurately portray the electrical characteristics of actual MGs while is controller design-friendly has kept the issue active. To this end, this paper establishes a large-signal model containing the comprehensive dynamical behavior of the DC MGs based on the theory of high-order fully actuated systems, and proposes distributed optimal control based on this. The proposed secondary control method can achieve the two goals of voltage recovery and current sharing for multi-bus DC MGs. Additionally, the simple structure of the proposed approach is similar to one based on droop control, which allows this control technique to be easily implemented in a variety of modern microgrids with different configurations. In contrast to existing studies, the process of controller design in this paper is closely tied to the actual dynamics of the MGs. It is a prominent feature that enables engineers to customize the performance metrics of the system. In addition, the analysis of the stability of the closed-loop DC microgrid system, as well as the optimality and consensus of current sharing are given. Finally, a scaled-down solar and battery-based microgrid prototype with maximum power point tracking controller is developed in the laboratory to experimentally test the efficacy of the proposed control method.
文摘In this article, the transmission dynamics of a Hand-Foot-Mouth disease model with treatment and vaccination interventions are studied. We calculated the basic reproduction number and proved the global stability of disease-free equilibrium when R0 R0 > 1. Meanwhile, we obtained the optimal control strategies minimizing the cost of intervention and minimizing the infected person. We also give some numerical simulations to verify our theoretical results.
基金supported by Vietnam Academy of Science and Technology(Grant No.VAST01.04/22-23)。
文摘This paper studies a single degree of freedom system under free vibration and controlled by a general semiactive damping.A general integral of squared error is considered as the performance index.A one-time switching damping controller is proposed and optimized.The pontryagin maximum principle is used to prove that no other form of semi-active damping can provide the better performance than the proposed one-time switching damping.
文摘In this paper, we propose the nonconforming virtual element method (NCVEM) discretization for the pointwise control constraint optimal control problem governed by elliptic equations. Based on the NCVEM approximation of state equation and the variational discretization of control variables, we construct a virtual element discrete scheme. For the state, adjoint state and control variable, we obtain the corresponding prior estimate in H<sup>1</sup> and L<sup>2</sup> norms. Finally, some numerical experiments are carried out to support the theoretical results.
基金supported by the National Natural Science Foundation of China(62273213,62073199,62103241)Natural Science Foundation of Shandong Province for Innovation and Development Joint Funds(ZR2022LZH001)+4 种基金Natural Science Foundation of Shandong Province(ZR2020MF095,ZR2021QF107)Taishan Scholarship Construction Engineeringthe Original Exploratory Program Project of National Natural Science Foundation of China(62250056)Major Basic Research of Natural Science Foundation of Shandong Province(ZR2021ZD14)High-level Talent Team Project of Qingdao West Coast New Area(RCTD-JC-2019-05)。
文摘The paper addresses the decentralized optimal control and stabilization problems for interconnected systems subject to asymmetric information.Compared with previous work,a closed-loop optimal solution to the control problem and sufficient and necessary conditions for the stabilization problem of the interconnected systems are given for the first time.The main challenge lies in three aspects:Firstly,the asymmetric information results in coupling between control and estimation and failure of the separation principle.Secondly,two extra unknown variables are generated by asymmetric information(different information filtration)when solving forward-backward stochastic difference equations.Thirdly,the existence of additive noise makes the study of mean-square boundedness an obstacle.The adopted technique is proving and assuming the linear form of controllers and establishing the equivalence between the two systems with and without additive noise.A dual-motor parallel drive system is presented to demonstrate the validity of the proposed algorithm.
基金Clinical Trials from the Affiliated Drum Tower Hospital,Medical School of Nanjing University,2022-LCYJ-PY-17CIMF-CSPEN Project,Z-2017-24-2211Project of Chinese Hospital Reform and Development Institute,Nanjing University and Aid project of Nanjing Drum Tower Hospital Health,Education&Research Foundation,NDYG2022090。
文摘BACKGROUND The controlling nutritional status(CONUT)score effectively reflects a patient’s nutritional status,which is closely related to cancer prognosis.This study invest-igated the relationship between the CONUT score and prognosis after radical surgery for colorectal cancer,and compared the predictive ability of the CONUT score with other indexes.AIM To analyze the predictive performance of the CONUT score for the survival rate of colorectal cancer patients who underwent potentially curative resection.METHODS This retrospective analysis included 217 patients with newly diagnosed colorectal.The CONUT score was calculated based on the serum albumin level,total lymphocyte count,and total cholesterol level.The cutoff value of the CONUT score for predicting prognosis was 4 according to the Youden Index by the receiver operating characteristic curve.The associations between the CONUT score and the prognosis were performed using Kaplan-Meier curves and Cox regression analysis.RESULTS Using the cutoff value of the CONUT score,patients were stratified into CONUT low(n=189)and CONUT high groups(n=28).The CONUT high group had worse overall survival(OS)(P=0.013)and relapse-free survival(RFS)(P=0.015).The predictive performance of CONUT was superior to the modified Glasgow prognostic score,the prognostic nutritional index,and the neutrophil-to-lymphocyte ratio.Meanwhile,the predictive performances of CONUT+tumor node metastasis(TNM)stage for 3-year OS[area under the receiver operating characteristics curve(AUC)=0.803]and 3-year RFS(AUC=0.752)were no less than skeletal muscle mass index(SMI)+TNM stage.The CONUT score was negatively correlated with SMI(P<0.01).CONCLUSION As a nutritional indicator,the CONUT score could predict long-term outcomes after radical surgery for colorectal cancer,and its predictive ability was superior to other indexes.The correlation between the CONUT score and skeletal muscle may be one of the factors that play a predictive role.
基金the financial support from the National Natural Science Foundation of China(12171405 and 11661074)the Program for New Century Excellent Talents in Fujian Province University+2 种基金the financial support from the Characteristic&Preponderant Discipline of Key Construction Universities in Zhejiang Province(Zhejiang Gongshang University-Statistics)Collaborative Innovation Center of Statistical Data Engineering Technology&ApplicationDigital+Discipline Construction Project(SZJ2022B004)。
文摘Motivated by recent advances made in the study of dividend control and risk management problems involving the U.S.bankruptcy code,in this paper we follow[44]to revisit the De Finetti dividend control problem under the reorganization process and the regulator's intervention documented in U.S.Chapter 11 bankruptcy.We do this by further accommodating the fixed transaction costs on dividends to imitate the real-world procedure of dividend payments.Incorporating the fixed transaction costs transforms the targeting optimal dividend problem into an impulse control problem rather than a singular control problem,and hence computations and proofs that are distinct from[44]are needed.To account for the financial stress that is due to the more subtle concept of Chapter 11 bankruptcy,the surplus process after dividends is driven by a piece-wise spectrally negative Lévy process with endogenous regime switching.Some explicit expressions of the expected net present values under a double barrier dividend strategy,new to the literature,are established in terms of scale functions.With the help of these expressions,we are able to characterize the optimal strategy among the set of admissible double barrier dividend strategies.When the tail of the Lévy measure is log-convex,this optimal double barrier dividend strategy is then verified as the optimal dividend strategy,solving our optimal impulse control problem.
基金supported in part by the National Natural Science Foundation of China(NSFC)(61773260)the Ministry of Science and Technology (2018YFB130590)。
文摘This paper studies a novel distributed optimization problem that aims to minimize the sum of the non-convex objective functionals of the multi-agent network under privacy protection, which means that the local objective of each agent is unknown to others. The above problem involves complexity simultaneously in the time and space aspects. Yet existing works about distributed optimization mainly consider privacy protection in the space aspect where the decision variable is a vector with finite dimensions. In contrast, when the time aspect is considered in this paper, the decision variable is a continuous function concerning time. Hence, the minimization of the overall functional belongs to the calculus of variations. Traditional works usually aim to seek the optimal decision function. Due to privacy protection and non-convexity, the Euler-Lagrange equation of the proposed problem is a complicated partial differential equation.Hence, we seek the optimal decision derivative function rather than the decision function. This manner can be regarded as seeking the control input for an optimal control problem, for which we propose a centralized reinforcement learning(RL) framework. In the space aspect, we further present a distributed reinforcement learning framework to deal with the impact of privacy protection. Finally, rigorous theoretical analysis and simulation validate the effectiveness of our framework.
基金supported in part by Fundamental Research Funds for the Central Universities(2022JBZX024)in part by the National Natural Science Foundation of China(61872037,61273167)。
文摘Aimed at infinite horizon optimal control problems of discrete time-varying nonlinear systems,in this paper,a new iterative adaptive dynamic programming algorithm,which is the discrete-time time-varying policy iteration(DTTV)algorithm,is developed.The iterative control law is designed to update the iterative value function which approximates the index function of optimal performance.The admissibility of the iterative control law is analyzed.The results show that the iterative value function is non-increasingly convergent to the Bellman-equation optimal solution.To implement the algorithm,neural networks are employed and a new implementation structure is established,which avoids solving the generalized Bellman equation in each iteration.Finally,the optimal control laws for torsional pendulum and inverted pendulum systems are obtained by using the DTTV policy iteration algorithm,where the mass and pendulum bar length are permitted to be time-varying parameters.The effectiveness of the developed method is illustrated by numerical results and comparisons.