期刊文献+
共找到1,815篇文章
< 1 2 91 >
每页显示 20 50 100
Policy Gradient Adaptive Dynamic Programming for Model-Free Multi-Objective Optimal Control
1
作者 Hao Zhang Yan Li +2 位作者 Zhuping Wang Yi Ding Huaicheng Yan 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第4期1060-1062,共3页
Dear Editor,In this letter,the multi-objective optimal control problem of nonlinear discrete-time systems is investigated.A data-driven policy gradient algorithm is proposed in which the action-state value function is... Dear Editor,In this letter,the multi-objective optimal control problem of nonlinear discrete-time systems is investigated.A data-driven policy gradient algorithm is proposed in which the action-state value function is used to evaluate the policy.In the policy improvement process,the policy gradient based method is employed. 展开更多
关键词 policy GRADIENT optimal
下载PDF
Policy Iteration for Optimal Control of Discrete-Time Time-Varying Nonlinear Systems 被引量:1
2
作者 Guangyu Zhu Xiaolu Li +2 位作者 Ranran Sun Yiyuan Yang Peng Zhang 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2023年第3期781-791,共11页
Aimed at infinite horizon optimal control problems of discrete time-varying nonlinear systems,in this paper,a new iterative adaptive dynamic programming algorithm,which is the discrete-time time-varying policy iterati... Aimed at infinite horizon optimal control problems of discrete time-varying nonlinear systems,in this paper,a new iterative adaptive dynamic programming algorithm,which is the discrete-time time-varying policy iteration(DTTV)algorithm,is developed.The iterative control law is designed to update the iterative value function which approximates the index function of optimal performance.The admissibility of the iterative control law is analyzed.The results show that the iterative value function is non-increasingly convergent to the Bellman-equation optimal solution.To implement the algorithm,neural networks are employed and a new implementation structure is established,which avoids solving the generalized Bellman equation in each iteration.Finally,the optimal control laws for torsional pendulum and inverted pendulum systems are obtained by using the DTTV policy iteration algorithm,where the mass and pendulum bar length are permitted to be time-varying parameters.The effectiveness of the developed method is illustrated by numerical results and comparisons. 展开更多
关键词 Adaptive critic designs adaptive dynamic programming approximate dynamic programming optimal control policy iteration TIME-VARYING
下载PDF
THE OPTIMAL DEDUCTIBLE AND COVERAGE IN INSURANCE CONTRACTS AND EQUILIBRIUM RISK SHARING POLICIES
3
作者 蹇玲玲 《Acta Mathematica Scientia》 SCIE CSCD 2023年第3期1347-1364,共18页
In this paper, we consider the optimal risk sharing problem between two parties in the insurance business: the insurer and the insured. The risk is allocated between the insurer and the insured by setting a deductible... In this paper, we consider the optimal risk sharing problem between two parties in the insurance business: the insurer and the insured. The risk is allocated between the insurer and the insured by setting a deductible and coverage in the insurance contract. We obtain the optimal deductible and coverage by considering the expected product of the two parties' utilities of terminal wealth according to stochastic optimal control theory. An equilibrium policy is also derived for when there are both a deductible and coverage;this is done by modelling the problem as a stochastic game in a continuous-time framework. A numerical example is provided to illustrate the results of the paper. 展开更多
关键词 deductible and coverage equilibrium policy stochastic optimal control Hamilton-Jacobi-Bellman equation
下载PDF
Optimal quasi-periodic maintenance policies for two-unit series system 被引量:2
4
作者 高文科 张志胜 +1 位作者 周一帆 甘淑媛 《Journal of Southeast University(English Edition)》 EI CAS 2013年第4期450-455,共6页
To investigate the effects of various random factors on the preventive maintenance (PM) decision-making of one type of two-unit series system, an optimal quasi-periodic PM policy is introduced. Assume that PM is per... To investigate the effects of various random factors on the preventive maintenance (PM) decision-making of one type of two-unit series system, an optimal quasi-periodic PM policy is introduced. Assume that PM is perfect for unit 1 and only mechanical service for unit 2 in the model. PM activity is randomly performed according to a dynamic PM plan distributed in each implementation period. A replacement is determined based on the competing results of unplanned and planned replacements. The unplanned replacement is trigged by a catastrophic failure of unit 2, and the planned replacement is executed when the PM number reaches the threshold N. Through modeling and analysis, a solution algorithm for an optimal implementation period and the PM number is given, and optimal process and parametric sensitivity are provided by a numerical example. Results show that the implementation period should be decreased as soon as possible under the condition of meeting the needs of practice, which can increase mean operating time and decrease the long-run cost rate. 展开更多
关键词 maintenance policy optimization quasi-periodic preventive maintenance two-unit series system
下载PDF
OPTIMAL HARVESTING POLICY FOR INSHORE-OFFSHORE FISHERY MODEL WITH IMPULSIVE DIFFUSION 被引量:7
5
作者 董玲珍 陈兰荪 孙丽华 《Acta Mathematica Scientia》 SCIE CSCD 2007年第2期405-412,共8页
This article studies the inshore-offshore fishery model with impulsive diffusion. The existence and global asymptotic stability of both the trivial periodic solution and the positive periodic solution are obtained. Th... This article studies the inshore-offshore fishery model with impulsive diffusion. The existence and global asymptotic stability of both the trivial periodic solution and the positive periodic solution are obtained. The complexity of this system is also analyzed. Moreover, the optimal harvesting policy are given for the inshore subpopulation, which includes the maximum sustainable yield and the corresponding harvesting effort. 展开更多
关键词 Impulsive diffusion inshore-offshore fishery model global asymptotic stability periodic solution optimal harvesting policy
下载PDF
Optimal switching policy for performance enhancement of distributed parameter systems based on event-driven control 被引量:1
6
作者 穆文英 崔宝同 +1 位作者 楼旭阳 李纹 《Chinese Physics B》 SCIE EI CAS CSCD 2014年第7期211-217,共7页
This paper aims to improve the performance of a class of distributed parameter systems for the optimal switching of actuators and controllers based on event-driven control. It is assumed that in the available multiple... This paper aims to improve the performance of a class of distributed parameter systems for the optimal switching of actuators and controllers based on event-driven control. It is assumed that in the available multiple actuators, only one actuator can receive the control signal and be activated over an unfixed time interval, and the other actuators keep dormant. After incorporating a state observer into the event generator, the event-driven control loop and the minimum inter-event time are ultimately bounded. Based on the event-driven state feedback control, the time intervals of unfixed length can be obtained. The optimal switching policy is based on finite horizon linear quadratic optimal control at the beginning of each time subinterval. A simulation example demonstrate the effectiveness of the proposed policy. 展开更多
关键词 distributed parameter systems optimal switching policy EVENT-DRIVEN
下载PDF
RECURSIVE UTILITY,PRODUCTIVE GOVERNMENT EXPENDITURE AND OPTIMAL FISCAL POLICY 被引量:1
7
作者 Wang Haijun Hu Shigeng Zhang Xueqing 《Applied Mathematics(A Journal of Chinese Universities)》 SCIE CSCD 2005年第3期277-288,共12页
This paper employs a stochastic endogenous growth model extended to the case of a recursive utility function which can disentangle intertemporal substitution from risk aversion to analyze productive government expendi... This paper employs a stochastic endogenous growth model extended to the case of a recursive utility function which can disentangle intertemporal substitution from risk aversion to analyze productive government expenditure and optimal fiscal policy, particularly stresses the importance of factor income. First, the explicit solutions of the central planner's stochastic optimization problem are derived, the growth maximizing and welfare-maximizing government expenditure policies are obtained and their standing in conflict or coincidence depends upon intertemporal substitution. Second, the explicit solutions of the representative individual's stochastic optimization problem which permits to tax on capital income and labor income separately are derived ,and it is found that the effect of risk on growth crucially depends on the degree of risk aversion,the intertemporal elasticity of substitution and the capital income share. Finally, a flexible optimal tax policy which can be internally adjusted to a certain extent is derived, and it is found that the distribution of factor income plays an important role in designing the optimal tax policy. 展开更多
关键词 endogenous growth recursive utility productive government expenditure optimal fiscal policy.
下载PDF
Analysis of a POMDP Model for an Optimal Maintenance Problem with Multiple Imperfect Repairs
8
作者 Nobuyuki Tamura 《American Journal of Operations Research》 2023年第6期133-146,共14页
I consider a system whose deterioration follows a discrete-time and discrete-state Markov chain with an absorbing state. When the system is put into practice, I may select operation (wait), imperfect repair, or replac... I consider a system whose deterioration follows a discrete-time and discrete-state Markov chain with an absorbing state. When the system is put into practice, I may select operation (wait), imperfect repair, or replacement at each discrete-time point. The true state of the system is not known when it is operated. Instead, the system is monitored after operation and some incomplete information concerned with the deterioration is obtained for decision making. Since there are multiple imperfect repairs, I can select one option from them when the imperfect repair is preferable to operation and replacement. To express this situation, I propose a POMDP model and theoretically investigate the structure of an optimal maintenance policy minimizing a total expected discounted cost for an unbounded horizon. Then two stochastic orders are used for the analysis of our problem. 展开更多
关键词 Partially Observable Markov Decision Process Imperfect Repair Stochastic Order Monotone Property optimal Maintenance policy
下载PDF
Wind Turbine Optimal Preventive Maintenance Scheduling Using Fibonacci Search and Genetic Algorithm
9
作者 Ekamdeep Singh Sajad Saraygord Afshari Xihui Liang 《Journal of Dynamics, Monitoring and Diagnostics》 2023年第3期157-169,共13页
Maintenance scheduling is essential and crucial for wind turbines (WTs) to avoid breakdowns andreduce maintenance costs. Many maintenance models have been developed for WTs’ maintenance planning, suchas corrective, p... Maintenance scheduling is essential and crucial for wind turbines (WTs) to avoid breakdowns andreduce maintenance costs. Many maintenance models have been developed for WTs’ maintenance planning, suchas corrective, preventive, and predictive maintenance. Due to communities’ dependence on WTs for electricityneeds, preventive maintenance is the most widely used method for maintenance scheduling. The downside tousing this approach is that preventive maintenance (PM) is often done in fixed intervals, which is inefficient. In thispaper, a more detailed maintenance plan for a 2 MW WT has been developed. The paper’s focus is to minimize aWT’s maintenance cost based on a WT’s reliability model. This study uses a two-layer optimization framework:Fibonacci and genetic algorithm. The first layer in the optimization method (Fibonacci) finds the optimal numberof PM required for the system. In the second layer, the optimal times for preventative maintenance and optimalcomponents to maintain have been determined to minimize maintenance costs. The Monte Carlo simulationestimates WT component failure times using their lifetime distributions from the reliability model. The estimatedfailure times are then used to determine the overall corrective and PM costs during the system’s lifetime. Finally,an optimal PM schedule is proposed for a 2 MW WT using the presented method. The method used in this papercan be expanded to a wind farm or similar engineering systems. 展开更多
关键词 cost-based maintenance scheduling genetic algorithm hierarchical optimization preventive maintenance reliability modeling wind turbine maintenance policy
下载PDF
Distributive Disturbance and Optimal Policy in Stochastic Control Model
10
作者 汪红初 胡适耕 张学清 《Journal of Southwest Jiaotong University(English Edition)》 2006年第4期408-414,共7页
To investigate the equilibrium relationships between the volatility of capital and income, taxation, and ance in a stochastic control model, the uniqueness of the solution to this model was proved by using the method ... To investigate the equilibrium relationships between the volatility of capital and income, taxation, and ance in a stochastic control model, the uniqueness of the solution to this model was proved by using the method of dynamic programming under the introduction of distributive disturbance and elastic labor supply. Furthermore, the effects of two types of shocks on labor-leisure choice, economic growth rate and welfare were numerically analyzed, and then the optimal tax policy was derived. 展开更多
关键词 Stochastic optimization Dynamic programming Bellman equation Macroeconomic equilibrium optimal policy
下载PDF
Optimal policy for controlling two-server queueing systems with jockeying
11
作者 LIN Bing LIN Yuchen BHATNAGAR Rohit 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2022年第1期144-155,共12页
This paper studies the optimal policy for joint control of admission, routing, service, and jockeying in a queueing system consisting of two exponential servers in parallel.Jobs arrive according to a Poisson process.U... This paper studies the optimal policy for joint control of admission, routing, service, and jockeying in a queueing system consisting of two exponential servers in parallel.Jobs arrive according to a Poisson process.Upon each arrival, an admission/routing decision is made, and the accepted job is routed to one of the two servers with each being associated with a queue.After each service completion, the servers have an option of serving a job from its own queue, serving a jockeying job from another queue, or staying idle.The system performance is inclusive of the revenues from accepted jobs, the costs of holding jobs in queues, the service costs and the job jockeying costs.To maximize the total expected discounted return, we formulate a Markov decision process(MDP) model for this system.The value iteration method is employed to characterize the optimal policy as a hedging point policy.Numerical studies verify the structure of the hedging point policy which is convenient for implementing control actions in practice. 展开更多
关键词 queueing system jockeying optimal policy Markov decision process(MDP) dynamic programming
下载PDF
Optimal Water Pipe Replacement Policy
12
作者 Harrison O. Amuji Chukwudi J. Ogbonna +2 位作者 Geoffrey U. Ugwuanyim Hycinth C. Iwu Okechukwu B. Nwanyibuife 《Open Journal of Optimization》 2018年第2期41-49,共9页
Water scarcity is the major problem confronting both urban and rural dwellers in Enugu State. This scarcity emanated from indiscriminate pipe failure, lack of adequate maintenance, uncertainty on the time of repair or... Water scarcity is the major problem confronting both urban and rural dwellers in Enugu State. This scarcity emanated from indiscriminate pipe failure, lack of adequate maintenance, uncertainty on the time of repair or replacement of pipes etc. There is no systematic approach to determining replacement or repair time of the pipes. Hence, the rule of thumb is used in making such a vital decision. The population is increasing, houses are built but the network is not expanded and the existing ones that were installed for no less than two to three decades ago are not maintained. These compounded the problem of scarcity of water in the state. Replacement or repair of water pipes when they are seen spilling water cannot solve this lingering problem. The solution can be achieved by developing an adequate predictive model for water pipe replacement. Hence, this research is aimed at providing a solution to this problem of water scarcity by suggesting a policy that will be used for better planning. The interests in this paper were to obtain a water pipe failure model, the intensity function λ(t) [failure rate], the reliability R(t) and the optimal time of replacement and they were achieved. It was observed that the failure rate of the pipes increases with time while their reliability deteriorates with time. Hence, the Optimal replacement policy is that each pipe should be replaced after 4th break when the reliability = 0.0011. 展开更多
关键词 Reliability Non-Homogenous POISSON Process REPAIRABLE System optimal Water PIPE REPLACEMENT policy Failure Rate
下载PDF
Optimal Price Strategy under Price-Matching Policy
13
作者 Vivian Okere Wen Chen 《Journal of Applied Mathematics and Physics》 2020年第12期2981-2998,共18页
The paper explores the optimal price strategy under the price-matching policy. First, the paper formulates the demand function under the price match policy and then discovers the retailer’s best response facing the p... The paper explores the optimal price strategy under the price-matching policy. First, the paper formulates the demand function under the price match policy and then discovers the retailer’s best response facing the price-matching pressure. From the theoretical analysis, we discover how the number of retailers plays an important role during the competition. When only two retailers are involved, the final prices may not converge to a single value. However, when more retailers are involved, the final price will converge to a single value. We also use numerical studies to illuminate the change of the prices over the time period, the sensitivity of the final price to the increment/decrement of initial prices. Finally, we provide managerial suggestions to both producers and retailers. 展开更多
关键词 Price-Matching policy optimal Pricing Retail Management
下载PDF
Optimal Static Partition Configuration in ARINC653 System 被引量:4
14
作者 Sheng-Lin Gui Lei Luo Sen-Sen Tang Yang Meng 《Journal of Electronic Science and Technology》 CAS 2011年第4期373-378,共6页
ARINC653 systems, which have been widely used in avionics industry, are an important class of safety-critical applications. Partitions are the core concept in the Arinc653 system architecture. Due to the existence of ... ARINC653 systems, which have been widely used in avionics industry, are an important class of safety-critical applications. Partitions are the core concept in the Arinc653 system architecture. Due to the existence of partitions, the system designer must allocate adequate time slots statically to each partition in the design phase. Although some time slot allocation policies could be borrowed from task scheduling policies, no existing literatures give an optimal allocation policy. In this paper, we present a partition configuration policy and prove that this policy is optimal in the sense that if this policy fails to configure adequate time slots to each partition, nor do other policies. Then, by simulation, we show the effects of different partition configuration policies on time slot allocation of partitions and task response time, respectively. 展开更多
关键词 ARINC653 earliest-next release time first policy optimal partition configuration policy real-time systems.
下载PDF
THE OPTIMAL STRATEGY FOR INSURANCE COMPANY UNDER THE INFLUENCE OF TERMINAL VALUE 被引量:3
15
作者 刘伟 袁海丽 胡亦钧 《Acta Mathematica Scientia》 SCIE CSCD 2011年第3期1077-1090,共14页
This paper considers a model of an insurance company which is allowed to invest a risky asset and to purchase proportional reinsurance. The objective is to find the policy which maximizes the expected total discounted... This paper considers a model of an insurance company which is allowed to invest a risky asset and to purchase proportional reinsurance. The objective is to find the policy which maximizes the expected total discounted dividend pay-out until the time of bankruptcy and the terminal value of the company under liquidity constraint. We find the solution of this problem via solving the problem with zero terminal value. We also analyze the influence of terminal value on the optimal policy. 展开更多
关键词 proportional reinsurance terminal value optimal policy HJB equation
下载PDF
Replenishment policy and inventory optimization for supply-hub with liability period consideration 被引量:1
16
作者 李果 黄焜 +1 位作者 姚琦 马士华 《Journal of Central South University》 SCIE EI CAS 2013年第10期2914-2921,共8页
A replenishment decision-making model for supply-hub is firstly established from the angle of supplier, and optimal replenishment decision of the supplier is analyzed. Then, inventory optimization model for supply-hub... A replenishment decision-making model for supply-hub is firstly established from the angle of supplier, and optimal replenishment decision of the supplier is analyzed. Then, inventory optimization model for supply-hub is formulated from the angle of the manufacturer, and the optimization algorithm for obtaining optimal inventory levels is given. The result shows that liability period decides the share of the inventory cost between two sides in supply chain. With the increase of liability period, the service level has been quickly reduced even though the manufacturer's cost has been cut down by transferring the inventory cost to the supplier. As to the safety inventory, if the lower bound of components safety inventory increases, the supplier's cost will rise up more slowly than the liability period does, while the service levels increases as the safety inventory's lower bound is raised. 展开更多
关键词 LIABILITY PERIOD SUPPLY-HUB REPLENISHMENT policy INVENTORY optimIZATION lead time
下载PDF
Optimization on bicriterion policies for M/G/1 system with second optional service 被引量:1
17
作者 Jau-chuan KE Yunn-kuang CHU 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2008年第10期1437-1445,共9页
We compare the optimal operating cost of the two bicriterion policies, <p,T> and <p,N>, for an M/G/1 queueing system with second optional service, in which the length of the vacation period is randomly con... We compare the optimal operating cost of the two bicriterion policies, <p,T> and <p,N>, for an M/G/1 queueing system with second optional service, in which the length of the vacation period is randomly controlled either by the number of arrivals during the idle period or by a timer. After all the customers are served in the queue exhaustively, the server immediately takes a vacation and may operate <p,T> policy or <p,N> policy. For the two bicriterion policies, the total average cost function per unit time is developed to search the optimal stationary operating policies at a minimum cost. Based upon the optimal cost the explicit forms for joint optimum threshold values of (p,T) and (p,N) are obtained. 展开更多
关键词 Average operating cost Bicriterion policy optimization comparisons Optional service optimal threshold values
下载PDF
STUDY ON THE OPTIMIZATION OF TRANSPORT CONTROL POLICY IN COMMUNICATION NETWORK 被引量:1
18
作者 Fan Shuyan Han Weizhan Lu Ran 《Journal of Electronics(China)》 2010年第2期261-266,共6页
In communication networks with policy-based Transport Control on-Demand (TCoD) function,the transport control policies play a great impact on the network effectiveness. To evaluate and optimize the transport policies ... In communication networks with policy-based Transport Control on-Demand (TCoD) function,the transport control policies play a great impact on the network effectiveness. To evaluate and optimize the transport policies in communication network,a policy-based TCoD network model is given and a comprehensive evaluation index system of the network effectiveness is put forward from both network application and handling mechanism perspectives. A TCoD network prototype system based on Asynchronous Transfer Mode/Multi-Protocol Label Switching (ATM/MPLS) is introduced and some experiments are performed on it. The prototype system is evaluated and analyzed with the comprehensive evaluation index system. The results show that the index system can be used to judge whether the communication network can meet the application requirements or not,and can provide references for the optimization of the transport policies so as to improve the communication network effectiveness. 展开更多
关键词 Communication network Comprehensive evaluation index system Network Application Effectiveness (NAE) Transport Control on-Demand (TCoD) policy optimization
下载PDF
Optimization and Regulation Policy for Land Use Changes Based on Low-carbon Emission in Developed Regions of China 被引量:1
19
作者 Degui YU Qun WU 《Asian Agricultural Research》 2017年第6期67-76,共10页
Land use and cover change(LUCC) is one of the important causes of the Earth’s carbon cycle imbalances resulting from failure in optimizing land use. The solution to this problem has been the hotspot of research in la... Land use and cover change(LUCC) is one of the important causes of the Earth’s carbon cycle imbalances resulting from failure in optimizing land use. The solution to this problem has been the hotspot of research in land and environmental science. We took 'low carbon', 'energy saving' and 'high-efficiency' as the goals of land use optimization,and integrated Markov-CA(Cellular Automaton),the Grid-Fractal model and GIS,in order to study carbon emission objective function,to establish a simulation method for land use spatial allocation optimization,to evaluate the effect of the method on carbon emissions. Regulation policy on three types of land use spatial allocation was proposed,including 'low-carbon type', 'low-carbon-economic type' and 'economic type'. We applied the method to analyze the land use spatial allocation in Taixing City of the 'Yangtze River Delta' regions in China,and obtained the following results:(i) The three optimization types would improve carbon emissions by 3. 21%,1. 80% and 0. 36% respectively in 2020,compared with 2010;(ii) The actual planning for 2020 was close to the 'low-carbon-economic type';(iii) The optimization method and regulation policy,combining local optimization and global control,could meet the sustainable multi-objective requirements for low-carbon constraints of land use spatial allocation. The result of this research could also serve as a reference for exploration into patterns of regional low-carbon land use and measures for energy saving and emission reduction. 展开更多
关键词 Regulation policy Land use optimization Low-carbon emission Markov-CA model Developed regions of China
下载PDF
Distributionally Robust Optimal Dispatch of Virtual Power Plant Based on Moment of Renewable Energy Resource 被引量:1
20
作者 Wenlu Ji YongWang +2 位作者 Xing Deng Ming Zhang Ting Ye 《Energy Engineering》 EI 2022年第5期1967-1983,共17页
Virtual power plants can effectively integrate different types of distributed energy resources,which have become a new operation mode with substantial advantages such as high flexibility,adaptability,and economy.This ... Virtual power plants can effectively integrate different types of distributed energy resources,which have become a new operation mode with substantial advantages such as high flexibility,adaptability,and economy.This paper proposes a distributionally robust optimal dispatch approach for virtual power plants to determine an optimal day-ahead dispatch under uncertainties of renewable energy sources.The proposed distributionally robust approach characterizes probability distributions of renewable power output by moments.In this regard,the faults of stochastic optimization and traditional robust optimization can be overcome.Firstly,a second-order cone-based ambiguity set that incorporates the first and second moments of renewable power output is constructed,and a day-ahead two-stage distributionally robust optimization model is proposed for virtual power plants participating in day-ahead electricity markets.Then,an effective solution method based on the affine policy and second-order cone duality theory is employed to reformulate the proposed model into a deterministic mixed-integer second-order cone programming problem,which improves the computational efficiency of the model.Finally,the numerical results demonstrate that the proposed method achieves a better balance between robustness and economy.They also validate that the dispatch strategy of virtual power plants can be adjusted to reduce costs according to the moment information of renewable power output. 展开更多
关键词 Virtual power plant optimal dispatch UNCERTAINTY distributionally robust optimization affine policy
下载PDF
上一页 1 2 91 下一页 到第
使用帮助 返回顶部