期刊文献+
共找到5篇文章
< 1 >
每页显示 20 50 100
Feature-Based Aggregation and Deep Reinforcement Learning:A Survey and Some New Implementations 被引量:14
1
作者 Dimitri P.Bertsekas 《IEEE/CAA Journal of Automatica Sinica》 EI CSCD 2019年第1期1-31,共31页
In this paper we discuss policy iteration methods for approximate solution of a finite-state discounted Markov decision problem, with a focus on feature-based aggregation methods and their connection with deep reinfor... In this paper we discuss policy iteration methods for approximate solution of a finite-state discounted Markov decision problem, with a focus on feature-based aggregation methods and their connection with deep reinforcement learning schemes. We introduce features of the states of the original problem, and we formulate a smaller "aggregate" Markov decision problem, whose states relate to the features. We discuss properties and possible implementations of this type of aggregation, including a new approach to approximate policy iteration. In this approach the policy improvement operation combines feature-based aggregation with feature construction using deep neural networks or other calculations. We argue that the cost function of a policy may be approximated much more accurately by the nonlinear function of the features provided by aggregation, than by the linear function of the features provided by neural networkbased reinforcement learning, thereby potentially leading to more effective policy improvement. 展开更多
关键词 REINFORCEMENT learning dynamic programming Markovian decision problems AGGREGATION feature-based ARCHITECTURES policy ITERATION DEEP neural networks rollout algorithms
下载PDF
Decentralized control in active distribution grids via supervised and reinforcement learning
2
作者 Stavros Karagiannopoulos Petros Aristidou +1 位作者 Gabriela Hug Audun Botterud 《Energy and AI》 EI 2024年第2期137-149,共13页
While moving towards a low-carbon, sustainable electricity system, distribution networks are expected to host a large share of distributed generators, such as photovoltaic units and wind turbines. These inverter-based... While moving towards a low-carbon, sustainable electricity system, distribution networks are expected to host a large share of distributed generators, such as photovoltaic units and wind turbines. These inverter-based resources are intermittent, but also controllable, and are expected to amplify the role of distribution networks together with other distributed energy resources, such as storage systems and controllable loads. The available control methods for these resources are typically categorized based on the available communication network into centralized, distributed, and decentralized or local. Standard local schemes are typically inefficient, whereas centralized approaches show implementation and cost concerns. This paper focuses on optimized decentralized control of distributed generators via supervised and reinforcement learning. We present existing state-of-the-art decentralized control schemes based on supervised learning, propose a new reinforcement learning scheme based on deep deterministic policy gradient, and compare the behavior of both decentralized and centralized methods in terms of computational effort, scalability, privacy awareness, ability to consider constraints, and overall optimality. We evaluate the performance of the examined schemes on a benchmark European low voltage test system. The results show that both supervised learning and reinforcement learning schemes effectively mitigate the operational issues faced by the distribution network. 展开更多
关键词 Supervised learning Reinforcement learning Deep deterministic policy gradient Decentralized control Active distribution systems
原文传递
基于红外相机数据估算浙江省开化县野猪种群数量 被引量:2
3
作者 陈小南 田佳 +5 位作者 刘鸣章 申云逸 余建平 刘锋 申小莉 李晟 《兽类学报》 CAS CSCD 北大核心 2023年第5期523-532,共10页
野猪(Sus scrofa)具有较强的繁殖力和对多种环境的适应能力,近年来在我国多地出现种群明显增长,因野猪损毁农林作物、伤人而引发的人兽冲突事件频发。为针对野猪的计划性捕猎与种群管理提供科学支持,本研究以浙江省开化县为例,基于红外... 野猪(Sus scrofa)具有较强的繁殖力和对多种环境的适应能力,近年来在我国多地出现种群明显增长,因野猪损毁农林作物、伤人而引发的人兽冲突事件频发。为针对野猪的计划性捕猎与种群管理提供科学支持,本研究以浙江省开化县为例,基于红外相机实地调查数据对野猪种群数量进行估算。选用开化县2020年9—10月429个位点的红外相机监测数据,包括有效相机工作日23690 d,共记录到964次野猪的独立探测。利用Royle-Nichols模型,结合地形、植被、人类影响的环境变量因子和野猪家域面积参数,估计了野猪种群密度及分布。结果显示,研究区内野猪数量为(5548±2343)头(mean±95%C.I.);在与野猪家域面积大小相当的网格中,野猪的平均种群密度为每平方千米(2.38±0.61)头(mean±SD);野猪密度与海拔和居民点密度呈负相关关系,与森林覆盖率呈弱正相关关系。基于以上结果,结合实际管理需求,建议研究区域内的野猪捕猎活动选择在其种群密度较高的农田、种植园以及森林与农田交错区域开展。当地主管部门与钱江源国家公园未来应依托系统布设的红外相机监测网络,整合捕猎获取的个体信息记录和其他来源的数据,修正和完善种群动态模型,建立长期种群动态监测机制。 展开更多
关键词 野猪 种群管理 人兽冲突 计划性捕猎 种群密度估计 红外相机
下载PDF
Resilient and Inclusive Prosperity within Planetary Boundaries
4
作者 Cameron Hepburn Eric Beinhocker +1 位作者 J.Doyne Farmer Alexander Teytelboym 《China & World Economy》 SCIE 2014年第5期76-92,共17页
The current model of economic growth generated unprecedented increases in human wealth and prosperity during the 19th and 2Oth centuries. The main mechanisms have been the rapid pace of technological and social innova... The current model of economic growth generated unprecedented increases in human wealth and prosperity during the 19th and 2Oth centuries. The main mechanisms have been the rapid pace of technological and social innovation, human capital accumulation, and the conversion of resources and natural capital into more valuable forms of produced capital. However, there is evidence emerging that this model may be approaching environmental limits and planetary boundaries, and that the conversion of natural capital needs to slow down rapidly and then be reversed Some commentators have asserted that in order for this to occur, we will need to stop growing altogether and, instead, seek prosperity without growth. Others argue that environmental concerns are low-priority luxuries to be contemplated once global growth has properly returned to levels observed prior to the 2008 financial crisis. A third group argues that there is no trade-off and, instead,, promotes green growth: the (politically appealing) idea is that we can simultaneously grow and address our environmental problems. This paper provides a critical perspective on this debate and suggests that asubstantial researc'h agenda is required to come to grips with these challenges. One place to start is with the relevant metrics: measures of per-capitawealth, and, eventually, quantitative measures of prosperity, alongside a dashboard of other sustainability indicators. A public andpoliticalfocus on wealth (a stock), and its annual changes, could realistically complement the current focus on market-based gross output as measured by GDP (a flow). This could have important policy implications, but deeper changes to governance and business models will be required. 展开更多
关键词 green growth limits to growth planetary boundaries RESILIENCE sustainable prosperity
原文传递
Consensus with preserved privacy against neighbor collusion
5
作者 Silun Zhang Thomas Ohlson Timoudas Munther ADahleh 《Control Theory and Technology》 EI CSCD 2020年第4期409-418,共10页
This paper proposes a privacy-preserving algorithm to solve the average-consensus problem based on Shamir's secret shar-ing scheme,in which a network of agents reach an agreement on their states without exposing t... This paper proposes a privacy-preserving algorithm to solve the average-consensus problem based on Shamir's secret shar-ing scheme,in which a network of agents reach an agreement on their states without exposing their individual states until an areement is reached.Unlike other methods,the proposed algoritm renders the network resitant to the cllusion of any given number of nighbors(even with all nighbor'clluling).Another virtue of this work is that such a method can protect the network consensus procedure from eavesdropping. 展开更多
关键词 Privacy preserving consensus Cyber security Network control Secret sharing scheme
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部