Feature-Based Aggregation and Deep Reinforcement Learning: A Survey and Some New Implementations

Source: Acta Automatica Sinica (English edition)

Abstract: In this paper we discuss policy iteration methods for approximate solution of a finite-state discounted Markov decision problem, with a focus on feature-based aggregation methods and their connection with deep reinforcement learning schemes.

Author: Dimitri P. Bertsekas

Affiliation: the Department of Electrical Engineering and Computer Science

Published in: Acta Automatica Sinica (English edition)

Publication date: 2019, Issue 1

Keywords: reinforcement learning, dynamic programming, Markovian decision problems, aggregati


Excerpt from the paper

In this paper we discuss policy iteration methods for approximate solution of a finite-state discounted Markov decision problem, with a focus on feature-based aggregation methods and their connection with deep reinforcement learning schemes. We introduce
