行业报告详情 - 行业报告数据库

行业分类

找到报告 1 篇当前为第 1 页共 1 页

反事实强化学习：如何模拟决策层预见未来

Counter-Factual Reinforcement Learning: How to Model Decision-Makers That Anticipate the Future

作者：Ritchie LeeDavid H. WolpertJames BonoScott BackhausRussell BentBrendan Tracey 加工时间：2014-09-22 信息来源：科技报告（Other）

关键词：计算；电网；网络战；K推理
摘要：This chapter introduces a novel framework for modeling interacting humans in a multi-stage game. This "iterated semi network-form game" framework has the following desirable characteristics: (1) Bounded rational players, (2) strategic players (i.e., players account for one another's reward functions when predicting one another's behavior), and (3) computational tractability even on real-world systems. We achieve these benefits by combining concepts from game theory and reinforcement learning. To be precise, we extend the bounded rational "level-K reasoning" model to apply to games over multiple stages. Our extension allows the decomposition of the overall modeling problem into a series of smaller ones, each of which can be solved by standard reinforcement learning algorithms. We call this hybrid approach "level-K reinforcement learning". We investigate these ideas in a cyber battle scenario over a smart power grid and discuss the relationship between the behavior predicted by our model and what one might expect of real human defenders and attackers.

行业分类

友情链接

联系我们

QQ咨询

电话咨询

微信公众号

感谢访问