北京雁栖湖应用数学研究院 北京雁栖湖应用数学研究院

  • 关于我们
    • 院长致辞
    • 理事会
    • 协作机构
    • 参观来访
  • 人员
    • 管理层
    • 科研人员
    • 博士后
    • 来访学者
    • 行政团队
  • 学术研究
    • 研究团队
    • 公开课
    • 讨论班
  • 招生招聘
    • 教研人员
    • 博士后
    • 学生
  • 会议
    • 学术会议
    • 工作坊
    • 论坛
  • 学院生活
    • 住宿
    • 交通
    • 配套设施
    • 周边旅游
  • 新闻
    • 新闻动态
    • 通知公告
    • 资料下载
关于我们
院长致辞
理事会
协作机构
参观来访
人员
管理层
科研人员
博士后
来访学者
行政团队
学术研究
研究团队
公开课
讨论班
招生招聘
教研人员
博士后
学生
会议
学术会议
工作坊
论坛
学院生活
住宿
交通
配套设施
周边旅游
新闻
新闻动态
通知公告
资料下载
清华大学 "求真书院"
清华大学丘成桐数学科学中心
清华三亚国际数学论坛
上海数学与交叉学科研究院
BIMSA > BIMSA AI Seminar Repeated Games and Reinforcement Learning
Repeated Games and Reinforcement Learning
组织者
邬荣领
演讲者
Feng Fu
时间
2024年07月24日 14:00 至 15:30
地点
A3-2a-302
线上
Zoom 559 700 6085 (BIMSA)
摘要
Very recently, the notion of cooperative AI has been advocated as one of the solutions to ensure beneficial AI technologies. In this light, reciprocal cooperation has been extensively studied using the Iterated Prisoner’s Dilemma (IPD) games. Despite the astronomically vast individual behavioral strategy space for IPD game interactions, the so-called Zero-Determinant (ZD) extortionate strategy is a set of rather simple memory-one strategies that can unilaterally set a linear relationship between itself and its opponent. This new finding of ZD strategies has greatly spurred new waves of work from diverse fields, such as network science, computer science, and applied mathematics, aiming to shed light on the robustness and resilience of cooperation through the natural selection of IPD strategies. However, one open issue remains to be fully addressed: can extortionate ZD strategies be outperformed at all, even in simple head-to-head IPD matches? We will use machine learning and analytical tools to search for strategies that are unyielding to opponents’ extortion (i.e., strategies that can make extortion backfire and even outperform ZD under certain conditions). This will bring a new perspective by combining game theory and AI techniques for optimizing winning strategies in non-zero-sum games.
演讲者介绍
Feng Fu is currently an associate professor of applied mathematics in the Department of Mathematics and holds an adjunct appointment in the Department of Biomedical Data Science at the Geisel School of Medicine, Dartmouth. Before joining Dartmouth in September 2015, he was a senior postdoc in theoretical biology under Sebastian Bonhoeffer at ETH Zurich starting in September 2012. Prior to that, he did his dissertation research and a subsequent postdoc training with Martin Nowak and Nicholas Christakis at Harvard University from 2007 to 2012. He is interested in evolutionary game theory with applications to real-world problems, including reinforcement learning dynamics, neuroscience, and behavioral epidemiology. He received his B.Sc. in Theoretical and Applied Mechanics from Fudan University in 2004 and his Ph.D. in Dynamics and Control from Peking University in 2010. He was the recipient of Dartmouth's Dean of the Faculty Award for Outstanding Mentoring and Advising in 2021.
北京雁栖湖应用数学研究院
CONTACT

No. 544, Hefangkou Village Huaibei Town, Huairou District Beijing 101408

北京市怀柔区 河防口村544号
北京雁栖湖应用数学研究院 101408

Tel. 010-60661855
Email. administration@bimsa.cn

版权所有 © 北京雁栖湖应用数学研究院

京ICP备2022029550号-1

京公网安备11011602001060 京公网安备11011602001060