An introduction to optimal control theory
This course introduces the fundamental theory and applications of Optimal Control. The primary objective is to equip students with the tools to analyze and solve problems where dynamic systems must be operated in an optimal manner. The curriculum is unified by the Dynamic Programming (DP) approach pioneered by Richard Bellman. We will explore how the Bellman Equation (or Hamilton-Jacobi-Bellman equation) provides sufficient conditions for optimality. The course is divided into two halves: the first covers discrete-time systems, including deterministic, stochastic, and Markov Control Processes (MCPs) over finite and infinite horizons. The second half transitions to continuous-time systems, covering ordinary differential equations, general MCPs, and diffusion processes. By the end, students will understand the advantages of DP, including its role in generating feedback controls and its connection to approximation algorithms like value and policy iteration, which are foundational to modern fields like reinforcement learning.
讲师
日期
2026年03月11日 至 06月24日
位置
| Weekday | Time | Venue | Online | ID | Password |
|---|---|---|---|---|---|
| 周三 | 14:20 - 16:55 | A3-1-101 | ZOOM 05 | 293 812 9202 | BIMSA |
修课要求
Linear algebra, probabilistic theory, calculus, optimization
参考资料
Athans, M., & Falb, P. L. (2013). Optimal control: an introduction to the theory and its applications. Courier Corporation.
Berkovitz, L. D. (2013). Optimal control theory. Springer Science & Business Media.
Kirk, D. E. (2004). Optimal control theory: an introduction. Courier Corporation.
Berkovitz, L. D. (2013). Optimal control theory. Springer Science & Business Media.
Kirk, D. E. (2004). Optimal control theory: an introduction. Courier Corporation.
听众
Undergraduate
, Advanced Undergraduate
, Graduate
, 博士后
, Researcher
视频公开
不公开
笔记公开
不公开
语言
中文
, 英文
讲师介绍
焦小沛,于2017年本科毕业于上海交通大学致远学院(物理班),2022年博士毕业于清华大学数学科学系,师从丘成栋教授(IEEE fellow,前美国伊利诺伊大学芝加哥分校终身教授)。先后在北京雁栖湖应用数学研究院,荷兰特文特大学从事博士后工作(导师Johannes Schmidt-Hieber教授,国际数理统计学会会士)。现研究方向包括控制理论,随机滤波,数值微分方程,物理信息深度学习。获得2025年国家青年科学基金[C类]资助。