Deep Q-network (DQN) and Q-learning

Organizer

Stephen S-T. Yau

Speaker

Yangtianze Tao

Time

Tuesday, September 13, 2022 3:00 PM - 3:30 PM

Venue

Online

Tencent 735 7908 4302 ()

Abstract

In this report, we will describe how to approximate the optimal action-value function with a neural network, which we call a deep Q-network (DQN). The time difference algorithm (TD) used to train the DQN will also be introduced.