Deep Q-network (DQN) and Q-learning
Organizer
Speaker
Yangtianze Tao
Time
Tuesday, September 13, 2022 3:00 PM - 3:30 PM
Venue
Online
Tencent Meeting ID: 735 7908 4302
Abstract
In this talk, we describe how to approximate the optimal action-value function with a neural network, known as a deep Q-network (DQN). We also introduce the temporal difference (TD) algorithm used to train the DQN.
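As a rough illustration of the TD idea mentioned in the abstract, the sketch below computes a one-step Q-learning TD target and moves a single estimate toward it. This is not the speaker's implementation: the function names `td_target` and `td_update`, the discount `gamma`, and the step size `lr` are assumptions made for illustration, and a plain number stands in for the output of the neural network.

```python
def td_target(reward, next_q_values, gamma=0.99, done=False):
    """One-step Q-learning TD target: r + gamma * max over a' of Q(s', a').

    next_q_values holds the estimated Q(s', a') for every action a'
    (in a DQN these would be the network outputs for the next state).
    """
    if done:
        # No future reward after a terminal transition.
        return reward
    return reward + gamma * max(next_q_values)


def td_update(q_value, target, lr=0.1):
    """Move the current estimate Q(s, a) a fraction lr toward the TD target."""
    td_error = target - q_value
    return q_value + lr * td_error


# Hypothetical transition: reward 1.0, two actions available in the next state.
target = td_target(reward=1.0, next_q_values=[0.5, 2.0], gamma=0.9)
new_q = td_update(q_value=2.0, target=target, lr=0.5)
```

In a DQN the same TD error would typically drive a gradient step on the network weights rather than a direct update of a stored value.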