1.1 Deep sequence model
Definition 1.1 (Informal). We use the term sequence model to refer to a parameterized map on sequences $y=f_\theta(x)$ where inputs and o...
Overview.
This post focuses on value functions in reinforcement learning.
We begin with a quick recap of the RL setup, then introduce state-value / action-va...
Overview. This post builds from the MDP framework to the RL objective and value functions, then contrasts pure RL with Imitation Learning (IL), focusing on B...