24 Jun
2022
24 Jun
'22
2:22 p.m.
1021 I have read that the unit is divided into 2 parts. The first part relates to learning about value-based methods, and the second part to Q-learning. I have also read that two environments will be solved, and that both involving navigating a small grid with an agent.