24 Jun 2022, 2:22 p.m.
I have read that the unit is divided into two parts. The first part covers value-based methods, and the second covers Q-learning. I have also read that two environments will be solved, and that both involve navigating a small grid with an agent.