[ot][spam] Behavior Log For Control Data: HFRL Unit 1 Lab

Undiscussed Horrific Abuse, One Victim of Many gmkarl at gmail.com
Fri Jun 24 06:13:03 PDT 2022


0912

This is the solution:

# SOLUTION
# Train it for 500,000 timesteps
model.learn(total_timesteps=500000)
# Save the model
model_name = "ppo-LunarLander-v2"
model.save(model_name)

Unlike the model construction step, the solution adds no extra
explanation here. There is indeed much less additional information
available for this step.

While waiting for the training to complete, I will go on to the
next section and look at it. I am also considering looking up the
parameters I did not recognize in the model construction.

I am most used to learning by reading the source code of the API
function being called directly.
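
A minimal sketch of that habit, assuming only Python's standard
inspect module. The helper name show_api is made up for illustration,
and json.dumps is a stand-in so the example runs without the course
libraries installed.

```python
import inspect
import json  # stand-in target; any importable pure-Python function works the same way


def show_api(func, max_lines=20):
    """Return the signature of func and the first lines of its source.

    show_api is a hypothetical helper for this log, not part of any library.
    """
    signature = str(inspect.signature(func))
    source_lines = inspect.getsource(func).splitlines()
    return signature, "\n".join(source_lines[:max_lines])


# Look at the parameters and defaults first, then at the start of the source.
signature, source_head = show_api(json.dumps)
print("signature:", signature)
print(source_head)
```

If the model object from the lab is in scope, the same helper could
presumably be called as show_api(model.learn). That only works when the
library ships its Python source files, which I am assuming here.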

