16 Feb
2023
16 Feb
'23
5:42 p.m.
re Gym similarity, the huggIngface dt takes a vector of floats for both observation and action so that's simplifiable and i updated it to use a vector of rewards