7 Jan
2023
7 Jan
'23
4:43 p.m.
the filesize of bigscience/bloomz-560m is 1.2G which fits if i delete RWKV. RWKV has a less reusable training interface atm. It does have a much, much longer input context though.