[crazy][hobby][spam] Automated Reverse Engineering

k gmkarl at gmail.com
Tue Jan 25 01:40:56 PST 2022


- a large T5 model could be tpu compiled on colab notebooks by calling
pmap() on individual blocks rather than the whole model
- much larger models could be trained by masking the training weights
to reduce autograd memory load as has been done for at-home training
of large text generation models


More information about the cypherpunks mailing list