[ot][spam][crazy][data] transformer model 'attention' improvement
Undiscussed Horrific Abuse, One Victim & Survivor of Many
gmkarl at gmail.com
Tue Feb 1 13:26:22 PST 2022
- torch tensors do not allocate new memory when expanded: Tensor.expand
is documented as returning a view of the original tensor's storage
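
a minimal sketch for checking the point above, assuming a reasonably recent
PyTorch install. the variable names are illustrative only and are not from
the model code under discussion. it compares data pointers and strides
before and after an expand, then forces a real copy for contrast:

```python
import torch

# Small base tensor of shape (3, 1): three stored elements.
base = torch.arange(3.0).reshape(3, 1)

# expand() broadcasts the size-1 dimension to 4 without copying data.
expanded = base.expand(3, 4)

# Both tensors start at the same memory address.
same_storage = expanded.data_ptr() == base.data_ptr()

# The expanded dimension has stride 0, so every column reads the same element.
strides = expanded.stride()

# A write through the base is visible through the expanded view.
base[0, 0] = 10.0
seen_through_view = expanded[0, 3].item()

# contiguous() materializes the expanded shape, which does allocate new memory.
copied = expanded.contiguous()
copy_is_separate = copied.data_ptr() != base.data_ptr()

print(same_storage, strides, seen_through_view, copy_is_separate)
```

the stride of 0 is what makes the expansion free; any operation that needs
a dense layout (such as contiguous()) will pay for the full size.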
next: jax tensors, and model logic
More information about the cypherpunks mailing list