[ot][spam][crazy][data] transformer model 'attention' improvement

Undiscussed Horrific Abuse, One Victim & Survivor of Many gmkarl at gmail.com
Tue Feb 1 13:26:22 PST 2022


- torch tensors do not allocate new memory when expanded: Tensor.expand()
is documented as returning a view of the original tensor
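
a quick way to check the note above, as a minimal sketch: it assumes a
recent PyTorch install, and the variable names and the 3x4 shape are made
up for illustration. it compares data pointers and strides, then
contrasts expand() with repeat(), which does copy.

```python
import torch

# A small base tensor with a singleton dimension to broadcast over.
base = torch.arange(3, dtype=torch.float32).reshape(3, 1)

# expand() broadcasts the singleton dimension without copying data.
expanded = base.expand(3, 4)

# Same underlying memory: the data pointers match.
shares_memory = expanded.data_ptr() == base.data_ptr()

# The expanded dimension has stride 0, so every column reads the same element.
strides = expanded.stride()

# A write through the base is visible through the expanded view.
base[0, 0] = 42.0
seen_through_view = expanded[0, 3].item()

# For contrast, repeat() materializes a real copy in new memory.
copied = base.repeat(1, 4)
copy_shares_memory = copied.data_ptr() == base.data_ptr()

print(shares_memory, strides, seen_through_view, copy_shares_memory)
```

the stride of 0 is what makes this free: indexing along the expanded
dimension never moves the read position. in-place writes to the expanded
view itself are best avoided, since several positions alias one element.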

next: jax tensors, and model logic

