[ot][spam][crazy][data] transformer model 'attention' improvement

k gmkarl at gmail.com
Wed Jan 26 06:49:36 PST 2022


hum second run the other was faster, by about the same amount, so
maybe they're comparable.

anyway next step is to add aminrezaei to huggingface's perceiver
implementation.  maybe in a base class.


More information about the cypherpunks mailing list