[ot][spam][crazy][data] transformer model 'attention' improvement

k gmkarl at gmail.com
Tue Jan 25 11:26:48 PST 2022


This is the transcription of the code I'm first trying with.  I
haven't tested it yet.  I need to generate some data with appropriate
dimensions, and I'm somewhat new to jax.numpy .
-------------- next part --------------
A non-text attachment was scrubbed...
Name: chunked_attention.py
Type: text/x-python
Size: 2285 bytes
Desc: not available
URL: <https://lists.cpunks.org/pipermail/cypherpunks/attachments/20220125/87c81554/attachment.py>


More information about the cypherpunks mailing list