[spam][crazy][ot] MCBoss Spinoffs 3

Karl Semich 0xloem at gmail.com
Sun Jun 18 13:55:05 PDT 2023


it’s not immediately apparent where to find documentation on training
in ggml but here’s the optimizer test code which performs a single
optimization pass which might be sufficient if it actually works when
nobody seems to be using it:
https://github.com/ggerganov/ggml/blob/4e1e135b749d2e9a2bf9abf7aa34691d1f95320d/tests/test-opt.c#L117

a further idea is to learn how metamodels encode and decode model
weights, to speed online training. maybe i can find a paper mentioning
this.


More information about the cypherpunks mailing list