18 Jun
2023
18 Jun
'23
8:55 p.m.
it’s not immediately apparent where to find documentation on training in ggml but here’s the optimizer test code which performs a single optimization pass which might be sufficient if it actually works when nobody seems to be using it: https://github.com/ggerganov/ggml/blob/4e1e135b749d2e9a2bf9abf7aa34691d1f953... a further idea is to learn how metamodels encode and decode model weights, to speed online training. maybe i can find a paper mentioning this.