19 Jun
2023
12:12 a.m.
ggml provides an opt function that runs the whole training loop internally, for the number of iterations set in its parameters. this is why the example code takes so many minutes to complete: it’s running the opt pass 10k times on a single core. at a glance it looks like the performance stats read near zero because they are disabled. the example drops the loss of a single layer with on the order of 1k params to about 40% of its starting value in a few minutes. a smidge frustrating that it black-boxes the training loop.
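to make the complaint concrete, here is a rough sketch in plain python, not ggml code. every name in it (`opt`, `step`, `loss_and_grad`, `make_data`) is hypothetical, and the toy problem (a small linear layer fit with plain gradient descent) is an assumption chosen only to show the difference between an optimizer that owns the loop and one that hands you a single step.

```python
import random

def make_data(n_samples=64, n_features=8, seed=0):
    # Toy regression data from a hidden weight vector (illustrative only).
    rng = random.Random(seed)
    true_w = [rng.uniform(-1, 1) for _ in range(n_features)]
    xs = [[rng.uniform(-1, 1) for _ in range(n_features)]
          for _ in range(n_samples)]
    ys = [sum(w * x for w, x in zip(true_w, row)) for row in xs]
    return xs, ys

def loss_and_grad(w, xs, ys):
    # Mean squared error of a single linear layer, and its gradient w.r.t. w.
    n = len(xs)
    grad = [0.0] * len(w)
    loss = 0.0
    for row, y in zip(xs, ys):
        err = sum(wi * xi for wi, xi in zip(w, row)) - y
        loss += err * err / n
        for j, xj in enumerate(row):
            grad[j] += 2.0 * err * xj / n
    return loss, grad

def step(w, xs, ys, lr=0.05):
    # One gradient-descent update; returns new weights and the pre-update loss.
    loss, grad = loss_and_grad(w, xs, ys)
    return [wi - lr * gi for wi, gi in zip(w, grad)], loss

def opt(w, xs, ys, n_iter, lr=0.05):
    # Black-box style: the caller sets n_iter and only sees the final weights.
    for _ in range(n_iter):
        w, _ = step(w, xs, ys, lr)
    return w

xs, ys = make_data()
w0 = [0.0] * 8

# Black-boxed: nothing is observable between iterations.
w_blackbox = opt(w0, xs, ys, n_iter=200)

# Explicit loop: same math, but the loss can be logged, stopped early, etc.
w_manual = w0
history = []
for _ in range(200):
    w_manual, loss = step(w_manual, xs, ys)
    history.append(loss)
```

both paths do identical work; the only difference is who owns the loop. with the explicit version you can inspect `history`, add early stopping, or change the learning rate mid-run, which is exactly what a single opt call with an iteration count hides.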