2 Apr
2022
11:16 a.m.
Looks like Common Voice will take about an hour to download on Google's servers. That's confusing to me.

Thoughts:
- This approach would be great for recovering whispered speech from poor recordings. You'd fine-tune off of multiple microphones.
- It looks like it would be pretty effective to fine-tune this small model on some of the correct output. Its initial output is very nearly correct.
- There's value in combining non-model structures here, such as the heuristic mentioned earlier. This can take more mental steps for me; I have to engage my inhibitions differently.