[ot][spam][crazy] Quickly autotranscribing xkcd 4/1 correctly

Undiscussed Horrific Abuse, One Victim of Many gmkarl at gmail.com
Sat Apr 2 04:16:26 PDT 2022


Looks like commonvoice will take about an hour to download on google's servers.

That's confusing to me.

Thoughts:
- this approach would be great for recovering whispered speech from
poor recordings. You'd finetune off of multiple microphones.
- it looks like it would be pretty effective to finetune this small
model around some of the correct output. its initial output is very
nearly correct.
- there's value around combining non-model structures here, such as
the heuristic mentioned earlier. this can take more mental steps for
me; i have to engage my inhibitions  differently.


More information about the cypherpunks mailing list