one blocker to continuing is being unsure how to generate novel voices with whisperspeech
(another is finding this intense)
maybe i’ll see if there are speaker embeddings in there somewhere i could feed random numbers to or gently mutate or something