one blocker to continuing is being unsure how to generate novel voices with whisperspeech

(another is finding this intense)

maybe i’ll see if there are speaker embeddings in there somewhere that i could feed random numbers to, or gently mutate, or something
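
a rough sketch of what "random numbers or gentle mutation" could look like, assuming the speaker embedding turns out to be a plain fixed-length float vector. nothing here touches whisperspeech itself: `EMB_DIM = 192`, `random_speaker` and `mutate_speaker` are made-up names and values for illustration, and whether the model accepts vectors like these (or sounds any good with them) is untested.

```python
import numpy as np

# ASSUMPTION: embedding length is a guess, check the real model for its size.
EMB_DIM = 192


def random_speaker(rng, dim=EMB_DIM):
    """Hypothetical helper: draw a random unit-length vector as a 'voice'."""
    v = rng.standard_normal(dim)
    return v / np.linalg.norm(v)


def mutate_speaker(emb, rng, amount=0.1):
    """Hypothetical helper: nudge an embedding in a random direction.

    `amount` is the size of the nudge relative to the embedding's own
    length; the result is rescaled so its norm matches the original.
    """
    emb = np.asarray(emb, dtype=np.float64)
    norm = np.linalg.norm(emb)
    noise = rng.standard_normal(emb.shape)
    noise /= np.linalg.norm(noise)           # unit-length random direction
    mutated = emb / norm + amount * noise    # small step away from the original
    return mutated / np.linalg.norm(mutated) * norm
```

starting from an embedding extracted from a real voice and mutating it with a small `amount` is probably a safer first try than pure random vectors, since random points may land far from anything the model saw in training.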