Navigated most recently to https://huggingface.co/docs/tokenizers/python/latest/quicktour.html#training-the-tokenizer . Karl's unsure whether it is faster to train a sentencepiece model from scratch or "learn huggingface's boilerplate".