Re: [ot][spam][crazy] Being a tiny part of a goal

2 Apr 2022


      Here are some code chunks:

from tokenizers import Tokenizer
from tokenizers.models import BPE

tokenizer = Tokenizer(BPE(unk_token="[UNK]"))

# To train our tokenizer on the wikitext files, we will need to
instantiate a trainer, in this case a BpeTrainer

from tokenizers.trainers import BpeTrainer

trainer = BpeTrainer(special_tokens=["[UNK]", "[CLS]", "[SEP]",
"[PAD]", "[MASK]"])

Re: [ot][spam][crazy] Being a tiny part of a goal

Undiscussed Horrific Abuse, One Victim of Many