a nice way to continue might be to map logits to how logo-ish they are.

one approach would be to expand a logo parse tree across the tokenizer's entire vocab.
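
as a rough sketch of what that could look like (an illustration under assumptions, not a worked-out method): the snippet below uses a toy word-level vocab and a tiny hand-picked subset of logo, both made up here, and scores a logit vector by how much softmax mass lands on tokens the parser would accept next. `accepts` and `logoishness` are hypothetical names, and a real tokenizer has subword pieces, so the check would also have to handle partial words.

```python
import math

# Toy subset of Logo, chosen for illustration only.
ARG_COMMANDS = {"forward", "back", "left", "right"}  # each takes one number
BARE_COMMANDS = {"penup", "pendown"}                 # take no argument


def accepts(prefix, tok):
    """Return True if `tok` can legally follow the token list `prefix`.

    Walks a small state machine over prefix + [tok]; an invalid prefix
    rejects every candidate token.
    """
    expect = "command"  # what the parser wants next
    depth = 0           # number of currently open '[' brackets
    for t in prefix + [tok]:
        if expect == "number":
            if not t.isdigit():
                return False
            expect = "command"
        elif expect == "repeat_count":
            if not t.isdigit():
                return False
            expect = "open"
        elif expect == "open":
            if t != "[":
                return False
            depth += 1
            expect = "command"
        else:  # expecting a command (or a closing bracket)
            if t in ARG_COMMANDS:
                expect = "number"
            elif t in BARE_COMMANDS:
                pass
            elif t == "repeat":
                expect = "repeat_count"
            elif t == "]" and depth > 0:
                depth -= 1
            else:
                return False
    return True


def logoishness(logits, vocab, prefix):
    """Softmax probability mass on tokens that keep `prefix` parseable."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    valid = sum(e for e, tok in zip(exps, vocab) if accepts(prefix, tok))
    return valid / sum(exps)


# Hypothetical vocab; uniform logits make the score a simple token count.
vocab = ["forward", "left", "repeat", "[", "]", "10", "90", "hello"]
uniform = [0.0] * len(vocab)
print(logoishness(uniform, vocab, []))           # 0.375 (3 of 8 tokens fit)
print(logoishness(uniform, vocab, ["forward"]))  # 0.25  (only the numbers fit)
```

the score is 1.0 when all the mass sits on tokens the grammar allows and drops toward 0 as mass leaks onto tokens like "hello".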

then, with some parallel data, i'm thinking you could simply backsolve from that parse tree.

it's no longer a quick approach, but it nicely demonstrates how the comic links the patterns of machine learning and software engineering.