what seems nice for continuing might be considering the idea of mapping logits to how logoish they might be. one approach to that would be expanding a logo parse tree through all the tokenizer's vocab. then with some parallel data, simply backsolving from that parse tree, i'm thinking it could work. it's no longer a quick approach but it nicely demonstrates how the comic links the patterns of machine learning and software engineering together.