I'm thinking I'd like to try training a bytestokenizer for bigbird and extend its sequence length to entire binaries.  I expect the result to be about 30% successful given my lack of experience and time.