i added breakpoints on lines 100 and 413, which are the control paths through which it can exit the iteration, so i can examine the state and tell whether it exits the loop correctly. theoretically, i could hit 'cont' through every tag and see what happens.

meanwhile, i've been trying to pack the source code, and w3 hit a symbolic link loop that prevents packing. it got to 6 gigabytes before it bailed O_o; very large for this source code:

datagen$ w3 put .
⠙ Packing 20570 files (6349.2MB)
Error: ELOOP: too many symbolic links encountered, stat '/media/extradisk/src/codefudge/codefudge/datagen/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/tinytokenizers-094d245b8aabc62c/out/cxxbridge/crate/tinytokenizers/target/release/build/rayon-core-8ca1c07368254e7f/build_script_build-8ca1c07368254e7f.d'

looks like it's inside my tinytokenizers hack-build to access huggingface's rust tokenizer library within c++. the code isn't presently using tokenization at all.
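
to pin down which link is responsible, something like the sketch below might help. it's a rough python helper, not anything w3 provides: it walks a tree without following symlinks and reports every symlink that resolves to one of its own ancestor directories, which is the kind of link that makes a path repeat like the one in the error. the function name find_ancestor_symlinks is made up for illustration, and i'm only guessing that the culprit is a link under out/cxxbridge/crate pointing back at the crate directory.

```python
import os
import sys


def find_ancestor_symlinks(root):
    """Return (link_path, resolved_target) pairs for symlinks under root
    that resolve to a directory containing the link itself."""
    root = os.path.realpath(root)
    hits = []
    # followlinks=False keeps os.walk from descending through symlinks,
    # so the walk itself cannot get caught in the loop.
    for dirpath, dirnames, filenames in os.walk(root, followlinks=False):
        parent = os.path.realpath(dirpath)
        # symlinks to directories show up in dirnames, others in filenames
        for name in dirnames + filenames:
            path = os.path.join(dirpath, name)
            if not os.path.islink(path):
                continue
            target = os.path.realpath(path)
            # the link points at an ancestor if the target is a prefix
            # (component-wise) of the directory that holds the link
            if os.path.commonpath([parent, target]) == target:
                hits.append((path, target))
    return hits


if __name__ == "__main__":
    start = sys.argv[1] if len(sys.argv) > 1 else "."
    for link, target in find_ancestor_symlinks(start):
        print(f"{link} -> {target}")
```

running it from the datagen directory should list any such links. whether to delete the link, or keep the target/ directory out of the pack altogether, depends on what the packer supports, which i haven't checked.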