Bandwidth limitations, DNA binary coding

Perry E. Metzger pmetzger at lehman.com
Fri Nov 12 18:49:56 PST 1993



VACCINIA at uncvx1.oit.unc.edu says:
>         There are four base pair combinations, but HUGO (Human Genome 
> Organization) has elected to use 15 letter symbols in it's representation
> of the genome coding sequence (X is any base, for instance). 15 symbols, 
> 1 byte.

15 symbols, HALF a byte (actually a touch less.) One nybble can
express 16 possible symbols (or one Hex digit, or whatever.) 

Plus, of course, the genome is highly compressable -- lots of repeated
sequences, especially in interons.

Perry






More information about the cypherpunks-legacy mailing list