Just don't forget to use markov chains + blot-width inference to fill in the censored portions. :) On 31/01/15 02:29, coderman wrote:
On 1/30/15, grarpamp <grarpamp@gmail.com> wrote:
https://www.nsa-observer.net/ https://github.com/nsa-observer/
fyi, coderman et al.
thanks, checking them out. one thing i don't see mentioned is how the OCR was performed. same as Reuters DocumentCloud service, or open source tool, or ?
next bigsun update will demonstrate this challenge better, as i am using a handful of techniques for text extraction, character recognition, and annotation, as well. in a sense, this is how the sausage making gets started...
(i will see if there is a convenient way i can feed back out again, like to nsa-observer, since bigsun is intended to be operated entirely within hidden services - no public services, especially not github or document cloud)
best regards,
-- Twitter: @onetruecathal Phone: +353876363185 miniLock: JjmYYngs7akLZUjkvFkuYdsZ3PyPHSZRBKNm6qTYKZfAM peerio.com: Use email or phone. Uses above miniLock key.