A team of Stanford and Berkeley researchers has some interesting research coming out on large-scale de-anonymization of blog posts based on writing style, i.e. stylometry. For a given blog post, the researchers were able to correctly identify the individual author from among 100,000 candidates 20% of the time. However, their method does not work if authors deliberately obfuscate their writing style.

Here's the paper draft:
http://randomwalker.info/publications/author-identification-draft.pdf

And a blog post about it:
http://33bits.org/2012/02/20/is-writing-style-sufficient-to-deanonymize-mate...

_______________________________________________
liberationtech mailing list
liberationtech@lists.stanford.edu

----- End forwarded message -----
-- 
Eugen Leitl http://leitl.org