Check out the OpenThreads project[1][2]. In addition to being just generally awesome, it's also a starting point if you want to parse out the topics in an entire mailing list. It was written by a colleague who's very interested in gender diversity in tech, so the project is mostly known for researching what pronouns are used most often on LibTech, and also for posting a list of the most common names mentioned (the top ten are all dudes). Caution: some assembly required. happy hacking ;) Griffin [1] https://github.com/elationfoundation/openThreads/wiki/A-journey-in-Regular-Expressions-and-ListServ's <https://github.com/elationfoundation/openThreads/wiki/A-journey-in-Regular-Expressions-and-ListServ%27s> [2] https://github.com/elationfoundation/openThreads Jayvan Santos wrote:
Is there an open algorithm that keeps a summarization of what normally gets posted here? Pointing out the topics, references, commonly used terms?
"Joseph Holsten" <joseph@josephholsten.com <mailto:joseph@josephholsten.com>> wrote:
It does beat the spam (which might just be stego, but still). +1 to dadaist cpunkery. -- ~j
-- "Cypherpunks write code not flame wars." --Jurre van Bergen #Foucault / PGP: 0xAE792C97 / OTR: saint@jabber.ccc.de My posts are my own, not my employer's.