MESSAGE
DATE | 2007-11-16 |
FROM | Ron Guerin
|
SUBJECT | Re: [NYLXS - HANGOUT] Website Updates
|
Kevin Mark wrote: > On Thu, Nov 15, 2007 at 06:32:01PM -0500, Ruben Safir wrote: >> I'm busy rewriting a lot of the functionality of the NYLXS and the Freeom-IT >> websites. It's been a bit longer to do that I thought, much because of the >> first step is parsing all the old mail boxes I have and pulling out all the >> HANGOUT mailings to store in a database. This is well over 700,000 lines of >> data (thanks for the binaries in the mail guys!). >> >> This makes testing go slower than I'd like and the large sample is important >> for the purpose of shaking out my regex routines on the From lines in the >> mboxes. > Whats the problem with parsing email from mail boxes into a database?
I can think of one. Speaking from pain of experience, you need to maintain the same level of privacy that your posters expected when they posted to the list. That's possibly going to be a problem when the expectations on this list in regard to an archive were pretty restrictive.
Oh! Make that two. You need to honor the "X-No-Archive: yes" headers unless you had a stated policy about not honoring it. We no longer honor it on nylug-talk, and I know we lost one fairly active list member who left after we announced that.
- Ron
|
|