MESSAGE
DATE | 2007-11-16 |
FROM | Kevin Mark
SUBJECT | Re: [NYLXS - HANGOUT] Website Updates
On Fri, Nov 16, 2007 at 03:24:40PM -0500, Ron Guerin wrote:
> Kevin Mark wrote:
> > On Thu, Nov 15, 2007 at 06:32:01PM -0500, Ruben Safir wrote:
> >> I'm busy rewriting a lot of the functionality of the NYLXS and the Freeom-IT
> >> websites. It's been a bit longer to do than I thought, much because of the
> >> first step is parsing all the old mail boxes I have and pulling out all the
> >> HANGOUT mailings to store in a database. This is well over 700,000 lines of
> >> data (thanks for the binaries in the mail guys!).
> >>
> >> This makes testing go slower than I'd like and the large sample is important
> >> for the purpose of shaking out my regex routines on the From lines in the
> >> mboxes.
> > What's the problem with parsing email from mail boxes into a database?
>
> I can think of one. Speaking from pain of experience, you need to
> maintain the same level of privacy that your posters expected when they
> posted to the list. That's possibly going to be a problem when the
> expectations on this list in regard to an archive were pretty restrictive.
>
> Oh! Make that two. You need to honor the "X-No-Archive: yes" headers
> unless you had a stated policy about not honoring it. We no longer
> honor it on nylug-talk, and I know we lost one fairly active list member
> who left after we announced that.
>
> - Ron

Hmm. OK. But Ruben didn't specify any additional parameters other than
just getting it into a DB.
-K

ps. cool that you made a smaller video of the talk available this time ;-)

--
|  .''`.  == Debian GNU/Linux == |       my web site:           |
| : :' :      The  Universal     |mysite.verizon.net/kevin.mark/|
| `. `'      Operating System    | go to counter.li.org and     |
|   `-    http://www.debian.org/ |    be counted! #238656       |
|  my keyserver: subkeys.pgp.net |     my NPO: cfsg.org         |
|join the new debian-community.org to help Debian!              |
|_______  Unless I ask to be CCd, assume I am subscribed _______|
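
For anyone following along, here is a rough sketch of the two points raised in the thread: pulling messages out of an mbox into a database, and skipping anything marked "X-No-Archive: yes". It is not Ruben's code (his approach uses his own regex routines on the From lines); it assumes Python's standard mailbox and sqlite3 modules instead. The table name `hangout`, its columns, and the function names `load_mbox` and `wants_no_archive` are all made up for illustration.

```python
import mailbox
import sqlite3


def header(msg, name):
    """Return a header as a plain string, or None if it is absent."""
    value = msg.get(name)
    return None if value is None else str(value)


def wants_no_archive(msg):
    """True when the poster set X-No-Archive: yes (case-insensitive)."""
    value = header(msg, "X-No-Archive") or ""
    return value.strip().lower().startswith("yes")


def load_mbox(mbox_path, conn, honor_no_archive=True):
    """Copy message headers from an mbox file into a SQLite table.

    Returns a (stored, skipped) tuple. The schema is hypothetical.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS hangout "
        "(msg_id TEXT, sender TEXT, subject TEXT, sent TEXT)"
    )
    stored = skipped = 0
    box = mailbox.mbox(mbox_path)
    try:
        for msg in box:
            # Respect the poster's request unless list policy says otherwise.
            if honor_no_archive and wants_no_archive(msg):
                skipped += 1
                continue
            conn.execute(
                "INSERT INTO hangout VALUES (?, ?, ?, ?)",
                (
                    header(msg, "Message-ID"),
                    header(msg, "From"),
                    header(msg, "Subject"),
                    header(msg, "Date"),
                ),
            )
            stored += 1
    finally:
        box.close()
    conn.commit()
    return stored, skipped
```

The `honor_no_archive` flag is there because, as Ron notes, whether to honor the header is a list policy decision rather than a technical one. The privacy expectations of the original posters still have to be handled separately, for example by restricting who can read the resulting database.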