MESSAGE
DATE | 2003-05-22 |
FROM | Marco Scoffier
|
SUBJECT | Subject: [hangout] mbox_splitter.py
|
Well this is the first time I have done this, and I am a little wary about the scrutiny of wide usage. But, I wrote a small script for my own usage that I think might be useful for many of you.
mbox_splitter.py : splits the mailboxes it recieves as arguements into user defined sizes (say -S=300000 bytes or about 3M).
It also strips out the attachments which are larger than a certain size and saves them to a seperate file. It also saves only a specific set of headers, so you keep only the imporant ones if you wish (this can reduce the size of an mbox by 20-50%).
Basically I have procmail sort my incoming mails into several mboxes. Because I am on so many lists these mboxes tend to get really large, to the point where it is tough to even open them. So, wrote this script which I run in a batch:
mbox_splitter.py ~/Mail/*
To download the code and view some examples which include the output of parsing the 19M of hangout we have received since Feb 12th. please visit:
http://marco.metm.org/code/
Oh and it's python so it is pretty slow.
-- Marco ____________________________ NYLXS: New Yorker Free Software Users Scene Fair Use - because it's either fair use or useless.... NYLXS is a trademark of NYLXS, Inc
|
|