MESSAGE
DATE | 2007-11-18 |
FROM | Ruben Safir
|
SUBJECT | Re: [NYLXS - HANGOUT] Website Updates
|
On Sun, Nov 18, 2007 at 01:08:12PM -0500, Ron Guerin wrote: > Ron Guerin wrote: > > Ruben Safir wrote: > >> From: ruben-at-mrbrklyn Sun Nov 18 01:22:56 2007 > >> > >> I think that literally written by sendmail when the mail is recieved > >> and entered on the first line of the mail, which is why the only > >> thing you can trust in your headers with regard to spam is the > >> very first line. > > > > I've gotta say, that Sendmail's choice of delimiter is almost grossly > > inappropriate since it makes it much harder than necessary to sort out > > message delimiters from From: headers. They could have and should have > > used any string that wasn't the letters "f","r","o","m" and a colon. > > You sure there's a colon there? >
There is no colon
> The best description of an mbox that I was able to turn up in Google, > suggests the delimiter you're showing above is invalid. > > A message encoded in mbox format begins with a From_ line, > continues with a series of non-From_ lines, and ends with a > blank line. A From_ line means any line that begins with > the characters F, r, o, m, space > > The final line is a completely blank line (no spaces or > tabs). Notice that blank lines may also appear elsewhere in > the message. If the last line of the message was a partial > line, it writes two newlines; otherwise it writes one. > > The From_ line always looks like From envsender date > moreinfo. envsender is one word, without spaces or tabs; it > is usually the envelope sender of the message. date is the > delivery date of the message. It always contains exactly 24 > characters in asctime format. moreinfo is optional; it may > contain arbitrary information. >
That should be the regex.
Meanwhile I have a new problem tonight. I have a dead computer in the livingroom. The soundcard lost all but two channels, so I cleaned the machine and now its not starting. Any clues on where to find an ATX mainboard that will accept a Duron 800 CPU.
Ruben > Between the From_ line and the blank line is a message in > RFC 822 format. > > >From quoting ensures that the resulting lines are not From_ > lines: the program prepends a > to any From_ line, > >From_ line, >>From_ line, >>>From_ line, etc. > > > HOW A MESSAGE IS READ > A reader scans through an mbox file looking for From_ lines. > Any From_ line marks the beginning of a message. The reader > should not attempt to take advantage of the fact that every > From_ line (past the beginning of the file) is preceded by a > blank line. > > Once the reader finds a message, it extracts a (possibly > corrupted) envelope sender and delivery date out of the > From_ line. It then reads until the next From_ line or end > of file, whichever comes first. It strips off the final > blank line and deletes the quoting of >From_ lines and > >>From_ lines and so on. The result is an RFC 822 message. > > - Ron
-- http://www.mrbrklyn.com - Interesting Stuff http://www.nylxs.com - Leadership Development in Free Software
So many immigrant groups have swept through our town that Brooklyn, like Atlantis, reaches mythological proportions in the mind of the world - RI Safir 1998
http://fairuse.nylxs.com DRM is THEFT - We are the STAKEHOLDERS - RI Safir 2002
"Yeah - I write Free Software...so SUE ME"
"The tremendous problem we face is that we are becoming sharecroppers to our own cultural heritage -- we need the ability to participate in our own society."
"> I'm an engineer. I choose the best tool for the job, politics be damned.< You must be a stupid engineer then, because politcs and technology have been attached at the hip since the 1st dynasty in Ancient Egypt. I guess you missed that one."
© Copyright for the Digital Millennium
|
|