I just checked to see how much spam I've been catching with SpamAssassin lately. It'd been a while since I last did this (October 26th according to the message headers).
Well, the mbox file was 649MB in size and contained 56,652 messages.
Christ on toast! That's an annual personal spam volume of over 1GB/year. It's a good thing the government passed that law, otherwise I'd... wait. That law didn't do a damned bit of good, did it?
gzip -9 took it down to 236MB, proving once again that spam doesn't compress very well.
Sigh.
Update: As Craig suggested in the comments, I tried rzip. It got the file down to 70MB. Nice!
Posted by jzawodn at March 21, 2004 12:37 PM
Spam's generally not super-compressible within a message, but you might find better compressability between messages, and these messages might be far-separated in your archive if it's so large. I'd try seeing if maybe rzip gives you better compression. http://rzip.samba.org/
Why do you save all your spam? Might need a better interest rate in the future? :P.
Then again, you might just need some good ol' porn.
Hey, AOL says their spam rate is dropping (in the midst of a total decline in volume). Maybe this is a harbinger of good things to come?
Bah lossless compression! Try lzip. Never look back again. Make your hard drive feel bigger than it really is.