[clug] Spambayes

Nemo -earth native- nemo at cheeky.house.cx
Thu May 22 12:29:33 EST 2003


On Wed, May 21, 2003 at 12:15:19PM +1000, David Gibson did utter:
> > 
> > Personally, I'd be interested to see a comparison of the relative
> > effectiveness of bogofilter, spambayes, spamassassin, etc... for
> > example, train each system on the same block of spam and ham, and then
> > test each on a new block of email known to contain both ham and spam... 

I find I'm putting more and more thought into this... so, straw poll
time.

Q: What spamfiltering system do people use?

Q: Is it trainable?

Q: If it is, do you retrain it on ALL messages once they are classified
as spam/ham, or only on the false positives/negatives? (Or not at all?
Or something else?)
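
To make the two update policies in that last question concrete, here is a
rough sketch. ToyFilter and feed() are hypothetical names of mine, and the
word-counting classifier is only a stand-in: it is not how spambayes,
bogofilter or spamassassin score mail internally.

```python
import math
from collections import Counter


class ToyFilter:
    """Stand-in trainable filter (naive word counts with add-one smoothing)."""

    def __init__(self):
        self.counts = {True: Counter(), False: Counter()}
        self.totals = {True: 0, False: 0}  # words seen per class

    def train(self, text, is_spam):
        words = text.lower().split()
        self.counts[is_spam].update(words)
        self.totals[is_spam] += len(words)

    def is_spam(self, text):
        vocab = len(set(self.counts[True]) | set(self.counts[False])) + 1
        score = 0.0
        for w in text.lower().split():
            p_spam = (self.counts[True][w] + 1) / (self.totals[True] + vocab)
            p_ham = (self.counts[False][w] + 1) / (self.totals[False] + vocab)
            score += math.log(p_spam / p_ham)
        return score > 0  # ties fall through to ham


def feed(filt, stream, policy):
    """Classify each message in order, then retrain according to policy.

    policy == "all":    retrain on every message once its true class is known
    policy == "errors": retrain only on false positives/negatives
    Returns (mistakes, messages_trained_on).
    """
    mistakes = trained = 0
    for text, truth in stream:
        wrong = filt.is_spam(text) != truth
        mistakes += wrong
        if policy == "all" or wrong:
            filt.train(text, truth)
            trained += 1
    return mistakes, trained


# Made-up messages, purely for illustration.
stream = [
    ("cheap pills now", True),
    ("meeting at noon", False),
    ("cheap pills today", True),
]

result_all = feed(ToyFilter(), stream, "all")
result_errors = feed(ToyFilter(), stream, "errors")
print("train on all:   ", result_all)
print("train on errors:", result_errors)
```

Three made-up messages say nothing about which policy is better in
practice; the point is only that the two policies leave the filter with
different training data after seeing the same mail.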

> > 
> I've actually been thinking about doing something like this for a
> while.  I already have the training and testing sets:  a complete
> archive of mail for nearly three years, with hand sorted spam for all
> that time as well.
> 
> Now, in my copious free time...

Oooh, three years of mail sounds perfect. If I get some scripts written
up... 
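
The comparison David described (train each filter on the same block of
mail, then test on a newer block) might be scripted roughly as below. This
is a sketch under assumptions: split_corpus(), evaluate() and the two toy
filter classes are hypothetical names, and the toy filters merely stand in
for wrappers around the real tools, which I haven't written.

```python
def split_corpus(messages, n_train):
    """Keep archive order: the oldest n_train messages train, the rest test."""
    msgs = list(messages)
    return msgs[:n_train], msgs[n_train:]


def evaluate(make_filter, train, test):
    """Train a fresh filter on `train`, then count its errors on `test`."""
    filt = make_filter()
    for text, is_spam in train:
        filt.train(text, is_spam)
    false_pos = sum(1 for t, s in test if not s and filt.is_spam(t))
    false_neg = sum(1 for t, s in test if s and not filt.is_spam(t))
    ham = sum(1 for _, s in test if not s)
    return {"false_pos": false_pos, "false_neg": false_neg,
            "ham": ham, "spam": len(test) - ham}


class AlwaysHam:
    """Baseline that never flags anything."""

    def train(self, text, is_spam):
        pass

    def is_spam(self, text):
        return False


class SpamWordFilter:
    """Flags a message containing any word seen in spam but never in ham."""

    def __init__(self):
        self.spam_words = set()
        self.ham_words = set()

    def train(self, text, is_spam):
        target = self.spam_words if is_spam else self.ham_words
        target.update(text.lower().split())

    def is_spam(self, text):
        spam_only = self.spam_words - self.ham_words
        return any(w in spam_only for w in text.lower().split())


# Hand-sorted (text, is_spam) pairs in arrival order; made up for illustration.
corpus = [
    ("cheap pills now", True),
    ("lunch at noon", False),
    ("cheap loans now", True),
    ("clug meeting at noon", False),
    ("cheap pills today", True),
    ("lunch meeting today", False),
]

train, test = split_corpus(corpus, n_train=4)
results = {cls.__name__: evaluate(cls, train, test)
           for cls in (AlwaysHam, SpamWordFilter)}
for name, stats in results.items():
    print(name, stats)
```

Every filter sees exactly the same training and test blocks, so the
false positive and false negative counts are directly comparable. A real
run would swap the toy classes for objects that hand each message to the
filter being tested.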

I've also put some vague ramblings on my wiki:
http://www.nut.house.cx/cgi-bin/nemwiki.pl?SpamMark

.../nemo
-- 
  ------------------------------------------ --------------------------
                                                    earth native


