Spam Filter ISP Support Forum


Does the Bayesian Filter Properly Learn Whitelisted Entries?

dougs (Guest Group)
Posted: 06 May 2004 at 12:44am

We recently reset our Bayesian filter because it was not catching enough of the junk mail.  At the same time, we whitelisted the domains of all of our clients and added more words to the blacklist keywords filter.  The result was very positive, with less spam and fewer false positives, until today.

Today the Bayesian filter hit the 5000 good email mark again (with 30,000+ spam), and now every incoming email that is not on the whitelist is flagged as 100% spam.  Words like "the", "and", and "to" have a spam probability of .99.

This was not the case before we put in the whitelist.  Does the filter properly learn the words in whitelisted emails as well as in blacklisted emails?  Also, I have read postings about a tool to edit the Corpus.  When is that due out?

LogSat (Admin Group)
Posted: 06 May 2004 at 11:53pm

Doug,

SpamFilter does not "learn" from the black/white lists themselves. Instead, it examines the incoming emails, analyzes their content, and updates the statistical database with them. The more accurate your existing "regular" filters, the better trained the statistical filter becomes. Early on, it helps a lot to check the quarantine and force delivery of false positives (legitimate email that has been blocked). This trains the filter to recognize the mistake it made and to give those emails more weight as good mail in the future. Please also note that the filter becomes more accurate over time.
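To make the training idea concrete, here is a minimal Python sketch of how a statistical filter can keep per-word counts and turn them into a spam probability. It is not SpamFilter's actual code: the class name TokenStats, the whitespace tokenizer, and the 0.01/0.99 clamping are all assumptions made for illustration, loosely following the common "fraction of spam vs. fraction of good mail containing the word" approach.

```python
from collections import Counter


class TokenStats:
    """Hypothetical per-word statistics, not SpamFilter's real database."""

    def __init__(self):
        self.good = Counter()   # number of good emails containing each word
        self.spam = Counter()   # number of spam emails containing each word
        self.n_good = 0         # total good emails learned
        self.n_spam = 0         # total spam emails learned

    def train(self, text, is_spam):
        # Count each word once per email (assumed tokenizer: lowercase + split).
        tokens = set(text.lower().split())
        if is_spam:
            self.spam.update(tokens)
            self.n_spam += 1
        else:
            self.good.update(tokens)
            self.n_good += 1

    def spam_probability(self, token):
        # Compare how often the word shows up in spam vs. in good mail.
        s = self.spam[token] / max(self.n_spam, 1)
        g = self.good[token] / max(self.n_good, 1)
        if s + g == 0:
            return 0.5          # never seen: treat as neutral
        p = s / (s + g)
        # Clamp so that no single word is ever treated as absolute proof.
        return min(0.99, max(0.01, p))
```

In this sketch, a common word such as "the" ends up at 0.99 if it has only ever been seen in spam, which is the symptom described above. Calling train(text, is_spam=False) on a legitimate email that contains the word, as forcing delivery from the quarantine is meant to do, pulls its probability back down.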

Also make sure that when you "reset" the Bayesian filter, you delete ALL files in the SpamFilter\Corpus directory to properly start from scratch.

If you continue to see inaccurate results, it may help to double or triple the minimum threshold for Bayes activation (the MinEmailsForBayesKickIm setting in the SpamFilter.ini file) so that the filter kicks in later, after it has done more learning.
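As a rough illustration of what raising that threshold does, the sketch below gates the statistical filter on how many emails it has learned from. The function name bayes_is_active and the rule that both the good and the spam counts must reach the minimum are assumptions for this example; SpamFilter's exact activation logic may differ.

```python
def bayes_is_active(good_seen, spam_seen, min_emails):
    """Hypothetical gate: only classify once enough mail has been learned.

    Assumes the minimum applies to both the good and the spam corpus.
    """
    return good_seen >= min_emails and spam_seen >= min_emails


# With the counts from this thread, a minimum of 5000 activates the filter,
# while a doubled minimum of 10000 keeps it in learning-only mode.
active_now = bayes_is_active(5000, 30000, 5000)
active_doubled = bayes_is_active(5000, 30000, 10000)
```

Under these assumptions, doubling the minimum means the filter keeps learning from another 5000 good emails before it starts flagging anything.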

Roberto F.
LogSat Software
