Does Baysian Filter Properly learn Whitelisted Entries | 
 
    Post Reply  
   | 
  
| Author | |
   
   dougs  
   
   Guest Group  
    | 
  
   
      Post Options
    
        Thanks(0)
      Quote   Reply
   
     Topic: Does Baysian Filter Properly learn Whitelisted EntriesPosted: 06 May 2004 at 12:44am  | 
 
| 
   
    We have recently reset our Bayesian filter. It was not picking up enough of the junk mail. At the same time we whitelisted the domains of all of our clients, while increasing the words in the blacklist keywords filter. The result has been very positive, less spam, and less false positives, until today. Today the Bayesian filter hit the 5000 good email mark again, (30,000+ spam) and now every email that comes in that is not on the whitelist is flagged as 100% Spam. Words like "the", "and", "to" have a .99 percent spam probablity. This was not the case before we put in the whitelist. Does the filter properly learn the words in a whitelisted email as well as the blacklisted emails? Also, I read postings of a tool to edit the Corpus. When is that due out?  | 
 |
![]()  | 
 |
   
   LogSat  
   
   Admin Group  
   Joined: 25 January 2005 Location: United States Status: Offline Points: 4106  | 
  
   
      Post Options
    
        Thanks(0)
      Quote   Reply
   
     Posted: 06 May 2004 at 11:53pm | 
 
| 
   
    Doug, SpamFilter does not "learn" from the black/white lists, but rather it examines the incoming emails, analyzes their content, and updates the statistical database with them. The more accurate your existing "regular" filter, the better trained the statistical filter becomes. It helps a lot at the beginning to check the quarantine to force delivery of false positives (legitimate email that has been blocked). This process trains the filter to recognize the mistake it has made, and to "weigh" more those emails as good mail in the future. Please also note that with time the filter becomes more and more accurate. Also make sure that when you "reset" the Bayesian filter, you delete ALL files in the SpamFilter\Corpus directory to properly start from scratch. If you continue to have incccurate result, it may help to double or triple the minimum threshold for the Bayes activation (MinEmailsForBayesKickIm setting in SpamFilter.ini file) so that it kicks in later after it has performed more learning. Roberto F.  | 
 |
![]()  | 
 |
    Post Reply  
   | 
  |
|       
  
  Tweet   	
    | 
 
| Forum Jump | Forum Permissions  ![]() You cannot post new topics in this forum You cannot reply to topics in this forum You cannot delete your posts in this forum You cannot edit your posts in this forum You cannot create polls in this forum You cannot vote in polls in this forum  | 
 
This page was generated in 0.125 seconds.
 
 
 
 
 
 
 Topic Options
   
 Post Options
 Thanks(0)
 

   