Bayesian Filtering Unsatisfactory |
Post Reply ![]() |
Author | |
Jeff ![]() Guest Group ![]() |
![]() ![]() ![]() ![]() ![]() Posted: 11 May 2004 at 9:18pm |
Is there going to be any way to get false negative emails back into the corpus DB? I've reduces the Bayesian threshold to 2% but it still only catches a few a day. Currently my Spam to Good email ratio is about 4:1 but I suspect it is more like 6:1. Can you update us on the status of the capability to reverse email falsely tagged as good? One thought I have is to have an "optional" Good Email DB similar to the Quarantine DB. If one chose to use this optional DB, they could go through and send false negatives (spam) to the corpus and reverse the good score they got before. I think getting false negatives out of the DB is crucial to get any kind of accurate filtering. At a probablitiy of 95% setting, it would take 20 spam catches to reverse one false negative. Thanks. -jeff
|
|
![]() |
|
LogSat ![]() Admin Group ![]() ![]() Joined: 25 January 2005 Location: United States Status: Offline Points: 4105 |
![]() ![]() ![]() ![]() ![]() |
Jeff, Have you tried, just for the initial training period, to implement more aggressive filtering that will catch more spam? Rejecting "no reverse DNS", adding more powerful keyword searhes, and more MAPS servers could increase your spam/good ratio to the more realistic 6:1 - 8:1 as you correctly pointed out. We will provide for the ability to add "false negatives" soon, but it will still be very helpful to have strong training during the initial deployment. Roberto F. |
|
![]() |
Post Reply ![]() |
|
Tweet
|
Forum Jump | Forum Permissions ![]() You cannot post new topics in this forum You cannot reply to topics in this forum You cannot delete your posts in this forum You cannot edit your posts in this forum You cannot create polls in this forum You cannot vote in polls in this forum |
This page was generated in 0.156 seconds.