I downloaded your software this weekend in an effort to find a decent solution to my spam woes. Everything seems to work fine so far. The documentation leaves a lot to be desired, but I managed nonetheless.
From what I read on your web site and through this forum, your Bayesian filter only learns in real time (by having actual messages pass through it). And you say this is done so that the filter is customized to my environment. I have a collection of emails (about 10,000 each of spam and ham) that I would use to train any Bayesian filter I might try. They are my emails, hence my environment. From my understanding these emails are useless to your filter (yes, I realize there are other ways of filtering besides using the Bayesian filter exclusively), but I would like to train the Bayesian filter from these emails.
So, I have two questions which are basically the same. Question 1: Am I wasting my time by writing a script to "redeliver" all of these messages through your program to a dummy account and then just delete them? When I deliver the spam, I would tell it to consider it all spam and then delete the messages; then I would deliver the ham and set it to let all of those pass through.
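To make the first question concrete, here is a rough sketch of the kind of redelivery script I have in mind. It is Python, and the directory layout (one message per file), the addresses, and the host and port are all placeholders I made up, not anything taken from your documentation. It only resubmits stored messages over SMTP; it does not itself tell the filter which batch is spam and which is ham.

```python
import smtplib
from email import policy
from email.parser import BytesParser
from pathlib import Path


def load_messages(directory):
    """Yield (path, message) for every file in the directory, sorted by name.

    Assumes one raw RFC 5322 message per file (hypothetical layout).
    """
    for path in sorted(Path(directory).iterdir()):
        if path.is_file():
            with path.open("rb") as fh:
                yield path, BytesParser(policy=policy.default).parse(fh)


def redeliver(directory, sender, recipient, send):
    """Hand each stored message to send(msg, sender, recipient).

    Returns the number of messages passed to send().
    """
    count = 0
    for _path, msg in load_messages(directory):
        send(msg, sender, recipient)
        count += 1
    return count


def smtp_sender(host, port):
    """Return a send() callable that submits one message per SMTP connection.

    host and port are placeholders for wherever the filter listens.
    """
    def send(msg, sender, recipient):
        with smtplib.SMTP(host, port) as smtp:
            # The envelope recipient is the dummy account; headers are left as stored.
            smtp.send_message(msg, from_addr=sender, to_addrs=[recipient])
    return send
```

I would run it once per batch, for example redeliver("spam", "trainer@example.com", "dummy@example.com", smtp_sender("localhost", 25)) with the filter set to treat everything as spam, and then again on the ham directory with the filter set to pass everything through.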
Question 2: When you force a quarantined message to be delivered, does the filter learn anything more than the "From" and "To" message fields that are stored in the autoforcewhitelist file? Would it be better (though very time consuming) to redeliver all of my ham messages, let them get quarantined, and then force deliver them, or is that useless as far as training the Bayesian filter goes?
thanks.