Spam Filter ISP Support Forum


BUG - Nothing in the corpus folder but qu

fischer (Newbie)
Joined: 05 August 2005
Posted: 05 August 2005 at 1:31am
I've seen this reported before, but haven't seen a fix. I just installed 2.6.3.473 as a new install on Win2k3 Web Edition, and the corpus folder is empty except for an empty queue folder. Responses from admins in other threads seem to indicate that there should be more there. I even tried deleting the folder and restarting SF, and that didn't change anything. Consequently, Bayesian filtering does not work right now. Anyone have any ideas?

Edited by fischer

LogSat (Admin Group)
Joined: 25 January 2005
Location: United States
Posted: 05 August 2005 at 7:51am
Please check whether, on the "Settings - Bayesian Filter" tab, the checkbox for "Learn new incoming emails" is checked. Underneath it there should be a label that describes the "Bayesian Learning Status". Can you please let us know what that value is? Note that it may take 10 minutes for the database files to appear in the corpus directory as incoming email is processed, and that there needs to be a steady arrival of incoming emails for the Bayesian database to be built.
Roberto Franceschetti

LogSat Software

Spam Filter ISP

fischer
Posted: 05 August 2005 at 9:20am
"Learn new incoming emails" is checked, but "Bayesian Learning Status" shows as inactive. Unchecking the box and saving, then rechecking and saving again, does not change the status. My SF installation has been running for several hours now, and there is still nothing in the corpus folder except the empty queue folder.

LogSat
Posted: 07 August 2005 at 6:51pm
Can you please zip and email a copy of your SpamFilter.ini file, along with SpamFilter's activity logfile for the day you checked/unchecked that box?

fischer
Posted: 07 August 2005 at 7:47pm
What email address should I send it to?

LogSat
Posted: 07 August 2005 at 8:55pm
support @ logsat.com

LogSat
Posted: 08 August 2005 at 10:33pm
From the 1st logfile you sent, for the 4th, we can see that in the timeframe it covers, from 22:03 to 23:57 (almost two hours), you received only one email. As mentioned in our first reply, there needs to be a steady arrival of incoming emails for the Bayesian database to be built.

In the 2nd logfile, from the following day, the email traffic is slightly higher, but it may still not be enough to trigger the saving of the statistical database. The statistical data is cached in memory for a few minutes and is only occasionally saved to disk (to optimize performance). If there is not enough data in memory (on the order of an email every 4-5 minutes), that data is lost and not saved. The Bayesian filter really needs a steady flow of data for the statistical information to be built, and will not activate itself until at least 5000 spam and 5000 good emails are received. At the rate of an email every 5 minutes, you can see that it will not be practical to use that filter.
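
To put a rough number on the rate discussed above, here is a minimal back-of-the-envelope sketch in Python. It is not part of SpamFilter: the function name `minutes_to_activate` and the `spam_fraction` parameter are made up for illustration, and the 50/50 split between spam and good email is an assumption. It also assumes every email is counted, so since data can be lost at low traffic, the result is at best a lower bound.

```python
def minutes_to_activate(minutes_per_email, spam_fraction=0.5,
                        spam_needed=5000, good_needed=5000):
    """Rough lower bound on minutes until both thresholds are reached.

    Hypothetical helper for illustration only. spam_fraction is an
    assumed share of incoming mail that is spam.
    """
    if not 0 < spam_fraction < 1:
        raise ValueError("spam_fraction must be strictly between 0 and 1")
    # Total emails that must arrive before each class reaches its threshold.
    emails_for_spam = spam_needed / spam_fraction
    emails_for_good = good_needed / (1 - spam_fraction)
    # Both thresholds must be met, so the slower class decides.
    return max(emails_for_spam, emails_for_good) * minutes_per_email


minutes = minutes_to_activate(minutes_per_email=5)
print(f"{minutes:.0f} minutes, about {minutes / 1440:.1f} days")
```

With one email every 5 minutes and an even split, that works out to 50000 minutes, or roughly 35 days of uninterrupted traffic, and longer if the mix of spam and good email is lopsided.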

Furthermore, from the SpamFilter.ini file you sent us, we see that your settings for the Bayesian filter are set to 100%, which, as stated in the GUI itself, indicates that the Bayesian filter is disabled. When disabled, the filter will not create or update the Bayesian database.