
Corpus INI

Printed From: LogSat Software
Category: Spam Filter ISP
Forum Name: Spam Filter ISP Support
Forum Description: General support for Spam Filter ISP
URL: https://www.logsat.com/spamfilter/forums/forum_posts.asp?TID=5922
Printed Date: 14 July 2025 at 12:05pm


Topic: Corpus INI
Posted By: DavidF
Subject: Corpus INI
Date Posted: 12 December 2006 at 4:33pm
Hi

We have recently had a problem with the Bayesian filter tagging a large volume of good emails as spam, causing them to be quarantined. We couldn't see why this was happening, so we removed the existing corpus DB and set SF to rebuild a new one. The rebuild progressed OK for the first half day, but corpus.ini has now been stuck for two days at spam 4066 and good 297.

I have checked the Bayesian settings: "learn new email" is checked, and the status notes that it is active. I am not sure where else to look to correct this, or in fact whether it is even a problem.

Also, looking in spamfilter.ini, the scanreceived headers setting is set to 1. Is that the default setting, and what are the ramifications of setting it to 0?

Thanks

David



Replies:
Posted By: LogSat
Date Posted: 12 December 2006 at 6:48pm
David,

If the numbers in the corpus.ini file have not changed (they should be updated every 10-20 minutes), that does indicate a problem with the process within SpamFilter that analyzes incoming emails and updates the database. You could try unchecking the "learn new mail" checkbox, clicking "Save Settings", then re-checking the box and clicking "Save Settings" again. If the corpus.ini file has not been updated within 20 minutes, please try restarting SpamFilter.
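For anyone who wants to automate the "has corpus.ini been updated in the last 20 minutes" check, here is a minimal Python sketch. It only looks at the file's last-modified time; it does not parse the file, since its layout is not described in this thread. The function name `corpus_is_stale` and the example path are hypothetical and not part of SpamFilter.

```python
import os
import time


def corpus_is_stale(path, max_age_minutes=20, now=None):
    """Return True if the file at `path` was last modified more than
    `max_age_minutes` ago.

    `now` is an optional Unix timestamp, mainly useful for testing;
    it defaults to the current time.
    """
    now = time.time() if now is None else now
    age_seconds = now - os.path.getmtime(path)
    return age_seconds > max_age_minutes * 60


if __name__ == "__main__":
    # Hypothetical location; adjust to wherever your corpus.ini lives.
    ini_path = r"C:\SpamFilter\corpus.ini"
    if os.path.exists(ini_path):
        if corpus_is_stale(ini_path):
            print("corpus.ini has not been updated in the last 20 minutes")
        else:
            print("corpus.ini was updated recently")
    else:
        print("corpus.ini not found at", ini_path)
```

The 20-minute threshold is taken from the upper end of the 10-20 minute update interval mentioned above; it could be run from a scheduled task to flag a stalled corpus early.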

If none of this works, please zip and email us SpamFilter's activity logfile for the day so we can take a look.


-------------
Roberto Franceschetti

LogSat Software: http://www.logsat.com

Spam Filter ISP: http://www.logsat.com/sfi-spam-filter.asp


Posted By: DavidF
Date Posted: 13 December 2006 at 12:17am
OK, I have tried unchecking and re-checking "learn new mail", but this doesn't seem to have made any difference. Looking at the scrolling log files, I haven't seen the "Time to add msg to bayes corpus" line that usually appears after each message is processed. I have zipped up today's partial log file and emailed it to you.
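Rather than watching the scrolling log, the same check can be done by counting how many lines of a saved logfile contain that text. This is a rough Python sketch: the exact log line format is not shown in this thread, so it only does a case-insensitive substring match, and the function names and the example file path are hypothetical.

```python
MARKER = "Time to add msg to bayes corpus"


def count_marker_lines(lines, marker=MARKER):
    """Count the lines that contain `marker` (case-insensitive)."""
    needle = marker.lower()
    return sum(1 for line in lines if needle in line.lower())


def count_in_file(path, marker=MARKER):
    """Count marker lines in a logfile on disk.

    The log's encoding is an assumption; undecodable bytes are
    replaced so that the scan never aborts partway through.
    """
    with open(path, "r", encoding="utf-8", errors="replace") as fh:
        return count_marker_lines(fh, marker)


if __name__ == "__main__":
    import sys

    # Usage: python count_bayes_lines.py <path-to-activity-logfile>
    if len(sys.argv) == 2:
        print(count_in_file(sys.argv[1]), "matching lines")
    else:
        print("usage: count_bayes_lines.py <logfile>")
```

A count of zero for a day with normal mail traffic would match what I am seeing on screen.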

Thanks


