<?xml version="1.0" encoding="utf-8" ?>
<?xml-stylesheet type="text/xsl" href="RSS_xslt_style.asp" version="1.0" ?>
<rss version="2.0" xmlns:WebWizForums="http://syndication.webwiz.co.uk/rss_namespace/">
 <channel>
  <title>Spam Filter ISP Forums : 2.0 Beta Question</title>
  <link>https://www.logsat.com/spamfilter/forums/</link>
  <description><![CDATA[This is an XML content feed of; Spam Filter ISP Forums : Spam Filter ISP Support : 2.0 Beta Question]]></description>
  <pubDate>Fri, 17 Apr 2026 02:09:32 +0000</pubDate>
  <lastBuildDate>Wed, 22 Oct 2003 23:24:00 +0000</lastBuildDate>
  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
  <generator>Web Wiz Forums 11.04</generator>
  <ttl>360</ttl>
  <WebWizForums:feedURL>https://www.logsat.com/spamfilter/forums/RSS_post_feed.asp?TID=2276</WebWizForums:feedURL>
  <image>
   <title><![CDATA[Spam Filter ISP Forums]]></title>
   <url>https://www.logsat.com/spamfilter/forums/forum_images/web_wiz_forums.png</url>
   <link>https://www.logsat.com/spamfilter/forums/</link>
  </image>
  <item>
   <title><![CDATA[2.0 Beta Question : Ideally we&amp;#039;d like statistics,...]]></title>
   <link>https://www.logsat.com/spamfilter/forums/forum_posts.asp?TID=2276&amp;PID=2304&amp;title=20-beta-question#2304</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://www.logsat.com/spamfilter/forums/member_profile.asp?PF=8">LogSat</a><br /><strong>Subject:</strong> 2276<br /><strong>Posted:</strong> 22 October 2003 at 11:24pm<br /><br /><P>Ideally we'd like statistics, not email content itself. We'd&nbsp;like to know what happens to the percentages of false positives (good emails being blocked), and percentage of "misses" (spam slipping thru the filters) as the corpus increases in size.</P><P>Roberto F.<BR>LogSat Software</P>]]>
   </description>
   <pubDate>Wed, 22 Oct 2003 23:24:00 +0000</pubDate>
   <guid isPermaLink="true">https://www.logsat.com/spamfilter/forums/forum_posts.asp?TID=2276&amp;PID=2304&amp;title=20-beta-question#2304</guid>
  </item> 
  <item>
   <title><![CDATA[2.0 Beta Question : R, &amp;gt;&amp;gt;We still do not have...]]></title>
   <link>https://www.logsat.com/spamfilter/forums/forum_posts.asp?TID=2276&amp;PID=2293&amp;title=20-beta-question#2293</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://www.logsat.com/spamfilter/forums/member_profile.asp?PF=44">Dan B</a><br /><strong>Subject:</strong> 2276<br /><strong>Posted:</strong> 22 October 2003 at 9:34am<br /><br /><P>R,</P><P>&gt;&gt;We still do not have any valid data on possible high rejections with large corpus databases. Any user input on this, where the corpus db.dat file is &gt; 10MB will be appreciated.</P><P>What info do you need?&nbsp; Some emails that are rejected that are legitimate emails, our corpus files?<BR>Just let me know,</P><P>Dan B</P>]]>
   </description>
   <pubDate>Wed, 22 Oct 2003 09:34:00 +0000</pubDate>
   <guid isPermaLink="true">https://www.logsat.com/spamfilter/forums/forum_posts.asp?TID=2276&amp;PID=2293&amp;title=20-beta-question#2293</guid>
  </item> 
  <item>
   <title><![CDATA[2.0 Beta Question : Dan, The current beta blocks...]]></title>
   <link>https://www.logsat.com/spamfilter/forums/forum_posts.asp?TID=2276&amp;PID=2280&amp;title=20-beta-question#2280</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://www.logsat.com/spamfilter/forums/member_profile.asp?PF=8">LogSat</a><br /><strong>Subject:</strong> 2276<br /><strong>Posted:</strong> 21 October 2003 at 12:26am<br /><br /><P>Dan,</P><P>The current beta blocks emails if the Bayesian probability is above 0.9 (90%). The second beta we are going to release in a few days will have either this value hardcoded to 0.99 or it will be user-selectable. The final version will definetly have it user-selectable.</P><P>The s2nd beta will also automatically prune the corpus database removing old keyword tokens that have not been seen in emails recently. This should decrease the size of the corpus and help with memory leak issues present in the current beta. </P><P>We still do not have any valid data on possible high rejections with large corpus databases. Any user input on this, where the corpus db.dat file is &gt; 10MB will be appreciated.</P><P>Roberto F.<BR>LogSat Software</P>]]>
   </description>
   <pubDate>Tue, 21 Oct 2003 00:26:00 +0000</pubDate>
   <guid isPermaLink="true">https://www.logsat.com/spamfilter/forums/forum_posts.asp?TID=2276&amp;PID=2280&amp;title=20-beta-question#2280</guid>
  </item> 
  <item>
   <title><![CDATA[2.0 Beta Question : R, We have been running the 2.0...]]></title>
   <link>https://www.logsat.com/spamfilter/forums/forum_posts.asp?TID=2276&amp;PID=2276&amp;title=20-beta-question#2276</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://www.logsat.com/spamfilter/forums/member_profile.asp?PF=2">Guests</a><br /><strong>Subject:</strong> 2276<br /><strong>Posted:</strong> 20 October 2003 at 4:29pm<br /><br /><P>R,</P><P>We have been running the 2.0 beta for 4 days of so.&nbsp; On what percentage does the spam filter mark the email message as spam?&nbsp; Is there a way to change that percentage possible with the GUI in future release or even turn off the Bayesian portion?&nbsp;&nbsp; I did read on your website that there is a issue of legitimate emails to be blocked after the corpus has grown to several MB in size.&nbsp; Any updates on this?&nbsp; We are seeing legitimate emails being blocked.</P><P>Thanks,<BR>Dan B</P>]]>
   </description>
   <pubDate>Mon, 20 Oct 2003 16:29:00 +0000</pubDate>
   <guid isPermaLink="true">https://www.logsat.com/spamfilter/forums/forum_posts.asp?TID=2276&amp;PID=2276&amp;title=20-beta-question#2276</guid>
  </item> 
 </channel>
</rss>