<?xml version="1.0" encoding="utf-8" ?>
<?xml-stylesheet type="text/xsl" href="RSS_xslt_style.asp" version="1.0" ?>
<rss version="2.0" xmlns:WebWizForums="http://syndication.webwiz.co.uk/rss_namespace/">
 <channel>
  <title>Spam Filter ISP Forums : Another thought on getting missed spam back into the corpus</title>
  <link>https://www.logsat.com/spamfilter/forums/</link>
  <description><![CDATA[This is an XML content feed of; Spam Filter ISP Forums : Spam Filter ISP Support : Another thought on getting missed spam back into the corpus]]></description>
  <pubDate>Sun, 08 Mar 2026 12:57:58 +0000</pubDate>
  <lastBuildDate>Fri, 26 Mar 2004 14:27:00 +0000</lastBuildDate>
  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
  <generator>Web Wiz Forums 11.04</generator>
  <ttl>360</ttl>
  <WebWizForums:feedURL>https://www.logsat.com/spamfilter/forums/RSS_post_feed.asp?TID=3346</WebWizForums:feedURL>
  <image>
   <title><![CDATA[Spam Filter ISP Forums]]></title>
   <url>https://www.logsat.com/spamfilter/forums/forum_images/web_wiz_forums.png</url>
   <link>https://www.logsat.com/spamfilter/forums/</link>
  </image>
  <item>
   <title><![CDATA[Another thought on getting missed spam back into the corpus : &gt;&gt;My idea is to have a separate...]]></title>
   <link>https://www.logsat.com/spamfilter/forums/forum_posts.asp?TID=3346&amp;PID=3347&amp;title=another-thought-on-getting-missed-spam-back-into-the-corpus#3347</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://www.logsat.com/spamfilter/forums/member_profile.asp?PF=2">Guests</a><br /><strong>Subject:</strong> 3346<br /><strong>Posted:</strong> 26 March 2004 at 2:27pm<br /><br /><P>&gt;&gt;My idea is to have a separate database to <BR>&gt;&gt;store a COPY of all passed email.&nbsp; <BR></P><P>That was certainly what I was thinking as well.</P><P>I doubt that anyone wants to personally vet all the good mail on an individual basis...</P>]]>
   </description>
   <pubDate>Fri, 26 Mar 2004 14:27:00 +0000</pubDate>
   <guid isPermaLink="true">https://www.logsat.com/spamfilter/forums/forum_posts.asp?TID=3346&amp;PID=3347&amp;title=another-thought-on-getting-missed-spam-back-into-the-corpus#3347</guid>
  </item> 
  <item>
   <title><![CDATA[Another thought on getting missed spam back into the corpus : The new Bayesian filtering seems...]]></title>
   <link>https://www.logsat.com/spamfilter/forums/forum_posts.asp?TID=3346&amp;PID=3346&amp;title=another-thought-on-getting-missed-spam-back-into-the-corpus#3346</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://www.logsat.com/spamfilter/forums/member_profile.asp?PF=2">Guests</a><br /><strong>Subject:</strong> 3346<br /><strong>Posted:</strong> 26 March 2004 at 12:33pm<br /><br /><P>The new Bayesian filtering seems to work well, but there is still the nagging question of how to get those few pieces of missed spam into the corpus so they can be caught in the future.</P><P>It has been suggested to offer the ability to hold all mail for an admin to sift through.&nbsp; To me this would be a ridiculous chore to have to go through, and the holdup on getting urgent email would anger a lot of users.</P><P>The problem seems to be how to keep the email format from being modified by the end user's email application.&nbsp; My idea is to have a separate database to store a COPY of all passed email.&nbsp; This can be set with an aging period, much like the regular quarantine, to keep it from getting too large.&nbsp; When spam passes through undetected, the admin can go to the GUI, find the copy, and select to add it to the corpus as spam.&nbsp; Since this copy is still in the correct format, it would not cause any problems with SF.&nbsp; In the process this would also remove the incorrect "good" counts (that were tallied for the original email) from the corpus.&nbsp; </P><P>What do you think?</P>]]>
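    <![CDATA[<P>To make the idea above concrete, here is a rough sketch in Python of how the copy store and the corpus correction might fit together.&nbsp; This is only an illustration under my own assumptions and not how SF actually stores anything: the names Corpus, PassedMailStore, tokenize and report_missed_spam are all made up, the tokenizer is deliberately crude, and everything lives in memory instead of a real database.</P>
<PRE>
```python
import re
import time
from collections import Counter

# Hypothetical, deliberately crude tokenizer; a real filter would do far more.
TOKEN_RE = re.compile(r"[A-Za-z0-9$'-]+")


def tokenize(raw_message):
    """Split an unmodified raw message into lowercase tokens."""
    return [t.lower() for t in TOKEN_RE.findall(raw_message)]


class Corpus:
    """Toy Bayesian corpus: per-token tallies for good mail and spam."""

    def __init__(self):
        self.good = Counter()
        self.spam = Counter()

    def train_good(self, raw_message):
        # Tally every token of a passed message as "good".
        self.good.update(tokenize(raw_message))

    def reclassify_as_spam(self, raw_message):
        # Undo the earlier "good" tallies, then count the tokens as spam.
        tokens = Counter(tokenize(raw_message))
        self.good.subtract(tokens)
        self.good = +self.good  # unary plus drops zero and negative counts
        self.spam.update(tokens)


class PassedMailStore:
    """Keeps an untouched COPY of each passed message for a limited time."""

    def __init__(self, max_age_seconds):
        self.max_age_seconds = max_age_seconds
        self._copies = {}  # message_id -> (stored_at, raw_message)

    def add(self, message_id, raw_message, now=None):
        stored_at = time.time() if now is None else now
        self._copies[message_id] = (stored_at, raw_message)

    def purge_expired(self, now=None):
        # Aging period, similar in spirit to the regular quarantine.
        now = time.time() if now is None else now
        expired = [mid for mid, (stored_at, _) in self._copies.items()
                   if now - stored_at > self.max_age_seconds]
        for mid in expired:
            del self._copies[mid]

    def report_missed_spam(self, message_id, corpus):
        # The admin picks the stored copy; it leaves the store and
        # is fed back into the corpus as spam.
        _, raw_message = self._copies.pop(message_id)
        corpus.reclassify_as_spam(raw_message)
```
</PRE>
<P>The point of working from the stored copy is that the tokens removed from the "good" side are exactly the ones that were tallied when the message first passed, which would not be guaranteed if the admin forwarded the message back from a mail client.</P>]]>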
   </description>
   <pubDate>Fri, 26 Mar 2004 12:33:00 +0000</pubDate>
   <guid isPermaLink="true">https://www.logsat.com/spamfilter/forums/forum_posts.asp?TID=3346&amp;PID=3346&amp;title=another-thought-on-getting-missed-spam-back-into-the-corpus#3346</guid>
  </item> 
 </channel>
</rss>