Spam Filter ISP Support Forum

  New Posts New Posts RSS Feed - Import messages
  FAQ FAQ  Forum Search   Register Register  Login Login

Import messages

 Post Reply Post Reply
Author
Derk View Drop Down
Groupie
Groupie


Joined: 26 January 2005
Location: United States
Status: Offline
Points: 39
Post Options Post Options   Thanks (0) Thanks(0)   Quote Derk Quote  Post ReplyReply Direct Link To This Post Topic: Import messages
    Posted: 26 January 2005 at 4:24pm

is it, or could it be, possible to have the spam filter application read a text file containing e-mail messages to be analyzed and included in the Bayesian filter database? after they are imported the application could erase the file.

I would like to be able to forward messages that get by the filter to a spam@mydomain catch all account.  Then I can set up a scheduled task to retrieve the messages and write them to a text file to be imported by the application.

Back to Top
LogSat View Drop Down
Admin Group
Admin Group
Avatar

Joined: 25 January 2005
Location: United States
Status: Offline
Points: 4065
Post Options Post Options   Thanks (0) Thanks(0)   Quote LogSat Quote  Post ReplyReply Direct Link To This Post Posted: 26 January 2005 at 8:34pm
Derk,

That is currently not possible. SpamFilter does scan a folder (\SpamFilter\corpus\queue\*.token) for temporary text files  with a .token extension. However the token files must only contain the email's "tokens", not the full email text. Tokens are the keywords that SpamFilter extracts from the email itself. The 1st line of the text file containt the words ".good" or ".spam" or ".falsepositive" to catalog the tokens as belonging to a good or spam email.
In theory you could write an application to create tokens out of an email, but we're not going to be able to provide any details on how and using which rules we use to create tokens from an email. We don't want any spammer to find out about the innerworkings of our software...
Roberto Franceschetti

LogSat Software

Spam Filter ISP
Back to Top
Web123 View Drop Down
Newbie
Newbie
Avatar

Joined: 26 January 2005
Location: Finland
Status: Offline
Points: 31
Post Options Post Options   Thanks (0) Thanks(0)   Quote Web123 Quote  Post ReplyReply Direct Link To This Post Posted: 27 January 2005 at 1:30am
I also get quite a lot false positives trough my system, and would
love to "import" them somehow to Spamfilter?
 
regards,
Kim
Back to Top
Marrab View Drop Down
Newbie
Newbie
Avatar

Joined: 27 January 2005
Location: Russian Federation
Status: Offline
Points: 12
Post Options Post Options   Thanks (0) Thanks(0)   Quote Marrab Quote  Post ReplyReply Direct Link To This Post Posted: 27 January 2005 at 1:38am

Thanks. Now i can although somehow control bayesian filter. One more question: does any quarantined mail be considered bayesian filter as spam?

Back to Top
Marrab View Drop Down
Newbie
Newbie
Avatar

Joined: 27 January 2005
Location: Russian Federation
Status: Offline
Points: 12
Post Options Post Options   Thanks (0) Thanks(0)   Quote Marrab Quote  Post ReplyReply Direct Link To This Post Posted: 27 January 2005 at 1:52am
It seems i understand why my bayesian filter working so strange. Token files contains only mail headers. Interestingly why?

Edited by Marrab
Back to Top
Derk View Drop Down
Groupie
Groupie


Joined: 26 January 2005
Location: United States
Status: Offline
Points: 39
Post Options Post Options   Thanks (0) Thanks(0)   Quote Derk Quote  Post ReplyReply Direct Link To This Post Posted: 27 January 2005 at 4:20pm

Originally posted by LogSat LogSat wrote:

Derk,

That is currently not possible. SpamFilter does scan a folder (\SpamFilter\corpus\queue\*.token) for temporary text files  with a .token extension. However the token files must only contain the email's "tokens", not the full email text. Tokens are the keywords that SpamFilter extracts from the email itself. The 1st line of the text file containt the words ".good" or ".spam" or ".falsepositive" to catalog the tokens as belonging to a good or spam email.
In theory you could write an application to create tokens out of an email, but we're not going to be able to provide any details on how and using which rules we use to create tokens from an email. We don't want any spammer to find out about the innerworkings of our software...

what about if I set up a subdomain that I can forward messages to, like spam@junk.mydomain.com?  Can I configure the application to consider every single message spam to be added to the Bayesian database?

Back to Top
LogSat View Drop Down
Admin Group
Admin Group
Avatar

Joined: 25 January 2005
Location: United States
Status: Offline
Points: 4065
Post Options Post Options   Thanks (0) Thanks(0)   Quote LogSat Quote  Post ReplyReply Direct Link To This Post Posted: 27 January 2005 at 11:16pm
We recommend strongly against "interfering" with the Bayesian filter by "forcing" emails or worse spam thru it. The bayesian filter needs to see the true email received by the mail server to function properly. If junk mail is fed to it, that junk is not representitative of the true emails received by SpamFilter, and the bayesian filter will loose its accuracy.

To answer Marrab, yes, all email that is quarantined is considered to be spam by the statistical filter.
Roberto Franceschetti

LogSat Software

Spam Filter ISP
Back to Top
Web123 View Drop Down
Newbie
Newbie
Avatar

Joined: 26 January 2005
Location: Finland
Status: Offline
Points: 31
Post Options Post Options   Thanks (0) Thanks(0)   Quote Web123 Quote  Post ReplyReply Direct Link To This Post Posted: 28 January 2005 at 1:20am

How about at least Blacklist all emails sent to spam@junk.mydomain.com? Could it be done?

Back to Top
LogSat View Drop Down
Admin Group
Admin Group
Avatar

Joined: 25 January 2005
Location: United States
Status: Offline
Points: 4065
Post Options Post Options   Thanks (0) Thanks(0)   Quote LogSat Quote  Post ReplyReply Direct Link To This Post Posted: 28 January 2005 at 5:06pm
Yes, it can:

  • Local TO EMail Blacklist - The recipient's email address is checked against your local list of blacklisted email addresses. If present, it is rejected.

Roberto Franceschetti

LogSat Software

Spam Filter ISP
Back to Top
Web123 View Drop Down
Newbie
Newbie
Avatar

Joined: 26 January 2005
Location: Finland
Status: Offline
Points: 31
Post Options Post Options   Thanks (0) Thanks(0)   Quote Web123 Quote  Post ReplyReply Direct Link To This Post Posted: 29 January 2005 at 2:54am
sorry!,
meant that if I get a spam-message that is not caught by the filter,
I could then forward it to a address(spam@junk.mydomain.com)
and it would parse the message and blacklist all the non local addressees.
 
Kim
Back to Top
Guests View Drop Down
Guest Group
Guest Group
Post Options Post Options   Thanks (0) Thanks(0)   Quote Guests Quote  Post ReplyReply Direct Link To This Post Posted: 30 January 2005 at 8:28am
In theory you could, but you would have to forward the original, unmodified message. This means that the email headers MUST be exactly as the original ones, and the same applies to the email body. Sipmly taking the email and forwarding it using an email client will not work accurately, as email clients, especially Outlook, completely modiffy the original email's format.
Back to Top
Blaine View Drop Down
Guest Group
Guest Group
Post Options Post Options   Thanks (0) Thanks(0)   Quote Blaine Quote  Post ReplyReply Direct Link To This Post Posted: 01 March 2005 at 9:42am
We host many domains and some clients as that we bypass the filter for their domain.  When we pass their mail though the filter does this impact our bayesian filter or are these emails not considered in the filering rules?

It seems the longer we run our system the less effective the bayesian filter becomes we are considering dumping the database and starting over but we have process 500000 plus emails and hate to lose all those statistics?

Any help would be really apprecitated.
Back to Top
 Post Reply Post Reply
  Share Topic   

Forum Jump Forum Permissions View Drop Down



This page was generated in 0.078 seconds.