Statistics: keyword |
Post Reply
|
| Author | |
meatboy
Newbie
Joined: 26 June 2006 Status: Offline Points: 18 |
Post Options
Thanks(0)
Quote Reply
Topic: Statistics: keywordPosted: 15 August 2006 at 2:21am |
|
Hi, As a suggestion to improve Spamfilter would it be possible to add statistics on how often a keyword has been used to find spam? A count showing the keywords/regex effectiveness? I suspect the order that the keywords are scanned would mean that keywords that are "higher up" in the list would tend to score more but the information would at least show those keywords that are not of any use. The idea is to reduce the number of useless words. Could this be implemented and would it be of any use? Tim Edited by meatboy |
|
![]() |
|
sgeorge
Senior Member
Joined: 23 August 2005 Status: Offline Points: 178 |
Post Options
Thanks(0)
Quote Reply
Posted: 16 August 2006 at 11:40am |
|
Hi meatboy. Actually, this is very possible if you are using a quarantine database and quarantine all messages that blocked because of keyword matches. Here's some SQL that should give you a list of the keywords that sent messages to the quarantine, sorted by greatest # of occurances per keyword:
Stephen |
|
![]() |
|
LogSat
Admin Group
Joined: 25 January 2005 Location: United States Status: Offline Points: 4106 |
Post Options
Thanks(0)
Quote Reply
Posted: 16 August 2006 at 4:13pm |
|
Thanks sgeorge, excellent idea, we'll be using that ourselves
!!
|
|
![]() |
|
sgeorge
Senior Member
Joined: 23 August 2005 Status: Offline Points: 178 |
Post Options
Thanks(0)
Quote Reply
Posted: 16 August 2006 at 4:57pm |
|
Oh my
, well thanks!Stephen |
|
![]() |
|
meatboy
Newbie
Joined: 26 June 2006 Status: Offline Points: 18 |
Post Options
Thanks(0)
Quote Reply
Posted: 16 August 2006 at 7:15pm |
|
Hi Sgeorge, I have tried a quarantine DB but only the access one that LogSat provide. I like you idea though. Perhaps I can swap over to a SQL Db instead. Thanks for the idea! Tim |
|
![]() |
|
Desperado
Senior Member
Joined: 27 January 2005 Location: United States Status: Offline Points: 1143 |
Post Options
Thanks(0)
Quote Reply
Posted: 18 August 2006 at 9:09am |
|
Hmmm ... I still think Sawmill works better for this as it looka at ALL the blocked items rather than just the ones that are still in quarantine as below: (not sure how this will post)
|
|
|
The Desperado
Dan Seligmann. Work: http://www.mags.net Personal: http://www.desperado.com |
|
![]() |
|
sgeorge
Senior Member
Joined: 23 August 2005 Status: Offline Points: 178 |
Post Options
Thanks(0)
Quote Reply
Posted: 18 August 2006 at 10:08am |
|
meatboy, I use an Access DB as well and that query does the trick for me... I believe that the query should work unchanged in SQL and mySQL as well.
Desperado, good point. Only reason I don't use Sawmill is because my evaluation expired. Hey, glad to see that some of my keywords are in your top 5 list. ![]() Stephen |
|
![]() |
|
Desperado
Senior Member
Joined: 27 January 2005 Location: United States Status: Offline Points: 1143 |
Post Options
Thanks(0)
Quote Reply
Posted: 18 August 2006 at 10:24am |
|
Stephen, Top 1 even! Thanks Edited by Desperado |
|
|
The Desperado
Dan Seligmann. Work: http://www.mags.net Personal: http://www.desperado.com |
|
![]() |
|
Post Reply
|
|
|
Tweet
|
| Forum Jump | Forum Permissions ![]() You cannot post new topics in this forum You cannot reply to topics in this forum You cannot delete your posts in this forum You cannot edit your posts in this forum You cannot create polls in this forum You cannot vote in polls in this forum |
This page was generated in 0.164 seconds.


Topic Options
Post Options
Thanks(0)


!!
, well thanks!
Hey, glad to see that some of my keywords are in your top 5 list. 
