
Help blocking HTML source code in email

Printed From: LogSat Software
Category: Spam Filter ISP
Forum Name: Spam Filter ISP Support
Forum Description: General support for Spam Filter ISP
URL: https://www.logsat.com/spamfilter/forums/forum_posts.asp?TID=4368
Printed Date: 30 April 2025 at 12:40pm


Topic: Help blocking HTML source code in email
Posted By: CyberBob
Subject: Help blocking HTML source code in email
Date Posted: 30 September 2004 at 11:00am

Spamfilter is working great! Our keyword list combined with blocked domains etc. is working well.

The vast majority of emails that are "making it through" are HTML-based emails that pull an image from an external site. I have been viewing the source of common spam messages and adding the "href" domains (medsonline.com) to the keyword list. This works well, but it's very time consuming. The emails are sent from another domain like yahoo.com, so you cannot block the receiving domain or sender, as those change with every email.

Am I missing a simple way to filter the source code or does anyone have an idea?

I have the Bayesian filter at 99.07%, but our keyword list is so extensive that the filter never has a chance to filter much. Will the filter pick up on source code emails?

Thanks,

Bob




Replies:
Posted By: LogSat
Date Posted: 30 September 2004 at 11:16pm
Bob,

Several users have implemented keyword lists based on Regular Expressions (RegEx), which are very powerful and effective on HTML emails. If you scan through the forum you will find several samples and contacts. I'll also email you an extensive keyword list we have that contains some powerful regular expressions.

Roberto F. LogSat Software


Posted By: CyberBob
Date Posted: 01 October 2004 at 1:52pm

Thanks for the keyword list and feedback but we still aren't on the same page.

I started with your keyword list many months ago and we have continued to add to it. Your keyword list was simply a starting point; we had too much coming through, so we kept growing the list.

I don't see "Subject:" used anywhere in your keyword list. Why is this?

In our experience you need both keywords in the body and subject. If we put viagra in our keyword list it would NOT be blocked if it were in the subject line only so we have keywords listed for both instances sometimes.

Also there are only 3-4 lines of Regex in the entire keyword list. I understand Regex is powerful (if anyone can take the time to understand RegEx) but you must have more than this on your server?

I had to remove ((<font color="#ffffff">.*){5}) as people were complaining about HTML emails from valid companies like Home Depot, Staples etc being blocked.
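A minimal sketch of why that pattern can catch legitimate mail, assuming Python's `re` module as a stand-in (SpamFilter's own regex engine may behave differently). The pattern only counts five white-font tags and accepts anything between them, so a newsletter that prints white text on a coloured banner would match just as well as spam hiding text. The sample strings below are made up for illustration.

```python
import re

# The pattern quoted in the post, unchanged.
# Assumption: Python's `re` is used here only to illustrate the behaviour.
WHITE_FONT_X5 = re.compile(r'((<font color="#ffffff">.*){5})')

def has_five_white_font_tags(html: str) -> bool:
    """Return True if the pattern from the post matches anywhere in html."""
    return WHITE_FONT_X5.search(html) is not None

# Hypothetical single-line fragments, not taken from any real message.
banner_with_five = '<font color="#ffffff">Sale</font>' * 5
banner_with_four = '<font color="#ffffff">Sale</font>' * 4

print(has_five_white_font_tags(banner_with_five))  # five tags: match
print(has_five_white_font_tags(banner_with_four))  # four tags: no match
```

Note that in Python `.` does not cross line breaks unless `re.DOTALL` is set, so results on multi-line messages depend on the engine and its flags.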

I will PAY someone out there to mentor me on how to write the RegEx. In the meantime, my original post was about emails that are getting through because they use very common subject lines that we cannot block, and then use source code to pull down an image from an external site. Here is the code I'm talking about:

<html><body>
<center><a href="http://thethingsonline.com"><img src="http://www.thethingsonline.com/acq30LL2fy5gkpc.gif" border=0></a></center>
</body></html>

The subject line said "Hello"

We have started to block many of the href domains like thethingsonline.com 
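Collecting those domains by hand is the time-consuming part, so here is a rough sketch of how the link and image hosts could be pulled out of a message's HTML source automatically. It is written in Python with only the standard library and runs outside SpamFilter; the class and function names are made up for this example, and feeding the results into the keyword list is left as a manual step.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse

class LinkDomainCollector(HTMLParser):
    """Collects host names found in href and src attributes."""

    def __init__(self):
        super().__init__()
        self.domains = set()

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in ("href", "src") and value:
                # Strip stray spaces around the URL before parsing it.
                host = urlparse(value.strip()).hostname
                if host:
                    self.domains.add(host.lower())

def link_domains(html):
    """Return the sorted host names referenced by links and images."""
    collector = LinkDomainCollector()
    collector.feed(html)
    collector.close()
    return sorted(collector.domains)

sample = (
    '<html><body><center><a href="http://thethingsonline.com">'
    '<img src=" http://www.thethingsonline.com/acq30LL2fy5gkpc.gif" border=0>'
    '</a></center></body></html>'
)
print(link_domains(sample))
```

Each host found this way is a candidate keyword, though every one should be reviewed before blocking, since legitimate mail also links to external images.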

Is there a way to block these or do I just need to learn more about RegEx?

Take a look at  this one.

What looked like Vicodin in simple text looked like this in the source code: V<kcumfutxe6xdp>i<kgx5xs2gg75172>co<kc3ohrq1xy3ea>din
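One possible way to catch this trick, sketched in Python: build a regular expression that allows any number of bogus tags between the letters of a keyword. The helper name is made up for this example, and whether SpamFilter's keyword list accepts the `(?:...)` grouping syntax is an assumption that would need checking.

```python
import re

# Zero or more HTML-like tags, e.g. <kcumfutxe6xdp>, between two letters.
TAG_RUN = r'(?:<[^>]*>)*'

def obfuscation_tolerant_pattern(word):
    """Build a regex matching `word` even with tags inserted between letters."""
    return TAG_RUN.join(re.escape(ch) for ch in word)

pattern = obfuscation_tolerant_pattern("vicodin")
sample = "V<kcumfutxe6xdp>i<kgx5xs2gg75172>co<kc3ohrq1xy3ea>din"

match = re.search(pattern, sample, re.IGNORECASE)
print(pattern)
print(match is not None)
```

The same pattern also matches the plain word, so one entry per keyword covers both the obfuscated and the unobfuscated form.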

Although you don't recommend it, our keyword list is around 1300 lines, and it works apart from the code above that I'm trying to block. I'd like to learn more about RegEx, so if someone can mentor me I'm happy to pay!

 



Posted By: Guests
Date Posted: 04 October 2004 at 3:06am

Can you please also forward me your keyword list? I have been looking for suitable ones and haven't succeeded.

Thanks
Stephen



Posted By: kspare
Date Posted: 04 October 2004 at 3:24am

Roberto, can you please send me your keyword list? I'll also need to talk to you via email about some issues I have with SpamFilter and some workarounds I have found.

 

Kevin



Posted By: Guests
Date Posted: 04 October 2004 at 3:03pm

Can you also send it to me?
Thanks 8-)

I think it would be nice if the area for registered users offered useful information like this,
along with the latest DNSRBL lists, etc.

 



