
Foreign language keyword filtering

Printed From: LogSat Software
Category: Spam Filter ISP
Forum Name: Spam Filter ISP Support
Forum Description: General support for Spam Filter ISP
URL: https://www.logsat.com/spamfilter/forums/forum_posts.asp?TID=7141
Printed Date: 15 October 2024 at 2:52pm


Topic: Foreign language keyword filtering
Posted By: yapadu
Date Posted: 13 November 2016 at 7:56pm
Can anyone provide some advice on the best way to filter foreign languages?

Developing keyword filters for English and other Latin-alphabet languages is no problem, but what about Russian, Chinese, Japanese, etc.?

For example, I am looking at a Russian spam message. The character encoding is windows-1251 and the spam is about ordering a chess board for the upcoming holidays.

At the bottom it says something (thanks, Google Translate!) about ordering right now by phone. I'm seeing a lot of mail like this, so I wanted to develop a keyword filter.

The problem is that I have no idea how to filter on these characters. Here is a snippet from the spam:

ЗАКАЖИТЕ БЕСПЛАТНЫЙ ОБРАТНЫЙ ЗВОНОК ПРЯМО СЕЙЧАС >>

I was going to filter on this bit:

ПРЯМО СЕЙЧАС

How can I get these characters into SpamFilter so that they match?




-------------
--------------------------------------------------------------
I am a user of SF, not an employee. Use any advice offered at your own risk.



Replies:
Posted By: LogSat
Date Posted: 14 November 2016 at 8:10am
yapadu,

As of SpamFilter v4.6.1.119, we added support for Unicode characters in the blacklists/whitelists. You can therefore add this line to your list of keywords to stop emails with that content:
ПРЯМО,СЕЙЧАС
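
To illustrate why the encoding matters here, below is a minimal Python sketch. It is not SpamFilter's code, and the function names `decode_body` and `matches_all` are made up for the example. It shows that the same Russian phrase is stored as different bytes in windows-1251 and UTF-8, so a keyword only matches reliably once the message body has been decoded to Unicode using its declared charset. The sketch also assumes that a comma-separated keyword line means "all of these words must appear"; that is an assumption, so check the SpamFilter documentation for the exact semantics.

```python
from typing import Iterable

# Illustration only: hypothetical helpers, not SpamFilter's implementation.

def decode_body(raw: bytes, charset: str) -> str:
    """Decode raw message bytes to Unicode text using the declared charset."""
    # Undecodable bytes become U+FFFD instead of raising an exception.
    return raw.decode(charset, errors="replace")

def matches_all(text: str, keywords: Iterable[str]) -> bool:
    """Return True if every keyword occurs in the text, ignoring case."""
    folded = text.casefold()
    return all(k.casefold() in folded for k in keywords)

phrase = "ЗАКАЖИТЕ БЕСПЛАТНЫЙ ОБРАТНЫЙ ЗВОНОК ПРЯМО СЕЙЧАС >>"

# The same text produces different byte sequences in different encodings.
raw_cp1251 = phrase.encode("windows-1251")
raw_utf8 = phrase.encode("utf-8")

# Assumed meaning of the keyword line "ПРЯМО,СЕЙЧАС": both words must appear.
keywords = ["ПРЯМО", "СЕЙЧАС"]

# Decoding with the declared charset makes the keywords match in both cases.
print(matches_all(decode_body(raw_cp1251, "windows-1251"), keywords))
print(matches_all(decode_body(raw_utf8, "utf-8"), keywords))

# Decoding with the wrong charset yields replacement characters, so no match.
print(matches_all(decode_body(raw_cp1251, "utf-8"), keywords))
```

The point of the sketch is only that matching happens on decoded text rather than on raw bytes, which is why a filter with Unicode keyword support can accept the Cyrillic words directly.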




-------------
Roberto Franceschetti

http://www.logsat.com - LogSat Software

http://www.logsat.com/sfi-spam-filter.asp - Spam Filter ISP


