Spamassassin rules, words with accent

Feb 8, 2022
6
0
6
38
Hello,
I have a general question about Spamassassin.

If I want to block terms with accents ( é, è , à..), what is the best way to proceed?

- Characters substitutions ( .+ ) or regex containing all possible accents ? (all possibles because spammer use variantes like é,è)

The problem of ".+" is that it will look for the characters before in the whole text, which inevitably can generate false positives.

- Should we double the rules in order to take into account different encodings?

Eg. of rule for "Veulliez débloquer votre accès" :

header LOCAL_SBJ_0008 Subject =~ /Veuillez d.+bloquer votre acc.+/i
score LOCAL_SBJ_0008 5.0
describe LOCAL_SBJ_0008 Sujet : Phishing bancaire

Thanks,
Best regards,
Philippe