Bayes training?

taurix · Sep 12, 2019

Hi all

I notice that bayes is not working well for me, mostly dutch language mails and real SPAM almost always gets a score of -1.9 (BAYES). Disabling it actually improves spam blocking...

Is there a way to train the filter or do I just use the default sa-learn on the CLI?

heutger · Sep 12, 2019

taurix said:
Hi all

I notice that bayes is not working well for me, mostly dutch language mails and real SPAM almost always gets a score of -1.9 (BAYES). Disabling it actually improves spam blocking...

Is there a way to train the filter or do I just use the default sa-learn on the CLI?

How did you got this scores? From bayes autolearn? You should get your users copy (not forward!) mails with spam to a spam folder/mailbox as well as ham mails, which got recognized as spam. Then use sa-learn to learn such messages. If you're on a linux based system and familiar with coding, I still look for someone, who could use Dovecot Sieve (there are tutorials therefor) to invoke a script, in this case invoke a remote sa-learn on the PMG system. That's why I'm asking, if it wouldn't be an idea to combine PMG with e.g. Kolab or sth. similar, which would make PMG a full mail solution. It would then be easier to adopt a direct spam and ham learning just by user actions.

guletz · Sep 12, 2019

Hi @heutger,

Maybe a stupid ideea ...

On the PMG you can create some kind of remote share who is mounted on your server where the users imbox's are. Then you can use this remote share for ham/spam global for all of your users. On the PMG you can use incrond daemon to watch any new created files/mails(new spam or ham) and on this event you can run any bash script like:
- run sa-learn spam/ham
- remove the mails who was learned

I do not think that you need a lot of I/O operations in your case at least

, so the share will not be need to be so fast!

Good luck!

heutger · Sep 12, 2019

guletz said:
Hi @heutger,

Maybe a stupid ideea ...

On the PMG you can create some kind of remote share who is mounted on your server where the users imbox's are. Then you can use this remote share for ham/spam global for all of your users. On the PMG you can use incrond daemon to watch any new created files/mails(new spam or ham) and on this event you can run any bash script like:
- run sa-learn spam/ham
- remove the mails who was learned

I do not think that you need a lot of I/O operations in your case at least , so the share will not be need to be so fast!

Good luck!

Maybe an idea as well, however, not really nice.

DerDanilo · Oct 5, 2019

Does anybody have a service setup to collect ham and spam from a mailcluster, build new bayes and distribute them on PMGs?

Backend:
- Dovecot
- Postfix
- Dovecot Director
- Dovecot LMTP
- Other Cluster stuff

heutger · Oct 8, 2019

DerDanilo said:
Does anybody have a service setup to collect ham and spam from a mailcluster, build new bayes and distribute them on PMGs?

Backend:
- Dovecot
- Postfix
- Dovecot Director
- Dovecot LMTP
- Other Cluster stuff

I haven't but you could use Sieve to script calls to PMG to learn spam.

Janko · Oct 12, 2019

I import the Mails via Rsync over SSH an put them through SA-Lern.

Code:

#!/bin/bash

rsync -av -e "ssh -p 8899" root@first.server.com:/var/qmail/mailnames/*/*/Maildir/.Spam/cur/ /var/spam/
rsync -av -e "ssh -p 8899" root@second.server.com:/var/qmail/mailnames/*/*/Maildir/.Spam/cur/ /var/spam/
rsync -av -e "ssh -p 8899" root@third.server.com:/var/qmail/mailnames/*/*/Maildir/.Spam/cur/ /var/spam/



sa-learn --dbpath /var/lib/amavis/.spamassassin --progress --spam /var/spam/
rm -rf /var/spam/*

Code:

#!/bin/bash

rsync -av -e "ssh -p 8899" root@first.server.com:/var/qmail/mailnames/*/*/Maildir/cur/ /var/ham/
rsync -av -e "ssh -p 8899" root@second.server.com:/var/qmail/mailnames/*/*/Maildir/cur/ /var/ham/
rsync -av -e "ssh -p 8899" root@third.server.com:/var/qmail/mailnames/*/*/Maildir/cur/ /var/ham/



sa-learn --dbpath /var/lib/amavis/.spamassassin --progress --ham /var/ham/
rm -rf /var/ham/*

This is for a Plesk Mail Server.

Search

Search

Bayes training?

taurix

Member

heutger

Famous Member

guletz

Distinguished Member

heutger

Famous Member

DerDanilo

Famous Member

heutger

Famous Member

Janko

Renowned Member

We value your privacy