so when a spam e-mail gets through to a mail box.
How do you teach the mail gateway that it was a spam message?
How do you teach the mail gateway that it was a spam message?
Can you confirm, when a spam message is held in the Quarantine, if the "Delete" button is pressed, does this run the message though sa-learn or the spamassassin reporting function before it deletes it?
if you want to fine tune your bayes filtering than that would be the way to goOtherwise, I'm probably better off approving it to my own mailbox, then feeding it back via my scripts to be reported / learned from...
I wonder if a good feature for the roadmap would be a 'Report' button to go along with the existing that would feed the message via `spamassassin -r` - then delete it?
depends on where you kept your bayes db in the old system - it could be as easy as enabling bayes filtering, copying it over (see https://spamassassin.apache.org/full/3.4.x/doc/sa-learn.html - the backup+restore option should work when run as root on PMG) and rebootingI don't have my old manual config of SA from my old filtering since I switched to PMG - but that had close to 10 years of filtering history... As such, its kinda hard to give solid figures either way...
depends on where you kept your bayes db in the old system - it could be as easy as enabling bayes filtering, copying it over (see https://spamassassin.apache.org/full/3.4.x/doc/sa-learn.html - the backup+restore option should work when run as root on PMG) and rebooting
that way you could compare the filtering without your db and with it
bayes_auto_learn 1
bayes_auto_learn_on_error 1
bayes_auto_learn_threshold_nonspam 0.1
bayes_auto_learn_threshold_spam 6.0
## Basic settings
required_hits 5.0
report_safe 0
## Bayesian scoring
score BAYES_00 -3.0
score BAYES_05 -2.0
score BAYES_20 -1.5
score BAYES_40 -1.0
score BAYES_50 0.8
score BAYES_60 1.5
score BAYES_80 2.0
score BAYES_90 3.0
score BAYES_99 4.0
So with my imported bayes db, Todays stats show:
View attachment 17138
As such, it seems to have really firmed up the middle ground compared to before the import...
At this stage, I haven't customised any of the weighting in the PMG setup.
v 3 db_version # this must be the first line!!!
v 1840 num_spam
v 11744 num_nonspam
From the backup of my old bayes db, I can gather the following:
Code:v 3 db_version # this must be the first line!!! v 1840 num_spam v 11744 num_nonspam
As such, its not really surprising that the detection is better now - as it feels in a history of over 13,000 messages...