spamassassin: unreasonable scores for BAYES_00 and BAYES_05 ?

hellfire

Renowned Member
Aug 17, 2016
79
47
83
46
In /usr/share/spamassassin/50_scores.cf are the following 2 lines:

Code:
score BAYES_00  0  0 -1.5   -1.9
score BAYES_05  0  0 -0.3   -0.5

Since I notice a some spam with image-only content(which is spam-rated for that reason), the bayes element of pmg adds high negative score and thus allows those mails to get through. I'm wondering if this is not a very reasonable default and this should be corrected?

(In my installation I reset the scores to 0, so it stays neutral and only negatives results matter).
 
Last edited:
I think the issue here is that your bayes filter is trained wrong - I would suggest to simply disable bayes filtering (GUI->Configuration->Spam Detector->Options)

I hope this helps!
 
Hmmm. I wonder if I did not understand the documentation right:

Page 2 PMG - Documentation

Bayesian Filter - Automatically trained statistical filters
Certain words have a higher probability of occurring in spam emails than in legitimate emails. By being
trained to recognize those words, the Bayesian filter checks every email and adjusts the probabilities
of it being a spam word or not in its database. This is done automatically.
I assume this works like that: When ever a spam message is detected by other means of the spam detection, the Bayesian Filter is trained so that the figured out bad message is used for training the bayesian filter automatically.

I did not do any manual training myself.

Is my understanding correctly here?
 
Last edited:
I assume this works like that: When ever a spam message is detected by other means of the spam detection, the Bayesian Filter is trained that the figured out bad message is used for training the bayesian filter automatically.
The documentation needs an update - we're currently planning on disabling the autolearn feature of Bayes in spamassassin (which is something I'd rather do on a major version change to give it more visibilty) - mostly due to our experience with it in practice (from reports here and from our enterprise users) - the rationale is quite well summed up in this thread:
https://lists.apache.org/thread/0w4kp9vhmlqjp424oxf1qtn3t09c97bc

So - I would suggest to disable bayes and see how it works in your case (I would expect better than with it enabled)

I hope this explains it!
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!