[Gross] N-way cluster

Jesse Thompson jesse.thompson at doit.wisc.edu
Fri Apr 30 17:30:16 EEST 2010


On 04/30/2010 03:25 AM, Alvaro Marín wrote:
> Hi all,
>
> new to this list :)
> Searching for scripts/programs to apply greylisting to dynamic ranges I
> found MARBL, and then I found gross. I've test it and runs fine!
>
> Now, I want to implement it on my production environment, with 13
> antispam servers, but I've read about "problems" with replication with
> clusters with more than 2 servers and this message:
>
> https://lists.utu.fi/pipermail/gross/2008/000142.html
>
> but I suposse that it'll be ready for the v2 of gross, with the new
> architecture:
>
> http://code.google.com/p/gross/wiki/VersionTwoArchitecture
>
> and not for this v1. Am I rigth?

If I interpret Eino's plan correctly, n-node clustering will benefit you 
in terms of scalability, not fault tolerance.  You will still be 
dependent on the 2 master servers being available for the service to 
function.  So, the only reason you would need n-node clustering is if 
your load is so high that 2 nodes can't handle the traffic.  However, if 
the 2 nodes can handle your peak traffic, then you don't actually need 
the n-node cluster.

I'd be willing to bet that 2 nodes could handle your traffic just fine. 
  We had one day where a misconfigured spam bot attempted to send 100 
million messages.  Gross greylisted them all.  I didn't see any adverse 
effects from this load.  I didn't even know about it until I looked at 
our daily message volume report the following day.

Also, keep in mind that you may not need to process as many messages as 
you think.  Greylisting causes a dampening effect on your overall 
volumes.  Before gross we were peaking at 10 million messages/day.  Now 
we are down to a measly 3 million (with only 15% of the messages that 
make it past gross being spam.)


> Another question...I read on the FAQ:
>
> ===
> Q: Are there risks of database problems?
>
> Answer: [...] If you get more than 10 million messages a day you might
> need to adjust the default values.
> ===
>
> I receive ~50mill/day so I've to adjust the default values...should I
> increase these ones?
>
> filter_bits=24
> number_buffers=8

We ran into false matches, and I do remember changing those settings. 
According to the doc: "lowering filter_bits will increase the 
probability of false matches".  We are at 22, which is lower than the 
default of 24.  Perhaps the default value used to be lower than 22?

Or, I think that you could raise number_buffers to decrease the 
probability of false matches.  We do have this setting raised to 12 
(from the default of 8)

I hope this helps!

Jesse

>
> Thank you!
> Regards,
>

-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 3317 bytes
Desc: S/MIME Cryptographic Signature
URL: <http://lists.utu.fi/pipermail/gross/attachments/20100430/dec97fc1/attachment.p7s>


More information about the Gross mailing list