Red Hat 5.8 server, 5.5 clients. Xymon 4.3.7 clients and server.
I am trying to put together a quick cluster monitor using depend and combos, but I think I'm stuck. Has anyone else done this?
I have two servers: smtp01 and smtp02 that are in an active/passive cluster.
The smtpd process only runs on the server that is active, but there are other processes that need to be monitored on both hosts (e.g. clurgmgrd).
I would like to go red and have an alert if one of the required processes goes down on either server, but I only want an alert if the active node of the cluster has a problem with the smtpd process.
My initial idea was to have procs go red for clurgmgrd and yellow on smtpd, use NOPROPYELLOW, and then use a combo so I could see what host was active.
hosts.cfg:
1.2.3.4 smtp01 # NOPROPYELLOW:procs
1.2.3.5 smtp02 # NOPROPYELLOW:procs
1.2.3.6 smtpHA # smtp
analysis.cfg:
HOST=smtp01
PROC clurgmgrd
PROC smtpd
HOST=smtp02
PROC clurgmgrd
PROC smtpd
combo.cfg:
smtpHA.procs = (smtp01.conn && smtp01.procs) || (smtp02.conn && smtp02.procs)
Unfortunately a yellow status is equal to a 1 for the combo, so even if both sides of the cluster were down, the combo would still show as up (green&&yellow is 1&&1).
Then I thought I could cover the scenario by making a "depends" test, but depends seems to be focused on disabling tests if another status is red.which doesn't work either since I don't want the page to go red.
This isn't a huge issue because the smtp poll on smtpHA will trigger red if both nodes are down, but it seems like there should be an easy way to do this.
Am I making it more complicated than it needs to be, or am I better writing a custom monitor?
Thanks!
-dan