8 Oct
2010
8 Oct
'10
2:40 a.m.
Hi all
The alerting is starting to take shape but I've a question regarding how the alerting works. If I have a stanza similar to the following, how is it evaluated? Once for all hosts, or for one host at a time?
HOST=%.* # Proliant tests MAIL sms at somecompany.com SERVICE=proliant FORMAT=SMS REPEAT=1440m MAIL sms at somecompany.com SERVICE=proliant FORMAT=SMS RECOVERED
# conn where status is RED
MAIL sms at somecompany.com COLOR=red SERVICE=conn EXPAGE=dev REPEAT=1440m
MAIL sms at somecompany.com COLOR=red SERVICE=conn EXPAGE=dev RECOVERED
# conn where status is RED (dev/test)
MAIL email at somecompany.com COLOR=red SERVICE=conn PAGE=dev REPEAT=1440m
MAIL email at somecompany.com COLOR=red SERVICE=conn PAGE=dev RECOVERED
# cpu,disk,memory where status is RED
MAIL sms at somecompany.com COLOR=red SERVICE=cpu,disk,memory
EXPAGE=dev REPEAT=1440m MAIL sms at somecompany.com COLOR=red SERVICE=cpu,disk,memory EXPAGE=dev RECOVERED
# Dev servers
MAIL email at somecompany.com COLOR=red SERVICE=cpu,disk,memory
PAGE=dev REPEAT=1440m MAIL email at somecompany.com COLOR=red SERVICE=cpu,disk,memory PAGE=dev RECOVERED
# Non-dev status YELLOW
MAIL email at somecompany.com COLOR=yellow
SERVICE=cpu,disk,memory REPEAT=1440m DURATION>30m MAIL email at somecompany.com COLOR=yellow SERVICE=cpu,disk,memory RECOVERED
Also, I've noticed that when a fault occurs I get two emails (or sms') and another when the fault is rectified. I'm thinking this is because of the 'RECOVERED' line but i thought this would only trigger when the fault goes. Have I misunderstood?
Thanks
CC
-- RHCE#805007969328369