No pages when going from yellow to red
On my AIX box running the Hobbit client I found that disk alarms aren't generated if the condition goes from yellow to red.
I have rules to send an email if it's yellow, and I always get that. I also have rules to send a page if the state is red, and I get those if it jumps from green to red. But if the state goes to yellow and then to red, the paging rule never fires. No entry is added to the notification log either.
I've mentioned this before on the list, but never got a definite response.
"Pat Vaughan" wrote on 02/11/2005 09:00:01 a.m.:
On my AIX box running the Hobbit client I found that disk alarms aren't generated if the condition goes from yellow to red.
I'm pretty sure I've seen the same issue, running 4.1.2 on Solaris 9 x86 for the server. I can't confirm it for certain, but I will keep an eye out for it next time.
I've also had an incident over the weekend where the status went:
green --> red --> yellow --> green
I got the page for the red alert, but did not get a recovery message. If the status goes:
green --> red --> green
I get both the page and the recovery.
It's my rostered week on call, so if any fixes need to be tested get them in quick!
Cheers, Andy.
On Tue, Nov 01, 2005 at 03:00:01PM -0500, Pat Vaughan wrote:
On my AIX box running the Hobbit client I found that disk alarms aren't generated if the condition goes from yellow to red.
I'll look into this problem.
Henrik
On Tue, Nov 01, 2005 at 10:25:12PM +0100, Henrik Stoerner wrote:
I'll look into this problem.
I've been trying to re-create this problem, but I do get the alerts I expect to get.
One thing that might confuse some people: The REPEAT setting counts "across colors". E.g. if you have REPEAT=30 (the default, 30 minutes between alerts), and the sequence of events goes like
22:05 Test goes yellow - alert (yellow) is sent
22:35 Test still yellow - repeat alert (yellow) is sent
22:45 Test goes red. No alert is sent because it is only 10 minutes since the last alert went out.
23:05 Test still red - now an alert (red) is sent.
Henrik
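The timeline Henrik describes can be reproduced with a small simulation. This is only a sketch of the behaviour as explained in his message, not Hobbit's actual code: the names simulate_alerts, at and TIMELINE are made up for illustration, and recovery (green) handling is deliberately left out.

```python
from datetime import datetime, timedelta


def at(hhmm):
    """Build a timestamp on an arbitrary day from an HH:MM string."""
    return datetime.strptime("2005-11-01 " + hhmm, "%Y-%m-%d %H:%M")


def simulate_alerts(events, repeat_minutes=30):
    """Return the (time, color) observations for which an alert goes out.

    Assumption taken from the explanation above: a single "last alert"
    timestamp is kept per test and shared by all colors, so a color
    change does not restart the REPEAT interval.
    """
    repeat = timedelta(minutes=repeat_minutes)
    last_alert = None
    sent = []
    for when, color in events:
        # Send only if nothing was sent yet, or the interval has elapsed.
        if last_alert is None or when - last_alert >= repeat:
            sent.append((when, color))
            last_alert = when
    return sent


# The sequence of events from the message above.
TIMELINE = [
    (at("22:05"), "yellow"),
    (at("22:35"), "yellow"),
    (at("22:45"), "red"),
    (at("23:05"), "red"),
]

for when, color in simulate_alerts(TIMELINE):
    print(when.strftime("%H:%M"), color)
```

With REPEAT=30 the 22:45 change to red is suppressed and the first red alert goes out at 23:05. With a very long repeat interval, the same logic would hold back the red alert for just as long.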
ACK! So what do we do if we want to get emails for yellow alerts and pages for red alerts, without getting repeat pages every x minutes? It seems that a red alert is usually pretty important, and we want to know about it immediately instead of waiting until the repeat time expires (which we set to 30d per a previous recommendation).
I would expect that a change in the state of a test would reset the REPEAT counter.
To unsubscribe from the hobbit list, send an e-mail to hobbit-unsubscribe at hswn.dk
Hi,
I had a similar problem last night.
A CPU alert was sent, but we never got the recovery message. We traced our sending script, and it was not called to send the recovery message. It looks like:
red -> green = recovery sent
red -> yellow -> green = no recovery sent
My config:
HOST=*
        SCRIPT /usr2/hobbit/server/sendit.sh SMS COLOR=red,purple REPEAT=15m DURATION>6m RECOVERED FORMAT=sms
Regards
On Wed, 2005-11-02 at 14:04 -0500, Pat Vaughan wrote:
ACK! So what do we do if we want to get emails for yellow alerts and pages for red alerts, without getting repeat pages every x minutes? It seems that a red alert is usually pretty important, and we want to know about it immediately instead of waiting until the repeat time expires (which we set to 30d per a previous recommendation).
I would expect that a change in the state of a test would reset the REPEAT counter.
-- Etienne Roulland -- CVF Bordeaux