Hi,
So I installed RC2 this morning. Later on, I noticed an alert email for a monitor going into yellow. I had disabled this previously with --alertcolors=red,purple in hobbitlaunch.cfg. Here's the snippet:
[hobbitd] HEARTBEAT ENVFILE /opt/bb/server/etc/hobbitserver.cfg CMD hobbitd --restart=$BBTMP/hobbitd.chk --checkpoint-file=$BBTMP/hobbit d.chk --checkpoint-interval=600 --purple-conn=conn --log=$BBSERVERLOGS/hobbitd.l og --admin-senders=127.0.0.1,$BBSERVERIP --alertcolors=red,purple
And here's the alert I just received:
im68:cpu yellow [-1] yellow Mon Feb 14 13:13:56 PST 2005 up: 208 day(s), 1 users, 115 procs, load=529
LOAD AVG on im68 is 529
Any ideas?
-- Bruce Z. Lysik <blysik at shutterfly.com> Operations Engineer
On Mon, Feb 14, 2005 at 01:28:28PM, Bruce Lysik wrote:
Hi,
So I installed RC2 this morning. Later on, I noticed an alert email for a monitor going into yellow. I had disabled this previously with --alertcolors=red,purple in hobbitlaunch.cfg. Here's the snippet:
[hobbitd] HEARTBEAT ENVFILE /opt/bb/server/etc/hobbitserver.cfg CMD hobbitd --restart=$BBTMP/hobbitd.chk --checkpoint-file=$BBTMP/hobbit d.chk --checkpoint-interval=600 --purple-conn=conn --log=$BBSERVERLOGS/hobbitd.l og --admin-senders=127.0.0.1,$BBSERVERIP --alertcolors=red,purple
And here's the alert I just received:
im68:cpu yellow [-1] yellow Mon Feb 14 13:13:56 PST 2005 up: 208 day(s), 1 users, 115 procs, load=529
What do you get when you run the following test as hobbit user?
cd ~hobbit/server ./bin/bbcmd --test <FQDN of im68> cpu
LOAD AVG on im68 is 529
Any ideas?
-- Bruce Z. Lysik <blysik at shutterfly.com> Operations Engineer
To unsubscribe from the hobbit list, send an e-mail to hobbit-unsubscribe at hswn.dk
-- Asif Iqbal PGP Key: 0xE62693C5 KeyServer: pgp.mit.edu "...it said: Install Windows XP or better...so I installed Solaris..."
On Mon, Feb 14, 2005 at 06:44:37PM, Asif Iqbal wrote:
On Mon, Feb 14, 2005 at 01:28:28PM, Bruce Lysik wrote:
Hi,
So I installed RC2 this morning. Later on, I noticed an alert email for a monitor going into yellow. I had disabled this previously with --alertcolors=red,purple in hobbitlaunch.cfg. Here's the snippet:
[hobbitd] HEARTBEAT ENVFILE /opt/bb/server/etc/hobbitserver.cfg CMD hobbitd --restart=$BBTMP/hobbitd.chk --checkpoint-file=$BBTMP/hobbit d.chk --checkpoint-interval=600 --purple-conn=conn --log=$BBSERVERLOGS/hobbitd.l og --admin-senders=127.0.0.1,$BBSERVERIP --alertcolors=red,purple
And here's the alert I just received:
im68:cpu yellow [-1] yellow Mon Feb 14 13:13:56 PST 2005 up: 208 day(s), 1 users, 115 procs, load=529
What do you get when you run the following test as hobbit user?
cd ~hobbit/server ./bin/bbcmd --test <FQDN of im68> cpu
oops I meant
./bin/bbcmd hobbitd_alert --test FQDN cpu
LOAD AVG on im68 is 529
Any ideas?
-- Bruce Z. Lysik <blysik at shutterfly.com> Operations Engineer
To unsubscribe from the hobbit list, send an e-mail to hobbit-unsubscribe at hswn.dk
-- Asif Iqbal PGP Key: 0xE62693C5 KeyServer: pgp.mit.edu "...it said: Install Windows XP or better...so I installed Solaris..."
To unsubscribe from the hobbit list, send an e-mail to hobbit-unsubscribe at hswn.dk
-- Asif Iqbal PGP Key: 0xE62693C5 KeyServer: pgp.mit.edu "...it said: Install Windows XP or better...so I installed Solaris..."
On Mon, Feb 14, 2005 at 01:28:28PM -0800, Bruce Lysik wrote:
So I installed RC2 this morning. Later on, I noticed an alert email for a monitor going into yellow. I had disabled this previously with --alertcolors=red,purple in hobbitlaunch.cfg.
[config from another mail]
HOST=<snip list of about 100 hosts> SCRIPT /opt/bb/server/ext/email bruce_mail FORMAT=SCRIPT REPEAT=30 DURATION>6 RECOVERED
And here's the alert I just received:
im68:cpu yellow [-1] yellow Mon Feb 14 13:13:56 PST 2005 up: 208 day(s), 1 users, 115 procs, load=529
The alert you show here looks like a recovery-notice (the "-1" I assume is the acknowledgment cookie, and this value indicates that there is no active alert).
If you look in the ~/data/ack/notifications.log file for these notifications, you can tell if it's an alert message or a recovery message by the number of columns in the file. E.g. in my log I have
Wed Feb 16 13:08:43 2005 www.sslug.dk.smtp (130.228.2.150) henrik at hswn.dk 1108555723 725 Wed Feb 16 13:09:44 2005 www.sslug.dk.smtp (130.228.2.150)henrik at hswn.dk 1108555784 725 61
The first one is the alert message, the second is the recovery message. The recovery has an extra field "61", which is the duration of the event (in seconds).
Could you check the following in hobbitlaunch.cfg:
- The "hobbitd" command has "--alertcolors=red,purple --okcolors=green"
- The "hobbitd_alert" command has "--alertcolors=red,purple"
This setup should give you alerts when a status is red (or purple), and recovery notices only when they go green (after being red or purple).
Regards, Henrik
participants (3)
-
blysik@shutterfly.com
-
henrik@hswn.dk
-
iqbala-hobbit@qwestip.net