Hi,
is there a fix for the problem, meanwhile? We have the same situation, here. A filesystem on one of our mail servers ran full, but the admins didn't get an alert from hobbit. The server is listed in our bb-hosts file on the "mail" page with an IP of "0.0.0.0". I found out that only the hosts with a real IP address will get an alert. I could find a workaround: if I add the option "prefer" to the host entry with the "0.0.0.0" address, the host gets an alert. But if there are more entries of the same host on different pages, only the first one with the "prefer" option is recognized by the alert function.
So, if we have the following bb-hosts file:
page linux 123.456.78.9 my.mail.host # ... page mail 0.0.0.0 my.mail.host # noconn page redhat 0.0.0.0 my.mail.host # noconn prefer
and I define alerts for all three pages, the alert only works for page "linux" and page "redhat". The host on page "mail" is being ignored.
Please let me know if there is a solution for the problem.
-- Regards,
Dirk Kastens Universitaet Osnabrueck, Rechenzentrum (Computer Center) Albrechtstr. 28, 49069 Osnabrueck, Germany Tel.: +49-541-969-2347, FAX: -2470
On Wed, Jul 18, 2007 at 09:05:23AM +0200, Dirk Kastens wrote:
is there a fix for the problem, meanwhile? We have the same situation, here. A filesystem on one of our mail servers ran full, but the admins didn't get an alert from hobbit.
James and I managed to track down the cause of his problems, and it turned out to be a configuration problem - specifically, the way the DURATION parameter in hobbit-alerts.cfg works.
James had used the DURATION setting to limit the number of alerts sent, by using "DURATION<30" to only send alerts for 30 minutes. Also, there was one group of people receiving yellow alerts, and another group receiving red alerts. His setup was like this:
HOST=foo SERVICE=disk
MAIL adam at foo.com COLOR=yellow DURATION<30
MAIL brian at foo.com COLOR=red DURATION<30
If the "disk" status went yellow at 6PM and red at 7PM, then brian at foo.com didn't receive any notification.
That's because the DURATION value is counted from the start of the event, which begins when the status goes yellow. So by 7 PM the event has a duration of 60 minutes, which is above the 30-minute threshold - so the red alert was suppressed.
The server is listed in our bb-hosts file on the "mail" page with an IP of "0.0.0.0". I found out that only the hosts with a real IP address will get an alert.
The IP in bb-hosts has nothing to do with alerts.
page linux 123.456.78.9 my.mail.host # ... page mail 0.0.0.0 my.mail.host # noconn page redhat 0.0.0.0 my.mail.host # noconn prefer
and I define alerts for all three pages, the alert only works for page "linux" and page "redhat". The host on page "mail" is being ignored.
This is a different problem. Each host has a "primary" page - only one! It's the first page that defines it (that would be "linux"), except if you use the "prefer" keyword then it is of course the page that has the preferred definition of the host ("redhat", in your example). If you're unsure of what page Hobbit uses as the primary page, then check it on the "info" status page.
Regards, Henrik
Henrik Stoerner wrote:
page linux 123.456.78.9 my.mail.host # ... page mail 0.0.0.0 my.mail.host # noconn page redhat 0.0.0.0 my.mail.host # noconn prefer
and I define alerts for all three pages, the alert only works for page "linux" and page "redhat". The host on page "mail" is being ignored.
This is a different problem. Each host has a "primary" page - only one! It's the first page that defines it (that would be "linux"), except if you use the "prefer" keyword then it is of course the page that has the preferred definition of the host ("redhat", in your example). If you're unsure of what page Hobbit uses as the primary page, then check it on the "info" status page.
Yes, I know. I wrote, that this is only a workaround for the missing alert. When I leave out the "prefer" statement in the above example, the alert only works for the host on the "linux" page. I tested my configuration with the (patched) hobbitd_alert program:
$ bbcmd hobbitd_alert --test my.mail.host --color=red disk
Matching host:service:page 'my.mail.host:--color=red:linux' *** Match with 'PAGE=linux' *** Matching host:service:page 'my.mail.host:--color=red:linux' Failed 'PAGE=mail' (pagename not in include list)
When I add the "prefer" option to the host on the mail page, I get
Matching host:service:page 'my.mail.host:--color=red:mail Failed 'PAGE=linux' (pagename not in include list) Matching host:service:page 'my.mail.host:--color=red:mail *** Match with 'PAGE=mail' ***
So, what can I do to get the alert function working for BOTH pages?
-- Regards,
Dirk Kastens Universitaet Osnabrueck, Rechenzentrum (Computer Center) Albrechtstr. 28, 49069 Osnabrueck, Germany Tel.: +49-541-969-2347, FAX: -2470
On Wed, Jul 18, 2007 at 09:51:45AM +0200, Dirk Kastens wrote:
So, what can I do to get the alert function working for BOTH pages?
This functionality is in the current snapshot, but not in the 4.2.0 version.
Snapshots are available at http://www.hswn.dk/beta/
Regards, Henrik
This functionality is in the current snapshot, but not in the 4.2.0 version.
Aah, thanks. That's what I wanted to know :-)
-- Viele Gruesse,
Dirk Kastens Universitaet Osnabrueck, Rechenzentrum (Computer Center) Albrechtstr. 28, 49069 Osnabrueck, Germany Tel.: +49-541-969-2347, FAX: -2470
Dirk Kastens wrote:
This functionality is in the current snapshot, but not in the 4.2.0 version.
Aah, thanks. That's what I wanted to know :-)
I can confirm that the alerting now works as expected. But the information on the info pages is wrong. The "Alerting:" section only shows the correct info for the preferred host entries. When I define an alert for a page with a secondary host entry (IP 0.0.0.0) the info page of the host says "No alerts defined".
-- Regards,
Dirk Kastens Universitaet Osnabrueck, Rechenzentrum (Computer Center) Albrechtstr. 28, 49069 Osnabrueck, Germany Tel.: +49-541-969-2347, FAX: -2470
I'm not that familiar with RRD. I've loaded it and set it up and it works just fine.
However, some of the developers here have asked me if they could get a dump of the data instead of the graphs.
I've used the rrdtool to do a dump as well as using xml format, but the output isn't in a format the developers want.
I'm wondering if anyone's written any scripts or even an html based cgi script to output rrd data in a more readable format; perhaps associated with the data information: cpu, memory, etc...,
Thanks....James
James,
I've modified the view graph of Stefan tools V0.2.0 adminscripts http://www.fh-augsburg.de/~henk/hobbit/ to dump RRD. You can view the two files code attached.
Put rrdtocsv.sh and adminscripts_functions2.sh in your cgi-bin directory.
Regards,
Mario.
On 7/18/07, James Wade <jkwade at futurefrontiers.com> wrote:
I'm not that familiar with RRD. I've loaded it and set it up and it works just fine.
However, some of the developers here have asked me if they could get a dump of the data instead of the graphs.
I've used the rrdtool to do a dump as well as using xml format, but the output isn't in a format the developers want.
I'm wondering if anyone's written any scripts or even an html based cgi script to output rrd data in a more readable format; perhaps associated with the data information: cpu, memory, etc...,
Thanks....James
To unsubscribe from the hobbit list, send an e-mail to hobbit-unsubscribe at hswn.dk
There is actually a V0.3.0b that has quite a few updates (path changes primarily) that you may want to ensure it works with as well. Stefan has been pretty receptive to updates.
=G=
From: mario andre [mailto:rower.master at gmail.com] Sent: Thursday, July 19, 2007 3:06 PM To: hobbit at hswn.dk Subject: Re: [hobbit] RRD Dump
James,
I've modified the view graph of Stefan tools V0.2.0 adminscripts http://www.fh-augsburg.de/~henk/hobbit/ to dump RRD. You can view the two files code attached.
Put rrdtocsv.sh and adminscripts_functions2.sh in your cgi-bin directory.
Regards,
Mario.
On 7/18/07, James Wade <jkwade at futurefrontiers.com> wrote:
I'm not that familiar with RRD. I've loaded it and set it up and it works just fine.
However, some of the developers here have asked me if they could get a dump of the data instead of the graphs.
I've used the rrdtool to do a dump as well as using xml format, but the output isn't in a format the developers want.
I'm wondering if anyone's written any scripts or even an html based cgi script to output rrd data in a more readable format; perhaps associated with the data information: cpu, memory, etc...,
Thanks....James
To unsubscribe from the hobbit list, send an e-mail to hobbit-unsubscribe at hswn.dk
James,
I've modified the view graph of Stefan tools V0.2.0 adminscripts http://www.fh-augsburg.de/~henk/hobbit/ to dump RRD. Let me know if you want this files so I can send to your email.
Regards,
Mario.
On 7/18/07, James Wade <jkwade at futurefrontiers.com> wrote:
I'm not that familiar with RRD. I've loaded it and set it up and it works just fine.
However, some of the developers here have asked me if they could get a dump of the data instead of the graphs.
I've used the rrdtool to do a dump as well as using xml format, but the output isn't in a format the developers want.
I'm wondering if anyone's written any scripts or even an html based cgi script to output rrd data in a more readable format; perhaps associated with the data information: cpu, memory, etc...,
Thanks....James
To unsubscribe from the hobbit list, send an e-mail to hobbit-unsubscribe at hswn.dk
On Wed, Jul 18, 2007 at 03:20:30PM +0200, Dirk Kastens wrote:
Dirk Kastens wrote:
This functionality is in the current snapshot, but not in the 4.2.0 version.
Aah, thanks. That's what I wanted to know :-)
I can confirm that the alerting now works as expected. But the information on the info pages is wrong.
One small oversight on my part. Apply this patch on top of the snapshot.
Henrik
Henrik Stoerner wrote:
One small oversight on my part. Apply this patch on top of the snapshot.
Great! It works! Thanks for your support.
-- Regards,
Dirk Kastens Universitaet Osnabrueck, Rechenzentrum (Computer Center) Albrechtstr. 28, 49069 Osnabrueck, Germany Tel.: +49-541-969-2347, FAX: -2470
On Wednesday 18 July 2007 07:04, Henrik Stoerner wrote:
On Wed, Jul 18, 2007 at 09:51:45AM +0200, Dirk Kastens wrote:
So, what can I do to get the alert function working for BOTH pages?
This functionality is in the current snapshot, but not in the 4.2.0 version.
Snapshots are available at http://www.hswn.dk/beta/
This sounds awesome, this was something that I was really looking forward to in Hobbit, but was dispondant when I found out that hobbit-alertd only saw a server on one page, regardless of aliases. So thank you !!
Right now I am on 4.2.0 with the all-in-one patch. I am just curious whether all these patches will be included in a new all-in-one or a new release ? I am not try to push for either, just looking for a loose ETA. I am starting my initial migration, and this feature would be a life saver.
Thanks, ~Steve
participants (6)
-
Dirk.Kastens@uni-osnabrueck.de
-
Galen.Johnson@sas.com
-
henrik@hswn.dk
-
jkwade@futurefrontiers.com
-
rower.master@gmail.com
-
s_aiello@comcast.net