Occasional RRD error message
Sent this to the list last Thursday but it hasn't appeared ...
Since upgrading to xymon 4.3.5, I occasionally see this message in the rrd-status.log :-
2011-11-10 00:45:53 Whoops ! Failed to send message (select(2) failed) 2011-11-10 00:45:53 -> Select failure while sending to Xymon daemon at x.x.x.x:1984 2011-11-10 00:45:53 -> Recipient 'x.x.x.x', timeout 15 2011-11-10 00:45:53 -> 1st line: 'config hosts.cfg' 2011-11-10 00:45:53 Cannot load hosts.cfg from xymond, code 6 2011-11-10 00:45:53 Failed to load from xymond, reverting to file-load
What is the problem here?
Regards, Chris Morris
The information contained in this email is intended only for the use of the intended recipient at the email address to which it has been addressed. If the reader of this message is not an intended recipient, you are hereby notified that you have received this document in error and that any review, dissemination or copying of the message or associated attachments is strictly prohibited. If you have received this email in error, please contact the sender by return email or call 01793 877777 and ask for the sender and then delete it immediately from your system.Please note that neither the RWE Group of Companies nor the sender accepts any responsibility for viruses and it is your responsibility to scan attachments (if any).
Chris, I saw your post last week. Sometimes I don't see my own posts in Gmail until someone replies.
I dunno what the problem is, I'm on an older version. I'm guessing xymond is getting busy with something else and can't respond within 15 seconds. Anything in xymond log?
Nothing in any other logs to assist.
From: Jeremy Laidman [mailto:jlaidman at rebel-it.com.au]
Sent: 14 November 2011 19:24
To: Morris, Chris (SS-IS)
Cc: xymon at xymon.com
Subject: Re: [Xymon] Occasional RRD error message
Chris, I saw your post last week. Sometimes I don't see my own
posts in Gmail until someone replies.
I dunno what the problem is, I'm on an older version. I'm
guessing xymond is getting busy with something else and can't respond within 15 seconds. Anything in xymond log?
The information contained in this email is intended only for the use of the intended recipient at the email address to which it has been addressed. If the reader of this message is not an intended recipient, you are hereby notified that you have received this document in error and that any review, dissemination or copying of the message or associated attachments is strictly prohibited. If you have received this email in error, please contact the sender by return email or call 01793 877777 and ask for the sender and then delete it immediately from your system.Please note that neither the RWE Group of Companies nor the sender accepts any responsibility for viruses and it is your responsibility to scan attachments (if any).
On Tue, Nov 15, 2011 at 7:55 PM, <Chris.Morris at rwe.com> wrote:
Nothing in any other logs to assist.
First of all, I don't believe this is likely to cause faults. As I understand it, if the config file can't be retrieved from xymond, it will be fetched directly (from the filesystem) instead.
How often are you seeing the message?
What do you get if you run this: xymon 127.0.0.1 'config hosts.cfg'
You should get xymond's idea of the hosts.cfg file. I'm wondering if xymond is working hard at times, and the timeout is a tad too short in such cases. Perhaps check your CPU usage at the times you see failures (eg using sar).
You could try enabling "--debug" mode in tasks.cfg on the xymond_channel entry that runs xymond_rrd, and/or on the xymond_rrd worker module, and see if anything interesting comes out.
Cheers Jeremy
Not a problem just a question really.
It has happened 6 times since 8th October at different times during the day.
Xymon 127.0.0.1 'config hosts.cfg' produces expected results. System doesn't seem overly busy at the times this occurs.
Thanks.
-----Original Message----- From: Jeremy Laidman [mailto:jlaidman at rebel-it.com.au] Sent: 15 November 2011 23:11 To: Morris, Chris (SS-IS) Cc: xymon at xymon.com Subject: Re: [Xymon] Occasional RRD error message
On Tue, Nov 15, 2011 at 7:55 PM, <Chris.Morris at rwe.com> wrote:
Nothing in any other logs to assist.
First of all, I don't believe this is likely to cause faults. As I understand it, if the config file can't be retrieved from xymond, it will be fetched directly (from the filesystem) instead.
How often are you seeing the message?
What do you get if you run this: xymon 127.0.0.1 'config hosts.cfg'
You should get xymond's idea of the hosts.cfg file. I'm wondering if xymond is working hard at times, and the timeout is a tad too short in such cases. Perhaps check your CPU usage at the times you see failures (eg using sar).
You could try enabling "--debug" mode in tasks.cfg on the xymond_channel entry that runs xymond_rrd, and/or on the xymond_rrd worker module, and see if anything interesting comes out.
Cheers Jeremy
The information contained in this email is intended only for the use of the intended recipient at the email address to which it has been addressed. If the reader of this message is not an intended recipient, you are hereby notified that you have received this document in error and that any review, dissemination or copying of the message or associated attachments is strictly prohibited. If you have received this email in error, please contact the sender by return email or call 01793 877777 and ask for the sender and then delete it immediately from your system.Please note that neither the RWE Group of Companies nor the sender accepts any responsibility for viruses and it is your responsibility to scan attachments (if any).
participants (2)
-
Chris.Morris@rwe.com
-
jlaidman@rebel-it.com.au