Thanks Tom, that got me sorted. It seems that Solaris relies on the BOOT_TIME record held in the /var/adm/utmpx file. This file has been rotated out of the way in order to save disk space so I got no uptime values at all. It looks like this messed with the load average data since the uptime output didn't have any uptime in it.
Like you I have hacked the hobbitclient-sunos.sh file and put in a small perl scriptlet so that if there is no uptime it adds a fake value in just to ensure that the load averages get stored properly.
|\/|
From: Brand, Thomas R. [mailto:TRBrand at cvs.com] Sent: 13 October 2010 17:53 To: xymon at xymon.com Subject: RE: [xymon] CPU load average not being graphed for some servers
From: Ward, Martin [mailto:Martin.Ward at colt.net]
Hi all,
I'm having trouble tracking this one down. For many of my Xymon-monitored servers I can see the graphs on the CPU load page and they have data in them. Yet for many others the CPU graphs are empty. I can see the graphs but there is no data.
Investigation reveals that for those servers who are missing data the .cpu file has not been updated in months, yet when I look at the client data available it shows the load average in the uptime section, as in:
[uptime]
5:07pm 2 users, load average: 0.53, 0.71, 0.62
which is, I believe, where Xymon gets the LA from.
Martin,
This may be related to an issue I came across last year.
The (cpu load and users & processes) graphs appear to be dependent on the exact output of the 'uptime' command.
In my case, the graphs did show for the first 24 hours of uptime, didn't show after 24 hours, then started showing again after 48 hours of uptime.
I was finally able to isolate this to the output of 'uptime' for the first full day - it showed 'day' not 'days'.
uptime
12:41pm up 196 days 21:49, 5 users, load average: 0.23, 0.19, 0.17
uptime
12:41pm up 1 day 21:49, 2 users, load average: 0.06, 0.02, 0.00
I 'fixed' this by updating hobbitclient-linux.sh and modified the output of the 'uptime' command replacing 'day ' with 'days '.
echo "[uptime]"
uptime | perl -pe "s/^(.*) day (.*)/\1 days \2/"
It appears your uptime command does not show the 'up x days HH:MM'.
Cheers,
Tom Brand
[Colt Disclaimer] The message is intended for the named addressee only and may not be disclosed to or used by anyone else, nor may it be copied in any way. The contents of this message and its attachments are confidential and may also be subject to legal privilege. If you are not the named addressee and/or have received this message in error, please advise us by e-mailing abuse at colt.net and delete the message and any attachments without retaining any copies. Internet communications are not secure and Colt does not accept responsibility for this message, its contents nor responsibility for any viruses. No contracts can be created or varied on behalf of Colt Technology Services, its subsidiaries, group companies or affiliates ("Colt") and any other party by email communications unless expressly agreed in writing with such other party. Please note that incoming emails will be automatically scanned to eliminate potential viruses and unsolicited promotional emails. For more information refer to www.colt.net or contact us on +44(0)20 7390 3900