Hi all,
I need some help/suggestions to figure out why my "cpu load" and "users & processes" graphs stop updating about 24 hours after the systems reboot. The updates stop for anywhere from 12 to 24 hours, then simply start back up again. Only the "CPU load" and the "Users and Processes" graphs are having the problem; disk, memory, cpu utilization, network traffic don't miss a beat.
We have a number of identically configured systems, they all reboot around 00:30 local time (they are in different time zones) Wednesday mornings. And they all stop reporting cpu-load/users-and-processes graphs sometime Thursday mornings and then start up again Thursday afternoon/Friday morning. They don't all stop/start at exactly the same time, but the majority do stop/start at the same time. I end up with about a 24 hour gap in my graphs every week.
The rrd files are all updated except for the la.rrd, procs.rrd, and users.rrd: -rw-r--r-- 1 hobbit hobbit 19552 Jan 22 16:56 clock.rrd -rw-r--r-- 1 hobbit hobbit 38536 Jan 22 16:56 disk,cvsrx.rrd -rw-r--r-- 1 hobbit hobbit 38536 Jan 22 16:56 disk,root.rrd -rw-r--r-- 1 hobbit hobbit 38536 Jan 22 16:56 ifstat.eth0.rrd -rw-r--r-- 1 hobbit hobbit 19552 Jan 22 00:38 la.rrd -rw-r--r-- 1 hobbit hobbit 19552 Jan 22 16:56 memory.actual.rrd -rw-r--r-- 1 hobbit hobbit 19552 Jan 22 16:56 memory.real.rrd -rw-r--r-- 1 hobbit hobbit 19552 Jan 22 16:56 memory.swap.rrd -rw-r--r-- 1 hobbit hobbit 57520 Jan 22 16:56 mysql.rrd -rw-r--r-- 1 hobbit hobbit 304312 Jan 22 16:56 netstat.rrd -rw-r--r-- 1 hobbit hobbit 19552 Jan 22 00:38 procs.rrd -rw-r--r-- 1 hobbit hobbit 19552 Jan 22 16:57 tcp.conn.rrd -rw-r--r-- 1 hobbit hobbit 19552 Jan 22 16:59 tcp.ssh.rrd -rw-r--r-- 1 hobbit hobbit 19552 Jan 22 00:38 users.rrd -rw-r--r-- 1 hobbit hobbit 323296 Jan 22 16:56 vmstat.rrd
I've restarted the hobbit client and the hobbit server; no help.
Any pointers/suggestions would be very welcome!
Tom
Tom Brand CVS/pharmacy