purple status for one server and hobbitfetch?
I have one server that keeps coming up purple and I can't get rid of it. The ping is working and it alerts when it goes red, but it just stays purple all the time.
When I look at the conn page it shows "Status message received from hobbitd" rather than "Status message received from 172.16.225.176".
Also, as a test, someone once set up hobbit on this machine to see how it worked without disturbing the real hobbit server. Since then the hobbit server software has been removed from the machine in question, and it is now just a client.
I have tried dropping the conn column, dropping the whole server, and removing all files with the server name in hist and histlogs. It still comes back purple, and it is the only one that does.
I can't find any file in server/etc that has its name or IP.
Any thoughts on why?
Also, we have clients in a DMZ that we want monitored, and we are using hobbitfetch for that. Now that I have fixed the core dumping of hobbitfetch and it stays up, its status is still purple. Does anyone know how to get it out of purple? Is it failing to report a status to something? I was trying to figure out how the status of the daemons is determined and haven't found it yet, but my guess is that hobbitfetch isn't reporting something it should.
Thanks,
Cade
> I have one server that keeps coming up purple and I can't get rid of it. The ping is working and it alerts when it goes red, but it just stays purple all the time.
> When I look at the conn page it shows "Status message received from hobbitd" rather than "Status message received from 172.16.225.176".
Can you access this machine and determine what its server is?
Since you stated that there is nothing in your server etc directory that mentions it, I am guessing you had a two-server setup at one time, and the "other" server may be sharing its data with yours. You may also want to check how BBDISP is set up, and whether it points to one display server or to more than one.
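For reference, one way to check which display server a client reports to is to look at the BBDISP and BBDISPLAYS assignments in its client configuration. The sketch below is only an illustration: show_bbdisp is a hypothetical helper name, and the file path in the usage comment is an assumption that depends on how the client was installed.

```shell
#!/bin/sh
# show_bbdisp FILE: print the BBDISP and BBDISPLAYS assignments found in FILE.
# Hypothetical helper; it is only a thin wrapper around grep.
show_bbdisp() {
    grep -E '^[[:space:]]*(BBDISP|BBDISPLAYS)=' "$1"
}

# Example (the path is an assumption; adjust it to your installation):
# show_bbdisp "$HOME/client/etc/hobbitclient.cfg"
```

A single address in BBDISP with an empty BBDISPLAYS means the client reports to one server. BBDISP set to 0.0.0.0 together with a list in BBDISPLAYS means it reports to several.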
We didn't have a "true" two-server setup. There was the main server and then this test one.
On the main server the test server was set up as a client, but it had server software on it. The main server picked up that the test server had server software on it.
Since then the server software on the test server has been removed and the client software installed.
I checked BBDISP and it is set correctly. It points to the main server.
This test should be just the ping test. I can ping the client just fine from the hobbit server, and if the ping fails we do get red alerts. I found that out when a network port was accidentally changed to a different VLAN. So the ping is working; it just isn't reporting the correct result.
On Fri, 2010-02-05 at 13:59 -0500, wiskbroom at hotmail.com wrote:
>> I have one server that keeps coming up purple and I can't get rid of it. The ping is working and it alerts when it goes red, but it just stays purple all the time.
>> When I look at the conn page it shows "Status message received from hobbitd" rather than "Status message received from 172.16.225.176".
> Can you access this machine and determine what its server is?
> Since you stated that there is nothing in your server etc directory that mentions it, I am guessing you had a two-server setup at one time, and the "other" server may be sharing its data with yours. You may also want to check how BBDISP is set up, and whether it points to one display server or to more than one.
To unsubscribe from the hobbit list, send an e-mail to hobbit-unsubscribe at hswn.dk
> We didn't have a "true" two-server setup.
> There was the main server and then this test one.
This is defined in your bb-hosts like this, no?
172.16.225.176 host-i-want-to-ping.org # conn
I can tell you that I was getting similar results when I had two hosts defined in BBDISPLAYS and BBDISP set to 0.0.0.0.
Just to recap: you have a host, 172.16.225.176, that is NOT configured to send to your Xymon server, but for some reason your Xymon server is reporting it, and it is purple. Is this correct?
Let's start over... :) I guess using machine names as examples would help.
"monitor" is the main hobbit server. IP 172.16.225.176
"28512dbtst" is the client that keeps going purple. IP 172.16.225.182
"monitor" is set up to monitor "28512dbtst". At one point someone put the hobbit server software on "28512dbtst". "monitor" figured out automatically that server software was running on "28512dbtst". "28512dbtst" and "monitor" were never set up as a dual-server pair. They were separate, except that "monitor" was monitoring "28512dbtst".
We got the server software removed from "28512dbtst" and the client software installed. We dropped "28512dbtst" (bb 127.0.0.1 "drop 28512dbtst") on "monitor" and let "monitor" rediscover "28512dbtst".
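The thread mentions dropping both a single column and the whole host. As a small sketch of the two forms of the drop command, the hypothetical helper below (drop_cmd is not part of Hobbit) only prints the bb command so it can be reviewed before being run by hand on the server.

```shell
#!/bin/sh
# drop_cmd HOST [TEST]: print the bb command that drops a whole host,
# or only one test column when TEST is given. Hypothetical helper:
# it prints the command and does not contact hobbitd.
drop_cmd() {
    if [ -n "${2:-}" ]; then
        printf 'bb 127.0.0.1 "drop %s %s"\n' "$1" "$2"
    else
        printf 'bb 127.0.0.1 "drop %s"\n' "$1"
    fi
}

drop_cmd 28512dbtst        # whole host
drop_cmd 28512dbtst conn   # only the conn column
```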
Now, out of 30+ servers being monitored by "monitor", "28512dbtst" is the only one that shows up purple, even though "28512dbtst" has the same client software as everything else and is set up to push its data to "monitor".
I also just tried dropping "28512dbtst", stopping the client software on it, and letting "monitor" just ping it. It is still purple, but conn is ok:
Fri Feb 5 15:20:44 2010 conn ok
Service conn on 28512dbtst is OK (up)
green 172.16.225.182 is alive (4.31 ms)
Status unchanged in 0 hours, 4 minutes
Status message received from hobbitd
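Since conn is green here while the host still shows purple, it may help to list which columns are actually purple and where their last status came from. The following is a minimal sketch, assuming the hobbitdboard query of the bb client accepts a fields= list containing hostname, testname, color and sender and prints them separated by "|" (this is how the bb(1) man page for Hobbit 4.x describes it; check your version). purple_statuses is a hypothetical helper name.

```shell
#!/bin/sh
# purple_statuses: read "hostname|testname|color|sender" lines on stdin
# and print only the statuses whose color is purple.
purple_statuses() {
    awk -F'|' '$3 == "purple" { printf "%s.%s sender=%s\n", $1, $2, $4 }'
}

# Hypothetical usage on the Hobbit server (requires the bb client binary):
# bb 127.0.0.1 "hobbitdboard host=28512dbtst fields=hostname,testname,color,sender" | purple_statuses
```

Any column this prints is one for which the server has a purple status on record, which narrows down which test is failing to report.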
On Fri, 2010-02-05 at 14:46 -0500, wiskbroom at hotmail.com wrote:
>> We didn't have a "true" two-server setup.
>> There was the main server and then this test one.
> This is defined in your bb-hosts like this, no?
> 172.16.225.176 host-i-want-to-ping.org # conn
> I can tell you that I was getting similar results when I had two hosts defined in BBDISPLAYS and BBDISP set to 0.0.0.0.
> Just to recap: you have a host, 172.16.225.176, that is NOT configured to send to your Xymon server, but for some reason your Xymon server is reporting it, and it is purple. Is this correct?
participants (2)
- cade.robinson@gmail.com
- wiskbroom@hotmail.com