Hi everyone
Two days ago I updated Xymon to the latest version, 4.3.10, and for some reason unknown to me xymond stopped updating the hosts.cfg file.
Please see xymongen.log below:
2012-10-10 16:53:58 -> Recipient '193.61.54.144', timeout 15
2012-10-10 16:53:58 -> 1st line: 'xymondboard fields=hostname,testname,color,flags,lastchange,logtime,validtime,acktime,disabletime,sender,cookie,line1,acklist'
2012-10-10 16:53:58 xymond status-board not available, code 7
2012-10-10 16:53:58 Failed to load current Xymon status, aborting page-update
Could somebody please help me?
Many thanks Yanni
On 11 October 2012 20:08, Yanni <y.goudetsidis at mail.cryst.bbk.ac.uk> wrote:
> 2 days ago I updated xymon to the latest version:4.3.10 and for some reason unknown to me xymond stopped updating the hosts.cfg file.
> Please see xymongen.log below:
> 2012-10-10 16:53:58 -> Recipient '193.61.54.144', timeout 15
What is this server? If this is your Xymon server, then what we have here is a failure to communicate. Given that there was a timeout (rather than a rejected connection) I suspect a firewall or packet filter is dropping the packets, or perhaps a routing problem. Try to telnet to this IP address from your Xymon server, and if that doesn't work, you know it's unlikely to be a Xymon problem.
Aside from that, I'm confused. Why would Xymon modify the hosts.cfg file? Isn't that supposed to be edited by a human?
J
Hi
Sorry for the confusion. I meant to say that xymond does not load the hosts.cfg file, not that it fails to update it.
Yes, this IP is my Xymon server. I can telnet and ssh to the server from a terminal with no problems. For some strange reason the Xymon main page does not get updated anymore. I have manually added a few more hosts to 'hosts.cfg' but they don't show up.
Also I have checked that
- xymon.sh is running
- SELINUX was disabled but iptables was not, so I have now turned iptables off as well but the problem persists. :(
I really don't know what is happening. If you have any more suggestions, please let me know.
Many thanks Yanni
On 12 October 2012 21:17, Yanni <y.goudetsidis at mail.cryst.bbk.ac.uk> wrote:
> Yes this IP is my xymon server. I can telnet and ssh to the server from terminal with no problems.
Sorry, I meant to ask if you can telnet to it on port 1984, rather than the default telnet port. So:
telnet 193.61.54.144 1984
> For some strange reason the xymon main page does not get updated anymore. I have manually added a few more hosts in 'hosts.cfg' but they don't show up.
The Xymon page generator probably needs to query the xymond process using port 1984 (using the xymon command).
> Also I have checked that
> - xymon.sh is running
Is xymond running?
Let's try to reproduce the conditions that would have led to the error message. Try running this to see what it gives you:
xymoncmd xymon 193.61.54.144 'xymondboard fields=hostname,testname,color,flags,lastchange,logtime,validtime,acktime,disabletime,sender,cookie,line1,acklist'
Cheers Jeremy
It seems that telnet works because it returns "Connected to 193.61.54.144", but it appears to hang after the "Escape character is '^]'" line. I can type things, but nothing is returned when I hit Enter; the cursor just moves to the next line. I only use ssh, so I can't tell whether my telnet works properly.
This is what I get:
telnet 193.61.54.144 1984
Trying 193.61.54.144...
Connected to 193.61.54.144.
Escape character is '^]'.
Yes xymond is running. It's first on my "top" list:
(23182 xymon 25 0 7992 1984 1380 R 100.0 0.1 8286:29 xymond)
The command you asked me to run returns the following:
2012-10-15 10:00:10 Using default environment file /home/xymon/server/etc/xymonserver.cfg
2012-10-15 10:00:25 Whoops ! Failed to send message (timeout)
2012-10-15 10:00:25 ->
2012-10-15 10:00:25 -> Recipient '193.61.54.144', timeout 15
2012-10-15 10:00:25 -> 1st line: 'xymondboard fields=hostname,testname,color,flags,lastchange,logtime,validtime,acktime,disabletime,sender,cookie,line1,acklist'
A few more log files.
xymonlaunch.log:
2012-10-15 09:56:06 Loading hostnames
2012-10-15 09:56:06 Loading saved state
2012-10-15 09:56:06 Setting up network listener on 0.0.0.0:1984
2012-10-15 09:56:06 Cannot bind to listen socket (Address already in use)
2012-10-15 09:56:06 Task xymond terminated, status 1
hostdata.log:
2012-09-18 15:02:19 Tried to down BOARDBUSY: Invalid argument
2012-09-18 15:02:19 Peer not up, flushing message queue
2012-09-18 15:07:29 Peer not up, flushing message queue
clientdata.log:
2012-09-18 15:02:19 Tried to down BOARDBUSY: Invalid argument
2012-09-18 15:02:40 Peer not up, flushing message queue
2012-10-09 15:46:38 Whoops ! Failed to send message (timeout)
2012-10-09 15:46:38 ->
2012-10-09 15:46:38 -> Recipient '193.61.54.144', timeout 15
2012-10-09 15:46:38 -> 1st line: 'config hosts.cfg'
2012-10-09 15:46:38 Cannot load hosts.cfg from xymond, code 7
2012-10-09 15:46:38 Failed to load from xymond, reverting to file-load
Thanks Jeremy
On Mon, 2012-10-15 at 10:25 +0100, Yanni wrote:
> 2012-10-15 10:00:10 Using default environment file /home/xymon/server/etc/xymonserver.cfg
> 2012-10-15 10:00:25 Whoops ! Failed to send message (timeout)
> 2012-10-15 10:00:25 ->
> 2012-10-15 10:00:25 -> Recipient '193.61.54.144', timeout 15
Do you have anything like '--status-senders' set for xymond in your tasks.cfg file? If so, is this IP address listed?
> xymonlaunch.log:
> 2012-10-15 09:56:06 Loading hostnames
> 2012-10-15 09:56:06 Loading saved state
> 2012-10-15 09:56:06 Setting up network listener on 0.0.0.0:1984
Are you using multiple Xymon servers or should this ('0.0.0.0') be whatever XYMONSERVERIP is set to in xymonserver.cfg?
> 2012-10-09 15:46:38 Whoops ! Failed to send message (timeout)
> 2012-10-09 15:46:38 ->
> 2012-10-09 15:46:38 -> Recipient '193.61.54.144', timeout 15
> 2012-10-09 15:46:38 -> 1st line: 'config hosts.cfg'
> 2012-10-09 15:46:38 Cannot load hosts.cfg from xymond, code 7
> 2012-10-09 15:46:38 Failed to load from xymond, reverting to file-load
When I had accidentally restricted access to our Xymon server, I received errors such as 'connection refused'. In your case you get a timeout, which tends to imply it is not connecting. If iptables and selinux have been stopped, then I would check the settings in the 'server/etc' files to ensure that your Xymon server is configured correctly.
John.
-- John Horne, Plymouth University, UK Tel: +44 (0)1752 587287 Fax: +44 (0)1752 587001
On 15 October 2012 20:25, Yanni <y.goudetsidis at mail.cryst.bbk.ac.uk> wrote:
> It seems that telnet works because it returns "connected to 193.61.54.144", however it looks like it hangs after the: (Escape character is '^]') line.
That's to be expected. When doing a telnet test like this, you can get one of three things:
- connection refused (nothing listening on that port)
- time-out (firewall dropping packets)
- connected (works)
You got the last one, so all good.
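The three outcomes listed above can also be told apart without an interactive telnet session. What follows is only a minimal sketch, assuming bash (built with /dev/tcp support) and the coreutils `timeout` command are installed; the function name `probe_port` is made up for illustration and is not part of Xymon.

```shell
# probe_port HOST PORT [SECONDS]   (hypothetical helper, not a Xymon tool)
# Prints one of: connected, refused, timeout.
probe_port() {
    local host=$1 port=$2 secs=${3:-5}
    # Open (and immediately drop) a TCP connection in a child bash.
    # `timeout` kills the attempt after $secs seconds and exits with 124.
    timeout "$secs" bash -c 'exec 3<>"/dev/tcp/$0/$1"' "$host" "$port" 2>/dev/null
    case $? in
        0)   echo connected ;;   # TCP handshake completed
        124) echo timeout ;;     # no answer at all, e.g. packets dropped
        *)   echo refused ;;     # immediate failure; in this sketch that also
                                 # covers errors such as an unresolvable name
    esac
}
```

For the server in this thread the call would be `probe_port 193.61.54.144 1984`. Note that "connected" only says the TCP handshake succeeded; it does not show that xymond actually answers requests on that connection.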
If you have netcat, try doing this:
$ echo xymondboard | netcat 193.61.54.144 1984
This should get you a list of hosts and their status. The command might be "nc" rather than "netcat".
> xymonlaunch.log:
> 2012-10-15 09:56:06 Setting up network listener on 0.0.0.0:1984
> 2012-10-15 09:56:06 Cannot bind to listen socket (Address already in use)
Perhaps you have xymond running twice? Stop xymond, and kill all remaining xymond processes before attempting to start it again. Before starting, you should see no output from this:
$ netstat -na|grep 1984.*LISTEN
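The grep pattern above is unquoted, so the shell may expand it if a matching filename exists, and it matches any line that contains 1984 somewhere before LISTEN (port 21984, for example). A slightly stricter check could look like the sketch below. It assumes the usual `netstat -na` column layout (local address in the fourth column, state in the sixth), and `has_listener` is a made-up name.

```shell
# has_listener PORT   (hypothetical helper)
# Reads `netstat -na` style output on stdin and succeeds only if some
# socket is in LISTEN state on exactly that local port.
has_listener() {
    awk -v port="$1" '
        # $4 = local address (ends in :PORT or .PORT), $6 = socket state
        $6 == "LISTEN" && $4 ~ ("[.:]" port "$") { found = 1 }
        END { exit !found }
    '
}
```

Typical use would be `netstat -na | has_listener 1984 && echo "port 1984 is still in use"`; no message means nothing is listening and xymond should be able to bind the port.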
Cheers Jeremy
OK, I downloaded and installed netcat and ran the command:
$ echo xymondboard | netcat 193.61.54.144 1984
I waited for a couple of seconds but nothing was returned. I tried both 'netcat' and 'nc'.
I then killed all xymond jobs:
$ killall -9 xymond
Then I ran your netstat command:
$ netstat -na|grep 1984.*LISTEN
and it returned:
tcp 0 0 0.0.0.0:1984 0.0.0.0:* LISTEN
xymond is no longer in my 'top' list.
$ ps auf | grep xymond
returns:
xymon 1465 0.0 0.0 4012 664 pts/1 S+ 14:35 0:00 \_ grep xymond
Also, xymon.sh is running.
I am getting confused, because although I think I killed xymond, Xymon now seems to be working and the hosts.cfg file gets loaded. I removed and added a few hosts and the main Xymon page gets updated now. However, my xymond is not happy, because it's posting a yellow dizzy face.
Please see a few lines from xymonlaunch.log:
2012-10-16 11:07:41 FATAL: xymond sees clientcount 1, should be 0. Check for hanging xymond_channel processes or stale semaphores
2012-10-16 11:07:41 Cannot setup page channel
2012-10-16 11:07:41 Task xymond terminated, by signal 9
2012-10-16 11:07:46 Loading hostnames
2012-10-16 11:07:47 Loading saved state
2012-10-16 11:07:47 Setting up network listener on 0.0.0.0:1984
2012-10-16 11:07:47 Setting up signal handlers
2012-10-16 11:07:47 Setting up xymond channels
2012-10-16 11:07:47 Setting up logfiles
So why does everything work after I killed all the xymond processes?
I am super-confused.
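The FATAL line in the log excerpt suggests checking for hanging xymond_channel processes and stale semaphores. A rough way to do both is sketched below. It assumes the Xymon daemons run as user `xymon` (as the `top` output earlier in the thread shows) and that `ipcs -s` prints the Linux column layout (key, semid, owner, perms, nsems); `sems_owned_by` is a made-up helper name.

```shell
# sems_owned_by USER   (hypothetical helper)
# Reads `ipcs -s` output on stdin and prints the semid of every
# semaphore array owned by USER, one per line.
sems_owned_by() {
    # $2 = semid, $3 = owner; the numeric test skips the header lines
    awk -v user="$1" '$3 == user && $2 ~ /^[0-9]+$/ { print $2 }'
}

# Possible usage (assumption: the daemons run as user "xymon"):
#   ps -ef | grep '[x]ymond'        # lists xymond and xymond_channel; unlike
#                                   # `ps auf`, this also shows processes
#                                   # that have no controlling terminal
#   ipcs -s | sems_owned_by xymon   # semaphore arrays still owned by xymon
```

If semaphore IDs are still listed while no Xymon process is running at all, they are probably stale and could be removed with `ipcrm -s <semid>` before starting Xymon again. Removing them while Xymon is running would most likely break it.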
Participants (3): jlaidman@rebel-it.com.au, john.horne@plymouth.ac.uk, y.goudetsidis@mail.cryst.bbk.ac.uk