Hi Jeremy, thank you for replying. It looks like my company is blocking some of the emails coming from xymon, so I did not see your reply.
Which logfile did you see this in? - xymongen.log
How often do you see these? - This happened twice, on 7/25 and 7/29. I never saw this before.
How long ago did this start? - It started 5 days ago, but I only saw it twice.
I did not see anything really strange in any other log. I am not sure why this started happening.
I did see a huge spike, approx. 5x more than usual, on the xymongen and xymonnet graphs. Both spikes occurred at the same time on Wednesday, which correlates with the 7/29 xymondboard error in the log. The spike quickly went back to normal.
This is a bit weird, since I never saw this type of issue before.
Daniel
On the face of it, it looks like a process (I'm guessing xymongen) is having a problem connecting to the xymond daemon, which would cause it to fail to construct a display page. The problem is most likely that xymond is unable to accept a connection, or respond quickly enough, hence the 15 second timeout. The red/white status you are seeing is possibly other messages being dropped by the xymond daemon.
I would take a look at the xymond status page of the Xymon servers and see if there are any useful messages or counters there. Also take a look at the other logfiles (e.g. xymongen.log, xymonnet.log) to see if there are similar messages.
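A minimal sketch of one way to check several logfiles for similar messages, counting the send timeouts per day. It assumes the logs use the same timestamp and message format as the excerpt quoted further down; the function names and the script name in the usage note are illustrative only, not part of Xymon.

```python
import re
import sys
from collections import Counter

# Matches entries like:
#   2020-07-29 12:30:13.241732 Whoops ! Failed to send message (timeout)
# The format is assumed from the log excerpt quoted in this thread.
TIMEOUT_RE = re.compile(
    r"(\d{4}-\d{2}-\d{2}) \d{2}:\d{2}:\d{2}\.\d+ "
    r"Whoops ! Failed to send message \(timeout\)"
)


def count_timeouts(text):
    """Return a Counter mapping date (YYYY-MM-DD) to the number of timeouts."""
    return Counter(m.group(1) for m in TIMEOUT_RE.finditer(text))


def count_timeouts_in_files(paths):
    """Sum the per-day timeout counts over all the given logfiles."""
    total = Counter()
    for path in paths:
        # errors="replace" keeps stray non-UTF-8 bytes from aborting the scan
        with open(path, errors="replace") as fh:
            total.update(count_timeouts(fh.read()))
    return total


if __name__ == "__main__":
    # Logfile paths are passed on the command line
    for day, n in sorted(count_timeouts_in_files(sys.argv[1:]).items()):
        print(day, n)
```

It could be run as, for example, "python3 count_timeouts.py xymongen.log xymonnet.log", with the paths adjusted to wherever the Xymon logs live on the server. Days with a non-zero count can then be compared against the spikes on the graphs.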
Cheers Jeremy
From: LOZOVSKY, DANIEL
Sent: Wednesday, July 29, 2020 11:06 AM
To: 'xymon at xymon.com' <xymon at xymon.com>
Subject: xymondboard error
I recently started seeing these error messages in my log file periodically. When these messages appeared, I noticed that some of the tests, but not all, turned to red or white status. Could you point me in the right direction on how I can troubleshoot these error messages?
2020-07-29 12:30:13.241732 Whoops ! Failed to send message (timeout)
2020-07-29 12:30:13.456322 ->
2020-07-29 12:30:13.456362 -> Recipient x.x.x.x, timeout 15
2020-07-29 12:30:13.456375 -> 1st line: 'xymondboard fields=hostname,testname,color,flags,lastchange,logtime,validtime,acktime,disabletime,sender,cookie,line1,acklist '
2020-07-29 12:30:13.456394 xymond status-board not available, code 7
2020-07-29 12:30:13.456409 Failed to load current Xymon status, aborting page-update