I am testing Xymon 4.3.7 and I am trying to monitor about 3000 host pcs. If I get over 2000 hosts, then xymonnet continually crashes. OS is Opensuse 12.1.
I am simply trying to monitor the systems for connectivity right now, using fping.
Is there a limit on the number of hosts that Xymon can monitor?
Are there any other options that I can look at to see what is causing the error?
Does xymonnet give a core dump, or any log messages? Does xymonnet crash when starting to probe, or part-way through?
On Sun, Jul 8, 2012 at 3:04 PM, Scott J Post Sr <sjpostsr at gmail.com> wrote:
I am testing Xymon 4.3.7 and I am trying to monitor about 3000 host pcs. If I get over 2000 hosts, then xymonnet continually crashes. OS is Opensuse 12.1.****
I am simply trying to monitor the systems for connectivity right now, using fping.****
Is there a limit on the number of hosts that Xymon can monitor?****
Are there any other options that I can look at to see what is causing the error?
Xymon mailing list Xymon at xymon.com http://lists.xymon.com/mailman/listinfo/xymon
Sorry for the delay in responding to this:
There does not appear to be any core file or log messagess that indicated why it crashes,
However, most of the hosts that I am pinging are DHCP and I am seeing a lot of issues where is it unable to resolve hostname.
Statistics: Hosts total : 2552 Hosts with no tests : 0 Total test count : 2554 Status messages : 2555 Alert status msgs : 0 Transmissions : 28
DNS statistics:
hostnames resolved : 2553
succesful : 3902
failed : 1202
calls to dnsresolve : 2554
TCP test statistics:
TCP tests total : 2
HTTP tests : 1
Simple TCP tests : 1
Connection attempts : 2
bytes written : 128
bytes read : 320
...... <list of hosts.. cannot resolve hostnames> WARNING: Runtime 621 longer than time limit (300)
TIME SPENT Event Start time Duration xymonnet startup 574671.360650 - Service definitions loaded 574671.436934 0.076284 Tests loaded, hostname lookups done 575212.690797 541.253862 Test engine setup completed 575212.757481 0.066683 TCP tests completed 575213.058966 0.301485 PING test completed (2552 hosts) 575282.431094 69.372128 PING test results sent 575291.815708 9.384613 Test result collection completed 575291.859808 0.044100 LDAP test engine setup completed 575291.859810 0.000001 LDAP tests executed 575291.859814 0.000003 LDAP tests result collection completed 575291.859816 0.000001 Test results transmitted 575292.261780 0.401964 xymonnet completed 575292.522109 0.260329 TIME TOTAL 621.161459
I make backups of the hosts.cfg file before adding hosts, so when it crashes, I just restore the last working hosts.cfg file. I am trying to add hosts in smaller increments to determine just at what point the the program crashes again.
On Sun, Jul 8, 2012 at 12:04 AM, Scott J Post Sr <sjpostsr at gmail.com> wrote:
I am testing Xymon 4.3.7 and I am trying to monitor about 3000 host pcs. If I get over 2000 hosts, then xymonnet continually crashes. OS is Opensuse 12.1.****
I am simply trying to monitor the systems for connectivity right now, using fping.****
Is there a limit on the number of hosts that Xymon can monitor?****
Are there any other options that I can look at to see what is causing the error?
On 13-07-2012 17:43, Scott Post wrote:
There does not appear to be any core file or log messagess that indicated why it crashes,
They should appear in the ~xymon/data/tmp/ directory
However, most of the hosts that I am pinging are DHCP and I am seeing a lot of issues where is it unable to resolve hostname. ...... <list of hosts.. cannot resolve hostnames> WARNING: Runtime 621 longer than time limit (300)
Definitely some issue with DNS lookups. I would be interested to hear if the soon-to-be-released 4.3.8 version works better for you ? You can download it from http://xymon.svn.sourceforge.net/viewvc/xymon/branches/4.3.8/?view=tar
Is there a limit on the number of hosts that Xymon can monitor?____
No.
Are there any other options that I can look at to see what is causing the error?
Anything in the xymonnet logfile ?
Regards, Henrik
participants (3)
-
henrik@hswn.dk
-
jlaidman@rebel-it.com.au
-
sjpostsr@gmail.com