We're monitoring 1420 IPs in hobbit, and it takes fping ~40 seconds to go through them all:
<snip> [root at hobbit fping]# fping -i5 -b12 -f ips -r1 -t250 -B2 -q -s 1430 targets 1419 alive 11 unreachable 0 unknown addresses
55 timeouts (waiting for response)
1474 ICMP Echos sent
1420 ICMP Echo Replies received
0 other ICMP received
0.05 ms (min round trip time) 5.83 ms (avg round trip time) 281 ms (max round trip time) 40.704 sec (elapsed real time) </snip>
Now, this seems a bit lengthy to me. I mean, if the avg round trip time is 5.83 ms, and there are 1430 hosts, should the total time in transit for all hosts should be 8336ms, or 8 seconds... right? Even when I remove the hosts that aren't responding, the results on are par with those above.
Our polling interval is once every 60 seconds (which we want to maintain, because we like to know ASAP when something drops even one ping), so it's not a problem yet. We add hosts on a daily basis, however, so it will be a problem some time in the future and I'd like to fix it before it becomes a problem.
This machine is a dual 3GHz Xeon /w 6GB of memory, running Fedora Core 5. I've flipped every bit in the kernel parameters that I dare via sysctl, with little to no effect on the poll time. Does anybody out there have any recommendations on a way to speed this up?
Regards, -Eric Schwimmer Network Engineer UVA HSCS Network Engineering
On Wed, Apr 26, 2006 at 03:24:15PM -0400, Schwimmer, Eric E *HS wrote:
We're monitoring 1420 IPs in hobbit, and it takes fping ~40 seconds to go through them all:
Is that a number you get from the "bbtest" status or from running fping by hand?
Are you doing other network tests in Hobbit than just ping? Hobbit does the ping tests in parallel with the other tests.
<snip> [root at hobbit fping]# fping -i5 -b12 -f ips -r1 -t250 -B2 -q -s
Are you using those parameters also on the FPING command in hobbitserver.cfg? Or is it just for your testing ?
Now, this seems a bit lengthy to me. I mean, if the avg round trip time is 5.83 ms, and there are 1430 hosts, should the total time in transit for all hosts should be 8336ms, or 8 seconds... right?
No, it should be less - because fping pings several hosts in parallel.
You have "-i5" which causes a 5 ms delay between each ping. So that's (5/1000)*1430 = 7.15 seconds where it does nothing. The default setting is "-i25" - i.e. 5 times higher - which would actually match your ~40 seconds nicely.
Don't forget that there is probably also some time spent doing ARP lookups for all of these IP's. Unless you have "testip" on all of the entries in bb-hosts (or run bbtest-net with "--dns=ip"), you'll also spend some time on DNS lookups (hint: use a local caching DNS server on the Hobbit server).
Even when I remove the hosts that aren't responding, the results on are par with those above.
Our polling interval is once every 60 seconds (which we want to maintain, because we like to know ASAP when something drops even one ping), so it's not a problem yet. We add hosts on a daily basis, however, so it will be a problem some time in the future and I'd like to fix it before it becomes a problem.
Well, the good news is that it probably won't become a problem. Because fping pings multiple hosts in parallel, the runtime doesn't change very much when you add more hosts.
If it does become an issue, spread the load. Setup an extra server to do half the network tests, and configure your bb-hosts file with "NET:net-a" and "NET:net-b" tags on the hosts. Then you set BBLOCATION="net-a" on one box, and "BBLOCATION=net-b" on the other. Then they'll only test those hosts where the NET:... setting matches. Unless it's an OS limitation, you could probably do that on a single box and just have two instances of the [bbnet] task in hobbitlaunch.cfg - instead of running bbtest-net directly, they would run a shell-script which sets the BBLOCATION environment just before running bbtest-net.
Regards, Henrik
Schwimmer, Eric E *HS a écrit :
We're monitoring 1420 IPs in hobbit, and it takes fping ~40 seconds to go through them all:
<snip> [root at hobbit fping]# fping -i5 -b12 -f ips -r1 -t250 -B2 -q -s 1430 targets 1419 alive 11 unreachable 0 unknown addresses
55 timeouts (waiting for response) 1474 ICMP Echos sent 1420 ICMP Echo Replies received 0 other ICMP received0.05 ms (min round trip time) 5.83 ms (avg round trip time) 281 ms (max round trip time) 40.704 sec (elapsed real time) </snip>
Hi Eric
this won't help you much, but I'm monitoring 1733 hosts with Hobbit, on a dual Xeon 3.2 GHz with 4 Gb running an up-to-date Gentoo Linux. Hobbit takes between 15 and 30 seconds to ping 1632 hosts; sudo is used to run fping :
TIME SPENT Event Starttime Duration PING test completed (1632 hosts) 1146122348.804056 19.808170
Running fping by hand gives this :
fping -i5 -b12 -f /tmp/ips.txt -r1 -t250 -B2 -q -s
[...] 30.999 sec (elapsed real time)
Lowering the -i, -r, -t values doesn't give anything...
The funny thing is that Hobbit runs sudo with -Ae, which is way slower when I run it by hand...
--
Frédéric Mangeant
Steria EDC Sophia-Antipolis
Hi Enrick,
i have installed the last snapshot from scratch on linux ES release 3, this is my hobbitlaunch.log:
2006-04-27 12:13:41 hobbitlaunch starting 2006-04-27 12:13:41 Loading tasklist configuration from /opt/hobbit-secure/server/etc/hobbitlaunch.cfg 2006-04-27 12:13:41 Loading hostnames 2006-04-27 12:13:41 Loading saved state 2006-04-27 12:13:41 Setting up network listener on 127.0.0.2:1984 2006-04-27 12:13:41 Setting up signal handlers 2006-04-27 12:13:41 Setting up hobbitd channels 2006-04-27 12:13:41 Setting up logfiles 2006-04-27 12:16:04 Task hobbitd terminated by signal 6
Marco
On Thu, Apr 27, 2006 at 12:27:21PM +0200, Marco Avvisano wrote:
i have installed the last snapshot from scratch on linux ES release 3, this is my hobbitlaunch.log: 2006-04-27 12:16:04 Task hobbitd terminated by signal 6
I was working on some new code last night but didn't get to finish it. So even though it compiles, I haven't actually seen it run at all :-)
Not sure if I'll have time to look at it tonight, but it should definitely get fixed over the week-end.
Henrik
I am running todays snapshot.
Lars
----- Original Message ----- From: "Henrik Stoerner" <henrik at hswn.dk> To: <hobbit at hswn.dk> Sent: Thursday, April 27, 2006 1:22 PM Subject: Re: [hobbit] last snapshot - Task hobbitd terminated
On Thu, Apr 27, 2006 at 12:27:21PM +0200, Marco Avvisano wrote:
i have installed the last snapshot from scratch on linux ES release 3, this is my hobbitlaunch.log: 2006-04-27 12:16:04 Task hobbitd terminated by signal 6
I was working on some new code last night but didn't get to finish it. So even though it compiles, I haven't actually seen it run at all :-)
Not sure if I'll have time to look at it tonight, but it should definitely get fixed over the week-end.
Henrik
To unsubscribe from the hobbit list, send an e-mail to hobbit-unsubscribe at hswn.dk
Hi Enrick,
the strange thing is that is make update, not install from scratch, it work.. in effect i have a particular configuration.... on my linux box i run two hobbit server, these is my configuration: Server 1
BBSERVERIP="127.0.0.1"
General settings
BBPORT="1984" # Portnumber where hobbitd/bbd listens BBDISP="0.0.0.0" # IP of a single hobbit/bbd server BBDISPLAYS="127.0.0.1 127.0.0.2"
[bbproxy] ENABLED ENVFILE /opt/hobbit/server/etc/hobbitserver.cfg CMD $BBHOME/bin/bbproxy --hobbitd --bbdisplay=127.0.0.1,127.0.0.2 --listen=PUBLIC_IP:1984 --report=$MACHINE.bbproxy --no-daemon --pidfile=$BBSERVERLOGS/bbproxy.pid LOGFILE $BBSERVERLOGS/bbproxy.log
Server 2:
BBSERVERIP="127.0.0.2" BBPORT="1984" # Portnumber where hobbitd/bbd listens BBDISP="$BBSERVERIP" # IP of a single hobbit/bbd server BBDISPLAYS="" #
bbproxy disabled
There is something in this configuration that is wrong?
Marco
----- Original Message ----- From: "Henrik Stoerner" <henrik at hswn.dk> To: <hobbit at hswn.dk> Sent: Thursday, April 27, 2006 1:22 PM Subject: Re: [hobbit] last snapshot - Task hobbitd terminated
On Thu, Apr 27, 2006 at 12:27:21PM +0200, Marco Avvisano wrote:
i have installed the last snapshot from scratch on linux ES release 3, this is my hobbitlaunch.log: 2006-04-27 12:16:04 Task hobbitd terminated by signal 6
I was working on some new code last night but didn't get to finish it. So even though it compiles, I haven't actually seen it run at all :-)
Not sure if I'll have time to look at it tonight, but it should definitely get fixed over the week-end.
Henrik
To unsubscribe from the hobbit list, send an e-mail to hobbit-unsubscribe at hswn.dk
On Fri, Apr 28, 2006 at 01:06:28PM +0200, Marco Avvisano wrote:
the strange thing is that is make update, not install from scratch, it work..
It's a bug - a variable wasn't initialized properly. I believe it's fixed in the next snapshot.
Henrik
participants (5)
-
EES2Y@hscmail.mcc.virginia.edu
-
frederic.mangeant@steria.com
-
henrik@hswn.dk
-
lars.ebeling@leopg9.no-ip.org
-
marco.avvisano@regione.toscana.it