Yes, I have configured hobbit-clients.cfg correctly on the Xymon server (a Linux host): it works fine for the cluster's head node, but not for any of the compute nodes. And it's not a "cluster" thing, because cpu, disk, memory, and messages all work fine on the compute nodes.
"Ports" says "No port checks defined", and "Procs" says "No process checks defined", yet here are the hobbit-clients.cfg entries for the first compute node:
HOST=node13.cluster.private
    PROC hobbitlaunch
    PROC sge_execd
    PORT "LOCAL=%([.:]22)$" state=LISTEN color=yellow TRACK=sshd "TEXT=SSHD Server"
However, clicking on the blinking white "Ports" LED displays the following:
Thu Jan 14 09:38:57 AKST 2010 - Ports NOT ok
No port checks defined
tcp4 0 0 192.168.2.113.55604 192.168.2.100.2049 ESTABLISHED
tcp4 0 0 192.168.2.113.55602 199.165.76.54.2049 ESTABLISHED
tcp4 0 0 192.168.2.113.50066 199.165.76.54.6444 ESTABLISHED
tcp4 0 0 *.8649 *.* LISTEN
tcp4 0 0 *.6445 *.* LISTEN
tcp4 0 0 *.311 *.* LISTEN
tcp46 0 0 *.5900 *.* LISTEN
tcp4 0 0 *.88 *.* LISTEN
tcp6 0 0 *.88 *.* LISTEN
tcp4 0 0 *.22 *.* LISTEN
tcp6 0 0 *.22 *.* LISTEN
tcp4 0 0 *.625 *.* LISTEN
tcp4 0 0 127.0.0.1.631 *.* LISTEN
tcp6 0 0 ::1.631 *.* LISTEN
So we see that port 22 is in the LISTEN state, on both tcp4 and tcp6.
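As a sanity check on the rule itself, here is a minimal Python sketch that applies the regular expression from the PORT rule to the local-address column of a few of the netstat lines above. It assumes that Python's `re` module treats this simple pattern the same way the Xymon server's regex engine does, and the helper name `matching_listeners` is made up for illustration; it is not how Xymon evaluates the rule internally.

```python
import re

# Pattern taken from the rule: PORT "LOCAL=%([.:]22)$" state=LISTEN ...
# The leading % in the rule marks the rest as a regular expression.
LOCAL_PATTERN = re.compile(r"([.:]22)$")

# A few lines copied from the client's netstat output.
NETSTAT_LINES = """\
tcp4 0 0 192.168.2.113.55604 192.168.2.100.2049 ESTABLISHED
tcp4 0 0 *.8649 *.* LISTEN
tcp4 0 0 *.22 *.* LISTEN
tcp6 0 0 *.22 *.* LISTEN
tcp4 0 0 *.625 *.* LISTEN
""".splitlines()


def matching_listeners(lines):
    """Return (protocol, local address) for LISTEN sockets matching the pattern."""
    hits = []
    for line in lines:
        fields = line.split()
        if len(fields) < 6:
            continue  # skip anything that is not a full netstat record
        proto, local, state = fields[0], fields[3], fields[5]
        if state == "LISTEN" and LOCAL_PATTERN.search(local):
            hits.append((proto, local))
    return hits


print(matching_listeners(NETSTAT_LINES))
```

Under those assumptions the pattern picks out only the two `*.22` listeners, so the regex in the rule does not look like the problem.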
And clicking on the blinking white "Procs" LED displays the following:
Thu Jan 14 09:43:57 AKST 2010 - Processes NOT ok
No process checks defined
PID PPID USER STARTED STAT PRI %CPU TIME %MEM RSS VSZ COMMAND
1 0 root Mon08AM Ss 31 0.0 0:46.13 0.0 588 76512 /sbin/launchd
25 1 root Mon08AM Ss 31 0.0 0:01.31 0.0 1292 75944 /usr/libexec/kextd
26 1 root Mon08AM Ss 31 1.8 32:04.05 0.1 5036 80040 /usr/sbin/DirectoryService
27 1 root Mon08AM Ss 31 0.0 0:04.26 0.0 484 75920 /usr/sbin/notifyd
28 1 root Mon08AM Ss 31 0.0 0:48.60 0.0 484 77012 /usr/sbin/syslogd
29 1 root Mon08AM Ss 31 0.0 6:40.76 0.0 1796 77508 /usr/sbin/configd
30 1 daemon Mon08AM Ss 31 0.0 0:03.21 0.0 656 75324 /usr/sbin/distnoted
31 1 _mdnsresponder Mon08AM Ss 31 0.0 0:01.01 0.0 1296 77360 /usr/sbin/mDNSResponder -launchd
35 1 root Mon08AM Ss 31 0.0 0:00.99 0.0 1704 77088 /usr/sbin/securityd -i
39 1 root Mon08AM Ss 31 0.0 0:04.95 0.0 752 76484 master
40 1 root Mon08AM Ss 31 0.0 0:15.51 0.0 856 75888 /usr/sbin/ntpd -c /private/etc/ntp-restrict.conf -n -g -p /var/run/ntpd.pid -f /var/db/ntp.drift
41 1 _amavisd Mon08AM Ss 31 0.0 0:08.86 0.4 37516 114284 clamd
42 1 root Mon08AM Ss 31 0.0 0:00.01 0.0 320 75320 getty serial.57600 tty.serial
43 1 root Mon08AM Ss 63 0.0 0:01.99 0.0 652 75576 watchdogtimerd
44 1 213 Mon08AM Ss 31 0.0 0:00.04 0.0 1028 77308 /System/Library/PrivateFrameworks/MobileDevice.framework/Versions/A/Resources/usbmuxd -launchd
45 1 root Mon08AM Ss 31 0.0 0:46.29 0.0 292 75300 /usr/sbin/update
46 1 root Mon08AM Ss 31 0.0 0:00.02 0.0 684 75344 /sbin/SystemStarter
49 1 root Mon08AM Ss 31 0.0 4:54.83 0.1 8892 99552 servermgrd -x
51 1 root Mon08AM Ss 31 0.0 0:00.02 0.0 1064 76368 /System/Library/CoreServices/RemoteManagement/AppleVNCServer.bundle/Contents/Support/RFBRegisterMDNS
52 1 root Mon08AM Ss 50 0.0 0:35.73 0.1 5668 119164 /System/Library/Frameworks/CoreServices.framework/Frameworks/Metadata.framework/Support/mds
53 1 root Mon08AM Ss 48 0.0 0:03.22 0.0 3844 99900 /System/Library/CoreServices/loginwindow.app/Contents/MacOS/loginwindow console
54 1 root Mon08AM Ss 31 0.0 0:00.04 0.0 652 75420 /usr/sbin/KernelEventAgent
56 1 root Mon08AM Ss 31 0.0 18:55.08 0.0 1840 75932 hwmond
57 1 root Mon08AM Ss 31 0.0 0:00.67 0.0 600 75864 /usr/libexec/hidd
59 1 root Mon08AM Ss 50 0.0 0:06.90 0.0 1176 80024 /System/Library/Frameworks/CoreServices.framework/Versions/A/Frameworks/CarbonCore.framework/Versions/A/Support/fseventsd
61 1 root Mon08AM Ss 31 0.0 1:58.65 0.0 1816 85404 /sbin/emond
62 1 root Mon08AM Ss 63 0.0 0:00.01 0.0 700 75348 /sbin/dynamic_pager -F /private/var/vm/swapfile
65 1 root Mon08AM Ss 31 0.0 0:00.24 0.0 940 75432 /usr/sbin/diskarbitrationd
69 1 root Mon08AM Ss 31 0.0 0:00.02 0.0 676 75360 autofsd
79 1 root Mon08AM Ss 31 0.0 0:14.39 0.0 996 75468 /usr/sbin/kdcmond -n -a
82 1 root Mon08AM Ss 31 0.0 0:00.79 0.0 2108 78684 /System/Library/CoreServices/coreservicesd
84 1 _windowserver Mon08AM Ss 63 0.0 3:01.21 0.2 16928 114260 /System/Library/Frameworks/ApplicationServices.framework/Frameworks/CoreGraphics.framework/Resources/WindowServer -daemon
87 79 root Mon08AM S 31 0.0 0:00.04 0.0 1208 75772 /usr/sbin/krb5kdc -n -r LKDC:SHA1.A84C8C2567E3A97141E7B5B705DB050E8D8F8D0E
90 39 _postfix Mon08AM S 31 0.0 0:00.60 0.0 836 75524 qmgr -l -t fifo -u
102 1 _atsserver Mon08AM Ss 31 0.0 0:00.25 0.0 1744 112100 /System/Library/Frameworks/ApplicationServices.framework/Frameworks/ATS.framework/Support/ATSServer
108 1 nobody Mon08AM Ss 97 0.0 0:00.07 0.0 1872 86948 /System/Library/CoreServices/RemoteManagement/ARDAgent.app/Contents/MacOS/ARDAgent
109 1 root Mon08AM Ss 31 0.0 0:00.02 0.0 872 75336 /usr/sbin/UserEventAgent -l LoginWindow
111 53 root Mon08AM Ss 31 0.0 0:05.27 0.0 3912 99100 /System/Library/CoreServices/ManagedClient.app/Contents/MacOS/ManagedClient -s
112 108 nobody Mon08AM S 31 0.0 0:00.05 0.0 1716 85020 /System/Library/CoreServices/RemoteManagement/AppleVNCServer.bundle/Contents/MacOS/AppleVNCServer
113 35 root Mon08AM S 31 0.0 0:00.11 0.0 1536 86656 /System/Library/CoreServices/SecurityAgent.app/Contents/Resources/authorizationhost
115 35 _securityagent Mon08AM S 47 0.7 50:36.10 0.1 11336 154704 /System/Library/CoreServices/SecurityAgent.app/Contents/MacOS/SecurityAgent
220 1 3000 Mon08AM S 31 0.0 0:34.61 0.0 912 76648 /usr/local/sge/bin/darwin/sge_execd
231 1 nobody Mon08AM Ss 31 0.0 0:40.63 0.0 2684 76092 /usr/sbin/gmond
346 1 root Mon09AM Ss 31 0.0 0:16.81 0.0 840 76676 /usr/sbin/serialnumberd
8374 1 _update_sharing 1:37PM Ss 31 0.0 0:00.01 0.0 300 67120 /System/Library/Frameworks/JavaVM.framework/Versions/A/Resources/bin/updateSharingD
25291 39 _postfix 8:23AM S 31 0.0 0:00.02 0.0 760 75468 pickup -l -t fifo -u -o content_filter
26058 1 root 9:08AM Ss 31 0.0 0:00.03 0.0 276 76400 /usr/local/xymon/client/bin/hobbitlaunch --config=/usr/local/xymon/client/etc/clientlaunch.cfg --log=/usr/local/xymon/client/logs/clientlaunch.log --pidfile=/usr/local/xymon/client/logs/clientlaunch.node13.cluster.private.pid
26558 1 root 9:43AM Ss 31 0.0 0:00.01 0.0 788 75440 /usr/libexec/samba/synchronize-preferences --linger
26563 26058 root 9:43AM S 31 3.0 0:00.01 0.0 728 75944 /bin/sh /usr/local/xymon/client/bin/hobbitclient.sh
26567 26563 root 9:43AM R 30 5.3 0:00.02 0.0 684 75944 /bin/sh /usr/local/xymon/client/bin/hobbitclient-darwin.sh
26583 26567 root 9:43AM R 31 0.0 0:00.00 0.0 360 75352 ps -ax -ww -o pid
So we see that both the hobbitlaunch and sge_execd processes are present.
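The same kind of check for the PROC rules, as a rough Python sketch. It assumes that a plain (non-regex) PROC name is treated as a substring of the command column in the ps listing; the helper `count_matches` is hypothetical, and the hobbitlaunch command line is shortened here (its remaining arguments are omitted).

```python
# COMMAND column of a few entries from the ps listing above.
# The hobbitlaunch entry is shortened; its remaining arguments are omitted.
PS_COMMANDS = [
    "/usr/local/sge/bin/darwin/sge_execd",
    "/usr/sbin/gmond",
    "/usr/local/xymon/client/bin/hobbitlaunch --config=/usr/local/xymon/client/etc/clientlaunch.cfg",
    "/bin/sh /usr/local/xymon/client/bin/hobbitclient.sh",
]


def count_matches(name, commands):
    """Count command lines containing `name` (assumed substring semantics)."""
    return sum(1 for command in commands if name in command)


# Names taken from the two PROC rules for node13.
counts = {name: count_matches(name, PS_COMMANDS)
          for name in ("hobbitlaunch", "sge_execd")}
print(counts)
```

If that assumption holds, each of the two names matches exactly one running process, so the process names in the rules do not look wrong either.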
Thoughts? Suggestions?
--
JONATHAN B. HOREN
Systems Administrator
UAF Life Science Informatics
Center for Research Services
jbhoren at alaska.edu
http://biotech.inbre.alaska.edu