Well, I think these tests are coded on the server, and then fetched by the client and implemented at the client level. Under perfect conditions it can take 10 or 15 minutes for the client to retrieve and implement any changes. However, if you have coded everything correctly (like there is no mismatch on your host name, etc.) then there may be a problem with the client fetching mechanism. Since none of your stuff is working, it is hard to say whether the problem is in your configuration file (remember that Xymon silently ignores errors) or in the client itself.
GLH
On Thu, Jan 14, 2010 at 12:44 PM, Jonathan B. Horen <jbhoren at alaska.edu>wrote:
Yes, I correctly configured hobbit-clients.cfg on the Xymon server (Linux host), 'cuz it works fine on the cluster's head-node, but not on any of the compute nodes. And it's not a "cluster" thing, 'cuz cpu, disk, memory, and messages all work fine on the compute nodes.
"Ports" says "No port checks defined", and "Procs" says "No process checks defined"; but, here are the entries for the first compute node:
HOST=node13.cluster.private PROC hobbitlaunch PROC sge_execd PORT "LOCAL=%([.:]22)$" state=LISTEN color=yellow TRACK=sshd "TEXT=SSHD Server"
However, clicking on the blinking white "Ports" LED displays the following:
Thu Jan 14 09:38:57 AKST 2010 - Ports NOT ok
No port checks defined
tcp4 0 0 192.168.2.113.55604 192.168.2.100.2049 ESTABLISHED tcp4 0 0 192.168.2.113.55602 199.165.76.54.2049 ESTABLISHED tcp4 0 0 192.168.2.113.50066 199.165.76.54.6444 ESTABLISHED
tcp4 0 0 *.8649 *.* LISTEN tcp4 0 0 *.6445 *.* LISTEN tcp4 0 0 *.311 *.* LISTEN
tcp46 0 0 *.5900 *.* LISTEN tcp4 0 0 *.88 *.* LISTEN tcp6 0 0 *.88 *.* LISTEN tcp4 0 0 *.22 *.* LISTEN tcp6 0 0 *.22 *.* LISTEN
tcp4 0 0 *.625 *.* LISTEN tcp4 0 0 127.0.0.1.631 *.* LISTEN tcp6 0 0 ::1.631 *.* LISTEN
So we see that port 22 is being LISTENed on.
And, clicking on the blinking white "Ports" LED displays the following:
Thu Jan 14 09:43:57 AKST 2010 - Processes NOT ok
[image: clear] No process checks defined
PID PPID USER STARTED STAT PRI %CPU TIME %MEM RSS VSZ COMMAND
1 0 root Mon08AM Ss 31 0.0 0:46.13 0.0 588 76512 /sbin/launchd25 1 root Mon08AM Ss 31 0.0 0:01.31 0.0 1292 75944 /usr/libexec/kextd 26 1 root Mon08AM Ss 31 1.8 32:04.05 0.1 5036 80040 /usr/sbin/DirectoryService
27 1 root Mon08AM Ss 31 0.0 0:04.26 0.0 484 75920 /usr/sbin/notifyd 28 1 root Mon08AM Ss 31 0.0 0:48.60 0.0 484 77012 /usr/sbin/syslogd 29 1 root Mon08AM Ss 31 0.0 6:40.76 0.0 1796 77508 /usr/sbin/configd
30 1 daemon Mon08AM Ss 31 0.0 0:03.21 0.0 656 75324 /usr/sbin/distnoted 31 1 _mdnsresponder Mon08AM Ss 31 0.0 0:01.01 0.0 1296 77360 /usr/sbin/mDNSResponder -launchd 35 1 root Mon08AM Ss 31 0.0 0:00.99 0.0 1704 77088 /usr/sbin/securityd -i
39 1 root Mon08AM Ss 31 0.0 0:04.95 0.0 752 76484 master 40 1 root Mon08AM Ss 31 0.0 0:15.51 0.0 856 75888 /usr/sbin/ntpd -c /private/etc/ntp-restrict.conf -n -g -p /var/run/ntpd.pid -f /var/db/ntp.drift
41 1 _amavisd Mon08AM Ss 31 0.0 0:08.86 0.4 37516 114284 clamd 42 1 root Mon08AM Ss 31 0.0 0:00.01 0.0 320 75320 getty serial.57600 tty.serial 43 1 root Mon08AM Ss 63 0.0 0:01.99 0.0 652 75576 watchdogtimerd
44 1 213 Mon08AM Ss 31 0.0 0:00.04 0.0 1028 77308 /System/Library/PrivateFrameworks/MobileDevice.framework/Versions/A/Resources/usbmuxd -launchd 45 1 root Mon08AM Ss 31 0.0 0:46.29 0.0 292 75300 /usr/sbin/update
46 1 root Mon08AM Ss 31 0.0 0:00.02 0.0 684 75344 /sbin/SystemStarter 49 1 root Mon08AM Ss 31 0.0 4:54.83 0.1 8892 99552 servermgrd -x 51 1 root Mon08AM Ss 31 0.0 0:00.02 0.0 1064 76368 /System/Library/CoreServices/RemoteManagement/AppleVNCServer.bundle/Contents/Support/RFBRegisterMDNS
52 1 root Mon08AM Ss 50 0.0 0:35.73 0.1 5668 119164 /System/Library/Frameworks/CoreServices.framework/Frameworks/Metadata.framework/Support/mds 53 1 root Mon08AM Ss 48 0.0 0:03.22 0.0 3844 99900 /System/Library/CoreServices/loginwindow.app/Contents/MacOS/loginwindow console
54 1 root Mon08AM Ss 31 0.0 0:00.04 0.0 652 75420 /usr/sbin/KernelEventAgent 56 1 root Mon08AM Ss 31 0.0 18:55.08 0.0 1840 75932 hwmond 57 1 root Mon08AM Ss 31 0.0 0:00.67 0.0 600 75864 /usr/libexec/hidd
59 1 root Mon08AM Ss 50 0.0 0:06.90 0.0 1176 80024 /System/Library/Frameworks/CoreServices.framework/Versions/A/Frameworks/CarbonCore.framework/Versions/A/Support/fseventsd 61 1 root Mon08AM Ss 31 0.0 1:58.65 0.0 1816 85404 /sbin/emond
62 1 root Mon08AM Ss 63 0.0 0:00.01 0.0 700 75348 /sbin/dynamic_pager -F /private/var/vm/swapfile 65 1 root Mon08AM Ss 31 0.0 0:00.24 0.0 940 75432 /usr/sbin/diskarbitrationd
69 1 root Mon08AM Ss 31 0.0 0:00.02 0.0 676 75360 autofsd 79 1 root Mon08AM Ss 31 0.0 0:14.39 0.0 996 75468 /usr/sbin/kdcmond -n -a 82 1 root Mon08AM Ss 31 0.0 0:00.79 0.0 2108 78684 /System/Library/CoreServices/coreservicesd
84 1 _windowserver Mon08AM Ss 63 0.0 3:01.21 0.2 16928 114260 /System/Library/Frameworks/ApplicationServices.framework/Frameworks/CoreGraphics.framework/Resources/WindowServer -daemon 87 79 root Mon08AM S 31 0.0 0:00.04 0.0 1208 75772 /usr/sbin/krb5kdc -n -r LKDC:SHA1.A84C8C2567E3A97141E7B5B705DB050E8D8F8D0E
90 39 _postfix Mon08AM S 31 0.0 0:00.60 0.0 836 75524 qmgr -l -t fifo -u 102 1 _atsserver Mon08AM Ss 31 0.0 0:00.25 0.0 1744 112100 /System/Library/Frameworks/ApplicationServices.framework/Frameworks/ATS.framework/Support/ATSServer
108 1 nobody Mon08AM Ss 97 0.0 0:00.07 0.0 1872 86948 /System/Library/CoreServices/RemoteManagement/ARDAgent.app/Contents/MacOS/ARDAgent 109 1 root Mon08AM Ss 31 0.0 0:00.02 0.0 872 75336 /usr/sbin/UserEventAgent -l LoginWindow
111 53 root Mon08AM Ss 31 0.0 0:05.27 0.0 3912 99100 /System/Library/CoreServices/ManagedClient.app/Contents/MacOS/ManagedClient -s 112 108 nobody Mon08AM S 31 0.0 0:00.05 0.0 1716 85020 /System/Library/CoreServices/RemoteManagement/AppleVNCServer.bundle/Contents/MacOS/AppleVNCServer
113 35 root Mon08AM S 31 0.0 0:00.11 0.0 1536 86656 /System/Library/CoreServices/SecurityAgent.app/Contents/Resources/authorizationhost 115 35 _securityagent Mon08AM S 47 0.7 50:36.10 0.1 11336 154704 /System/Library/CoreServices/SecurityAgent.app/Contents/MacOS/SecurityAgent 220 1 3000 Mon08AM S 31 0.0 0:34.61 0.0 912 76648 /usr/local/sge/bin/darwin/sge_execd 231 1 nobody Mon08AM Ss 31 0.0 0:40.63 0.0 2684 76092 /usr/sbin/gmond
346 1 root Mon09AM Ss 31 0.0 0:16.81 0.0 840 76676 /usr/sbin/serialnumberd 8374 1 _update_sharing 1:37PM Ss 31 0.0 0:00.01 0.0 300 67120 /System/Library/Frameworks/JavaVM.framework/Versions/A/Resources/bin/updateSharingD
25291 39 _postfix 8:23AM S 31 0.0 0:00.02 0.0 760 75468 pickup -l -t fifo -u -o content_filter 26058 1 root 9:08AM Ss 31 0.0 0:00.03 0.0 276 76400 /usr/local/xymon/client/bin/hobbitlaunch --config=/usr/local/xymon/client/etc/clientlaunch.cfg --log=/usr/local/xymon/client/logs/clientlaunch.log --pidfile=/usr/local/xymon/client/logs/clientlaunch.node13.cluster.private.pid
26558 1 root 9:43AM Ss 31 0.0 0:00.01 0.0 788 75440 /usr/libexec/samba/synchronize-preferences --linger 26563 26058 root 9:43AM S 31 3.0 0:00.01 0.0 728 75944 /bin/sh /usr/local/xymon/client/bin/hobbitclient.sh
26567 26563 root 9:43AM R 30 5.3 0:00.02 0.0 684 75944 /bin/sh /usr/local/xymon/client/bin/hobbitclient-darwin.sh 26583 26567 root 9:43AM R 31 0.0 0:00.00 0.0 360 75352 ps -ax -ww -o pid
So we see that both the hobbitlaunch and sge_execd processes are present.
Thoughts? Suggestions?
-- JONATHAN B. HOREN Systems Administrator UAF Life Science Informatics Center for Research Services jbhoren at alaska.edu http://biotech.inbre.alaska.edu
-- Disclaimer: 1) all opinions are my own, 2) I may be completely wrong, 3) my advice is worth at least as much as what you are paying for it, or your money cheerfully refunded.