I was playing around with hobbit-clients.cfg file trying to create a LOG rule to ignore this alert: May 21 00:54:21 redirect1-bo3.dl2.example.com monit[2029]: [ID 111343 daemon.error] 'gmond-sample.xml' timestamp test failed for /usr/local/Ganglia/logs/gmond-sample.xml
I **think the rule that put it into conniptions was HOST=%redirect.*bo3.dl2.example.com LOG /var/adm/messages COLOR=yellow IGNORE=%(repeated|gmond|monit|puppetd)
Also, I am experiencing something I've seen a few other times this week - a service that is not reporting, that was signed out, stays blue even when signed back in. I can't get rid of the xymond_client blue. Where is blue status stored? (it does not appear as blue on the enable/disable page but I have a blue dot on the host page and a blue report when I drill in)
[xymon at netmon2 server]$ gdb bin/hobbitd_client tmp/core.24453 GNU gdb (GDB) Red Hat Enterprise Linux (7.0.1-23.el5_5.2) Copyright (C) 2009 Free Software Foundation, Inc. License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html> This is free software: you are free to change and redistribute it. There is NO WARRANTY, to the extent permitted by law. Type "show copying" and "show warranty" for details. This GDB was configured as "x86_64-redhat-linux-gnu". For bug reporting instructions, please see: <http://www.gnu.org/software/gdb/bugs/>... Reading symbols from /u1/xymon/server/bin/hobbitd_client...done. Reading symbols from /lib64/libpcre.so.0...(no debugging symbols found)...done. Loaded symbols for /lib64/libpcre.so.0 Reading symbols from /lib64/librt.so.1...(no debugging symbols found)...done. Loaded symbols for /lib64/librt.so.1 Reading symbols from /lib64/libc.so.6...(no debugging symbols found)...done. Loaded symbols for /lib64/libc.so.6 Reading symbols from /lib64/libpthread.so.0...(no debugging symbols found)...done. Loaded symbols for /lib64/libpthread.so.0 Reading symbols from /lib64/ld-linux-x86-64.so.2...(no debugging symbols found)...done. Loaded symbols for /lib64/ld-linux-x86-64.so.2 Core was generated by `hobbitd_client'. Program terminated with signal 6, Aborted. #0 0x0000003833430265 in raise () from /lib64/libc.so.6 (gdb) bt #0 0x0000003833430265 in raise () from /lib64/libc.so.6 #1 0x0000003833431d10 in abort () from /lib64/libc.so.6 #2 0x0000000000427133 in sigsegv_handler (signum=<value optimized out>) at sig.c:57 #3 <signal handler called> #4 0x00000000004179f6 in scan_log (hinfo=0x1679440, classname=0x2b9863ae507e "sunos", logname=0x2b9863aee44b "/var/adm/messages", logdata=0x2b9863aee45e "May 21 00:57:25 redirect2-bo3.dl2.e-dialog.com last message repeated 36 times\nMay 21 00:57:35 redirect2-bo3.dl2.example.com monit[10418]: [ID 111343 daemon.error] 'gmond-sample.xml' timestamp test fa"..., section=<value optimized out>, summarybuf=0x1683a80) at client_config.c:2491 #5 0x0000000000408d0a in msgs_report ( hostname=0x2b9863ae5059 "redirect2-bo3.dl2.example.com", clientclass=0x2b9863ae507e "sunos", os=<value optimized out>, hinfo=0x1679440, fromline=0x7fff00bf2c50 "\nStatus message received from 10.200.32.51\n", timestr=0x2b9863ae50be "Sat May 21 01:11:24 EDT 2011", msgsstr=0x0) at xymond_client.c:1221 #6 0x000000000040fd6a in handle_solaris_client ( hostname=0x2b9863ae5059 "redirect2-bo3.dl2.example.com", clienttype=0x2b9863ae507e "sunos", os=OS_SOLARIS, hinfo=0x1679440, sender=<value optimized out>, timestamp=<value optimized out>, clientdata=0x2b9863ae5085 "client redirect2-bo3,dl2,example,com.sunos sunos\n[date") at client/solaris.c:69 #7 0x0000000000411e5f in main (argc=<value optimized out>, argv=0x7fff00bf3368) at xymond_client.c:2199