Hello,
I have a collection of pages and subpages that together make up a system, and there are several such systems in my Xymon hosts file. Each system covers a set of machines, VMs and http targets. Each system is also listed on an 'Environments' page, which only has links (and icons) to the subpages for the systems (i.e. no URL or http info is attached to that Environments page itself).
Periodically, each system is brought down for maintenance. When that happens, I get swamped with alerts about every component of that system.
Does anyone know of a way to get one high-level alert, say from the subpages under my ovmEnvironments page, instead of 100+ alerts from my monitored elements?
I have not seen any discussions like this in the archives, but I could have missed them.
thanks Greg
Here is a modified snippet of my hosts.cfg file
page ovmEnvironments OVMEnvironments
title OVM Environments

##################################################################################################
Env1 starts here
##################################################################################################
subpage b16Env1 B16Env1
title B16 Env1

subparent b16Env1 b16Env1Hypers Hypervisors
title B16 Env1 Hypervisors
group-compress <font size="+1">Hypervisors</font>
0.0.0.6 appohs-crm-machine06
0.0.0.7 authohs-crm-machine07
...

subparent b16Env1 b16Env1Hosts Hosts
title B16 Env1 Hosts
group-compress <font size="+1">OVM Hosts</font>
0.0.0.8 v_resourceWebgate_machine08
0.0.0.9 v_challengWebgate_machine09
...

subparent b16Env1 b16Env1Admin Weblogic Admin/EM Pages
title B16 Env1 Weblogic Administration
group-compress <font size="+1">All Admin Consoles</font>
0.0.0.91 v_BI_ADMIN # http://user:pswd@machine:port/console
0.0.0.90 v_CRM_ADMIN # http://user:pswd@machine:port/console
...
And here is a snippet from alerts.cfg. These rules do NOT fire when any of the subpage indicators turn red:
PAGE=ovmEnvironments COLOR=red RECOVERED SCRIPT /home/msmith/xymon/xymonServer/server/ext/smsplus.sh me@company.com REPEAT=24h
PAGE=ovmEnvironments/b16Env1 COLOR=red RECOVERED SCRIPT /home/msmith/xymon/xymonServer/server/ext/smsplus.sh me@company.com REPEAT=24h
We currently monitor (ping) our APs, and it would really suck to get over 1000 emails if/when something goes down, so I exclude them as follows:
You can exclude pages from your alerts in two ways:
1.) Above all other alerts, add: PAGE=<yourpage/yoursubpage> Add nothing for the action, and that's it.
2.) On your bottom (default) alert action, append the following: EXPAGE=<yourpage/yoursubpage>
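As a rough sketch, the two options above might look like this in alerts.cfg. The page path wireless/aps, the HOST=* catch-all rule and the mail address are placeholders, not taken from anyone's real configuration. Whether a rule without recipients really keeps later rules from firing is worth confirming with a test run before relying on it.

```
# Option 1 (as described above): a PAGE rule with no recipient lines,
# placed above all other rules. "wireless/aps" is a placeholder page path.
PAGE=wireless/aps

# Option 2: keep the default (catch-all) rule at the bottom of the file,
# but exclude the page on its recipient line.
# HOST=* and the address stand in for whatever the default rule uses.
HOST=*
	MAIL admin@example.com EXPAGE=wireless/aps
```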
Ken Connell Intermediate Network Engineer Computer & Communication Services Ryerson University 350 Victoria St RM AB50 Toronto, Ont M5B 2K3 416-979-5000 x6709
-----Original Message----- From: gsimpson at comcast.net Sender: xymon-bounces at xymon.com Date: Wed, 15 Jun 2011 20:05:26 To: <xymon at xymon.com> Subject: [Xymon] alert collections
Xymon mailing list Xymon at xymon.com http://lists.xymon.com/mailman/listinfo/xymon
I just removed the default alert section. That way I only get notified for systems/pages that I specifically call out in the alerts.cfg file.
Scot Kreienkamp
-----Original Message----- From: xymon-bounces at xymon.com [mailto:xymon-bounces at xymon.com] On Behalf Of kconnell at ryerson.ca Sent: Thursday, June 16, 2011 7:52 AM To: gsimpson at comcast.net; xymon-bounces at xymon.com; Xymon Email List Subject: Re: [Xymon] alert collectionsif I understand
Thanks for the responses, guys. I have changed things now, but I'm not sure that it is progress.
My alerts.cfg now has only the entries listed below. I am not certain how to designate a default alert.
Now, when I run a test, I don't get ANY email notices sent. So I believe I am excluding the subpages correctly, which is good for me.
But I am not generating any alerts when my 'System' page turns red. I think it is because I have nothing but subpages on each of my 'System' pages.
Do you have any advice on generating alerts from a page like that? I want to get an alert from my "b16Env1" subpage (see the hosts snippet below).
thanks!
Here is my alerts.cfg file
PAGE=ovmEnvironments/b16Env1 COLOR=red RECOVERED
PAGE=ovmEnvironments/b16Env1/b16Env1Hypers COLOR=red RECOVERED
PAGE=ovmEnvironments/b17Env1/b16Env1Hosts COLOR=red RECOVERED
PAGE=ovmEnvironments/b16Env1/b16Env1Admin COLOR=red RECOVERED
PAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1IcApps COLOR=red RECOVERED
PAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps COLOR=red RECOVERED
PAGE=ovmEnvironments COLOR=red RECOVERED
MAIL xymon@mycompany.com REPEAT=5 EXPAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1IcApps EXPAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps
#-------------------
And here is the test output from sending one http error from a low-level subpage:
#-------------------
2011-06-16 08:18:51 Using default environment file /home/gjsimpso/xymon/xymonServer/server/etc/xymonserver.cfg
00026390 2011-06-16 08:18:51 send_alert v_TS_ODSM:http state Paging
00026390 2011-06-16 08:18:51 Matching host:service:page 'v_TS_ODSM:http:ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps' against rule line 66
00026390 2011-06-16 08:18:51 Failed 'PAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1FSApps COLOR=red RECOVERED' (pagename not in include list)
00026390 2011-06-16 08:18:51 Matching host:service:page 'v_TS_ODSM:http:ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps' against rule line 67
00026390 2011-06-16 08:18:51 Failed 'PAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1CrmApps COLOR=red RECOVERED' (pagename not in include list)
00026390 2011-06-16 08:18:51 Matching host:service:page 'v_TS_ODSM:http:ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps' against rule line 68
00026390 2011-06-16 08:18:51 Failed 'PAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1FinsApps COLOR=red RECOVERED' (pagename not in include list)
00026390 2011-06-16 08:18:51 Matching host:service:page 'v_TS_ODSM:http:ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps' against rule line 69
00026390 2011-06-16 08:18:51 Failed 'PAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1HcmApps COLOR=red RECOVERED' (pagename not in include list)
00026390 2011-06-16 08:18:51 Matching host:service:page 'v_TS_ODSM:http:ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps' against rule line 70
00026390 2011-06-16 08:18:51 Failed 'PAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1ScmApps COLOR=red RECOVERED' (pagename not in include list)
00026390 2011-06-16 08:18:51 Matching host:service:page 'v_TS_ODSM:http:ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps' against rule line 71
00026390 2011-06-16 08:18:51 Failed 'PAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1IcApps COLOR=red RECOVERED' (pagename not in include list)
00026390 2011-06-16 08:18:51 Matching host:service:page 'v_TS_ODSM:http:ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps' against rule line 72
00026390 2011-06-16 08:18:51 *** Match with 'PAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps COLOR=red RECOVERED' ***
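The "pagename not in include list" lines suggest that each PAGE= value is compared against the host's full page path, so a rule that names only a parent page would not pick up hosts sitting on pages beneath it. The alerts.cfg documentation describes a regular-expression form of PAGE= introduced by a leading %, and a rule along the following lines might cover everything under b16Env1 in one place. This is only a sketch and has not been tested against this setup. The recipient and REPEAT value are borrowed from the MAIL line shown earlier purely for illustration.

```
# Hypothetical rule: the leading % marks the value as a regular expression,
# so this should match the b16Env1 page itself and any page below it.
PAGE=%^ovmEnvironments/b16Env1(/|$) COLOR=red RECOVERED
	MAIL xymon@mycompany.com REPEAT=5
```

Note that Xymon alerts are raised per host and test, so a rule like this would consolidate the configuration but would still send one alert per failing status rather than a single alert for the whole page.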
I have not changed my hosts.cfg file (this is a snippet)
#-------------------
page ovmEnvironments OVMEnvironments
title OVM Environments

##################################################################################################
Env1 starts here
##################################################################################################
subpage b16Env1 B16Env1
title B16 Env1

subparent b16Env1 b16Env1Hypers Hypervisors
title B16 Env1 Hypervisors
group-compress <font size="+1">Hypervisors</font>
0.0.0.6 appohs-crm-machine06
0.0.0.7 authohs-crm-machine07
...

subparent b16Env1 b16Env1Hosts Hosts
title B16 Env1 Hosts
group-compress <font size="+1">OVM Hosts</font>
0.0.0.8 v_resourceWebgate_machine08
0.0.0.9 v_challengWebgate_machine09
...

subparent b16Env1 b16Env1Admin Weblogic Admin/EM Pages
title B16 Env1 Weblogic Administration
group-compress <font size="+1">All Admin Consoles</font>
0.0.0.91 v_BI_ADMIN # http://user:pswd@machine:port/console
0.0.0.90 v_CRM_ADMIN # http://user:pswd@machine:port/console
...

##################################################################################################
Env2 starts here
##################################################################################################
subpage b16Env2 B16Env2
title B16 Env2
...
-- Greg
----- Original Message ----- From: "Scot Kreienkamp" <SKreien at la-z-boy.com> To: kconnell at ryerson.ca, gsimpson at comcast.net, xymon-bounces at xymon.com, "Xymon Email List" <xymon at xymon.com> Sent: Thursday, June 16, 2011 6:45:24 AM Subject: RE: [Xymon] alert collectionsif I understand
participants (3)
- gsimpson@comcast.net
- kconnell@ryerson.ca
- SKreien@la-z-boy.com