Acked alerts occasionally continuing to email?
Running xymon 4.3.17on rhel6
We are occasionally seeing custom test yellow alerts continue to email after being ack'ed. It's been hard to pin down, because if I send a green and then let them go yellow again I am not seeing the problem recur. They appear as acked in the web interface.
Wondering if anyone else has seen anything like this?
(major boss-annoyance factor here)
thanks Betsy
On Mon, Oct 27, 2014, at 08:12, Betsy Schwartz wrote:
Running xymon 4.3.17on rhel6
We are occasionally seeing custom test yellow alerts continue to email after being ack'ed. It's been hard to pin down, because if I send a green and then let them go yellow again I am not seeing the problem recur. They appear as acked in the web interface.
Wondering if anyone else has seen anything like this?
(major boss-annoyance factor here)
thanks Betsy
Yes and unfortunately I find it hard to locate someone with the same problem. I don't have access to that Xymon server anymore, so I can't compare notes.
This is getting to be a REALLY BIG problem for us. I've got two alerts that keep emailing after ack, but not consistently my boss's boss wants me to make fixing this my next #1 priority
Anyone else experiencing this or have any thoughts on what might be triggering it?
On Mon, Oct 27, 2014 at 10:32 AM, Mark Felder <feld at feld.me> wrote:
On Mon, Oct 27, 2014, at 08:12, Betsy Schwartz wrote:
Running xymon 4.3.17on rhel6
We are occasionally seeing custom test yellow alerts continue to email after being ack'ed. It's been hard to pin down, because if I send a green and then let them go yellow again I am not seeing the problem recur. They appear as acked in the web interface.
Wondering if anyone else has seen anything like this?
(major boss-annoyance factor here)
thanks Betsy
Yes and unfortunately I find it hard to locate someone with the same problem. I don't have access to that Xymon server anymore, so I can't compare notes.
Xymon mailing list Xymon at xymon.com http://lists.xymon.com/mailman/listinfo/xymon
On Tue, Oct 28, 2014, at 08:14, Betsy Schwartz wrote:
This is getting to be a REALLY BIG problem for us. I've got two alerts that keep emailing after ack, but not consistently my boss's boss wants me to make fixing this my next #1 priority
Anyone else experiencing this or have any thoughts on what might be triggering it?
I recall running into this once where the directory the ack database is stored in wasn't writable or didn't exist, so the ack existed in memory but the alerts couldn't read the database and just sent alerts anyway.
I'm pretty sure this is what I hit once, but that was a few years ago. I've long since solved that problem permanently and have run into other mysterious alert problems I can't explain.
It might be worth checking into this first.
Verified that permissions are OK on the entire tree
Not sure if it's related but we're also seeing an issue where sometimes when we go to ACK a custom test, we see "No Active Alerts" and can't ack, or "No Acks Requested" after ack
(This is driving my boss's boss nuts, he's going to push us to Nagios if we can't reassure him that Xymon is working correctly)
On Tue, Oct 28, 2014 at 9:16 AM, Mark Felder <feld at feld.me> wrote:
On Tue, Oct 28, 2014, at 08:14, Betsy Schwartz wrote:
This is getting to be a REALLY BIG problem for us. I've got two alerts that keep emailing after ack, but not consistently my boss's boss wants me to make fixing this my next #1 priority
Anyone else experiencing this or have any thoughts on what might be triggering it?
I recall running into this once where the directory the ack database is stored in wasn't writable or didn't exist, so the ack existed in memory but the alerts couldn't read the database and just sent alerts anyway.
I'm pretty sure this is what I hit once, but that was a few years ago. I've long since solved that problem permanently and have run into other mysterious alert problems I can't explain.
It might be worth checking into this first.
participants (2)
-
betsy.schwartz@gmail.com
-
feld@feld.me