I know there was a thread on this a while back, and if I recall, there was mention that the 4.3.0 line would have features for HA.
Anyway, I've been working on implementing a 2-node HA Xymon cluster using LinuxHA, and probably DRBD for shared disk (I have yet to implement DRBD, though). The Apache and Xymon services fail over to the other node fairly nicely, and I have successfully testing that using a shared IP to the Xymon web page works. It's still in the early stages of implementation and testing, but so far it appears hold hold promise.
I'd be interesting in hearing about other HA solutions that have been implemented (if any). I don't recall previous mention of LinuxHA, so if no one else has done it, I'll be glad to update the list with my progress.
Oh yes, forgot about this: http://en.wikibooks.org/wiki/System_Monitoring_with_Hobbit/Administration_Gu...
On Tue, Feb 10, 2009 at 10:36 AM, Gary Baluha <gumby3203 at gmail.com> wrote:
I know there was a thread on this a while back, and if I recall, there was mention that the 4.3.0 line would have features for HA.
Anyway, I've been working on implementing a 2-node HA Xymon cluster using LinuxHA, and probably DRBD for shared disk (I have yet to implement DRBD, though). The Apache and Xymon services fail over to the other node fairly nicely, and I have successfully testing that using a shared IP to the Xymon web page works. It's still in the early stages of implementation and testing, but so far it appears hold hold promise.
I'd be interesting in hearing about other HA solutions that have been implemented (if any). I don't recall previous mention of LinuxHA, so if no one else has done it, I'll be glad to update the list with my progress.
Anyone know how I can check to see why the 'bbd' service isn't apparently running on my Hobbit server and better yet, how do I start it?
Thanks, Jason
This will be great ! :-)
.. i've got new hardware since 3 month for that (my) "project", but not real enough time for a striking success :-(
This would be very helpful !
cheers, martin
On Tue, 10 Feb 2009, Gary Baluha wrote:
I know there was a thread on this a while back, and if I recall, there was mention that the 4.3.0 line would have features for HA.
Anyway, I've been working on implementing a 2-node HA Xymon cluster using LinuxHA, and probably DRBD for shared disk (I have yet to implement DRBD, though). The Apache and Xymon services fail over to the other node fairly nicely, and I have successfully testing that using a shared IP to the Xymon web page works. It's still in the early stages of implementation and testing, but so far it appears hold hold promise.
I'd be interesting in hearing about other HA solutions that have been implemented (if any). I don't recall previous mention of LinuxHA, so if no one else has done it, I'll be glad to update the list with my progress.
I have it running with Conga/Ricci/lucci redhat-cluster and a 3510 shared disk array. [2 node with shared IP]
I was thinking of switching it over to linux-HA though, because it takes 3-5 seconds to fail over, and this seems too slow for me
how fast does linuxHA failover?
-Sean
From: Gary Baluha [mailto:gumby3203 at gmail.com] Sent: Tuesday, February 10, 2009 10:37 AM To: hobbit at hswn.dk Subject: [hobbit] HA solutions
I know there was a thread on this a while back, and if I recall, there was mention that the 4.3.0 line would have features for HA.
Anyway, I've been working on implementing a 2-node HA Xymon cluster using LinuxHA, and probably DRBD for shared disk (I have yet to implement DRBD, though). The Apache and Xymon services fail over to the other node fairly nicely, and I have successfully testing that using a shared IP to the Xymon web page works. It's still in the early stages of implementation and testing, but so far it appears hold hold promise.
I'd be interesting in hearing about other HA solutions that have been implemented (if any). I don't recall previous mention of LinuxHA, so if no one else has done it, I'll be glad to update the list with my progress.
P Go Green! Print this email only when necessary. Thank you for helping Time Warner Cable be environmentally responsible.
This E-mail and any of its attachments may contain Time Warner Cable proprietary information, which is privileged, confidential, or subject to copyright belonging to Time Warner Cable. This E-mail is intended solely for the use of the individual or entity to which it is addressed. If you are not the intended recipient of this E-mail, you are hereby notified that any dissemination, distribution, copying, or action taken in relation to the contents of and attachments to this E-mail is strictly prohibited and may be unlawful. If you have received this E-mail in error, please notify the sender immediately and permanently delete the original and any copy of this E-mail and any printout.
I haven't gotten down to tuning the setup, but out of the box, LinuxHA seems to fail over in about that same 3-5 second time frame. That's with the heartbeat going through the primary LAN interface, and just going by what crm_mon says (with a 5 second update interval).
Once I get DRBD up and running on the nodes, I'm going to do a bit more exhaustive testing and see how quickly and smoothly I get can things to fail over. Maybe now that I've started this thread, it will give me an incentive to spend some more time on this ;-)
On Tue, Feb 10, 2009 at 1:43 PM, Clark, Sean <sean.clark at twcable.com> wrote:
I have it running with Conga/Ricci/lucci redhat-cluster and a 3510 shared disk array. [2 node with shared IP]
I was thinking of switching it over to linux-HA though, because it takes 3-5 seconds to fail over, and this seems too slow for me
how fast does linuxHA failover?
-Sean
P Go Green! Print this email only when necessary. Thank you for helping Time Warner Cable be environmentally responsible.
*From:* Gary Baluha [mailto:gumby3203 at gmail.com] *Sent:* Tuesday, February 10, 2009 10:37 AM *To:* hobbit at hswn.dk *Subject:* [hobbit] HA solutions
I know there was a thread on this a while back, and if I recall, there was mention that the 4.3.0 line would have features for HA.
Anyway, I've been working on implementing a 2-node HA Xymon cluster using LinuxHA, and probably DRBD for shared disk (I have yet to implement DRBD, though). The Apache and Xymon services fail over to the other node fairly nicely, and I have successfully testing that using a shared IP to the Xymon web page works. It's still in the early stages of implementation and testing, but so far it appears hold hold promise.
I'd be interesting in hearing about other HA solutions that have been implemented (if any). I don't recall previous mention of LinuxHA, so if no one else has done it, I'll be glad to update the list with my progress.
This E-mail and any of its attachments may contain Time Warner Cable proprietary information, which is privileged, confidential, or subject to copyright belonging to Time Warner Cable. This E-mail is intended solely for the use of the individual or entity to which it is addressed. If you are not the intended recipient of this E-mail, you are hereby notified that any dissemination, distribution, copying, or action taken in relation to the contents of and attachments to this E-mail is strictly prohibited and may be unlawful. If you have received this E-mail in error, please notify the sender immediately and permanently delete the original and any copy of this E-mail and any printout.
Gary,
HA over WAN is the approach I am using due to WAN network structure I have.
I tested linux-ha+drbd for HA over LAN requirement a few years ago and this approach is not needed for my current requirements. So please submit your notes about this HA-LAN using drbd approach to xymon wiki site.
T.J. Yang
Date: Tue, 10 Feb 2009 14:14:19 -0500 From: gumby3203 at gmail.com To: hobbit at hswn.dk Subject: Re: [hobbit] HA solutions
I haven't gotten down to tuning the setup, but out of the box, LinuxHA seems to fail over in about that same 3-5 second time frame. That's with the heartbeat going through the primary LAN interface, and just going by what crm_mon says (with a 5 second update interval).
Once I get DRBD up and running on the nodes, I'm going to do a bit more exhaustive testing and see how quickly and smoothly I get can things to fail over. Maybe now that I've started this thread, it will give me an incentive to spend some more time on this ;-)
On Tue, Feb 10, 2009 at 1:43 PM, Clark, Sean <sean.clark at twcable.com> wrote:
I have it running with Conga/Ricci/lucci redhat-cluster and a 3510 shared disk array. [2 node with shared IP]
I was thinking of switching it over to linux-HA though, because it takes 3-5 seconds to fail over, and this seems too slow for me
how fast does linuxHA failover?
-Sean
P Go Green! Print this email only when necessary. Thank you for helping Time Warner Cable be environmentally responsible.
From: Gary Baluha [mailto:gumby3203 at gmail.com]
Sent: Tuesday, February 10, 2009 10:37 AM To: hobbit at hswn.dk Subject: [hobbit] HA solutions
I know there was a thread on this a while back, and if I recall, there was mention that the 4.3.0 line would have features for HA.
Anyway,
I've been working on implementing a 2-node HA Xymon cluster using LinuxHA, and
probably DRBD for shared disk (I have yet to implement DRBD, though). The
Apache and Xymon services fail over to the other node fairly nicely, and I have
successfully testing that using a shared IP to the Xymon web page works.
It's still in the early stages of implementation and testing, but so far it
appears hold hold promise.
I'd be interesting in hearing about other HA solutions that have been implemented (if any). I don't recall previous mention of LinuxHA, so if no one else has done it, I'll be glad to update the list with my progress. This E-mail and any of its attachments may contain Time Warner Cable proprietary information, which is privileged, confidential, or subject to copyright belonging to Time Warner Cable. This E-mail is intended solely for the use of the individual or entity to which it is addressed. If you are not the intended recipient of this E-mail, you are hereby notified that any dissemination, distribution, copying, or action taken in relation to the contents of and attachments to this E-mail is strictly prohibited and may be unlawful. If you have received this E-mail in error, please notify the sender immediately and permanently delete the original and any copy of this E-mail and any printout.
Windows Live™: Keep your life in sync. http://windowslive.com/explore?ocid=TXT_TAGLM_WL_t1_allup_explore_022009
Hi,
we're running a 2-node Linux-HA cluster using DRBD 8 and Xen. Xymon/hobbit is installed in a Xen domain.
If a node fails the Xen domain is automatically switched over to the other node.
With DRBD 8.0.x even live migration is possible. There is no need to use a cluster filesystem (like ocfs2 or gfs). Have a look at http://fghaas.wordpress.com/2007/09/03/drbd-806-brings-full-live-migration-f...
Our current installation uses Debian Etch 4.0, Xen 3.2 (Backports), DRBD (Source) and Heartbeat 2.1.x (Backports).
Regards, Alexander
Am 10.02.2009 16:36, Gary Baluha schrieb:
I know there was a thread on this a while back, and if I recall, there was mention that the 4.3.0 line would have features for HA.
Anyway, I've been working on implementing a 2-node HA Xymon cluster using LinuxHA, and probably DRBD for shared disk (I have yet to implement DRBD, though). The Apache and Xymon services fail over to the other node fairly nicely, and I have successfully testing that using a shared IP to the Xymon web page works. It's still in the early stages of implementation and testing, but so far it appears hold hold promise.
I'd be interesting in hearing about other HA solutions that have been implemented (if any). I don't recall previous mention of LinuxHA, so if no one else has done it, I'll be glad to update the list with my progress.
participants (6)
-
gumby3203@gmail.com
-
hobbit@alexkeller.de
-
jason@hands4christ.org
-
martin.flemming@desy.de
-
sean.clark@twcable.com
-
tj_yang@hotmail.com