[hobbit] Status Unavailable - again
Yep. I did. Henrik and I have tried a number of versions of Hobbit, including a few special diagnostic versions. We have been working mostly off-list because we have been exchanging potentially confidential information, and we don't believe that our failure to diagnose a problem is of general interest. (I am sure a full account of this sad tale will be posted once it is resolved.)
Kudos to Henrik though. I think he has really tried. He has worked tirelessly to resolve this issue, which I believe is very admirable when you consider his reward for all this hard work. Henrik is a true mensch. (Yes, that's an English word. Look it up.)
So far, we have not been able to identify the root cause of the problem. Henrik was going to have a look at some of the messages that came from one of the hosts, and get back to me.
Stefan, if you are interested in assisting us with this, Henrik and I can cc you on our off-list exchanges.
Have you got any theories as to the cause?
Regards,
Vernon
-----Original Message-----
From: Stefan Loos [mailto:stefan_loos at hotmail.com]
Sent: Monday, 11 July 2005 4:21 PM
To: hobbit at hswn.dk
Subject: RE: [hobbit] Status Unavailable - again
Did you use the hobbitd from the snapshot?
NOTICE: This message and any attachments are confidential and may contain copyright material of Australian Finance Group Limited or a third party. It is intended solely for the purpose of the addressee and any other named recipient. If you are not the intended recipient, any use, distribution, disclosure or copying of this message is strictly prohibited. The confidentiality attached to this message is not waived or lost by reason of the mistaken transmission or delivery to any unintended party. If you have received this message in error, please notify the author immediately or contact Australian Finance Group on +61 8 9420 7888.
It would be great if you could cc me. If you want, I can try to assist you. I'm at a point where I don't know what else to try. I think it isn't easy for Henrik to find this issue - I have no core dumps and nothing in the logfile that could help. And I never had any doubt that Henrik does a great job! (I think my English is not good enough to say it in other words.) So if there is anything I can do to help solve this problem...
I've tried to look up "mensch", but I think I'm using the wrong sites - German-English dictionaries always recognize "mensch" as a German word ;-)
Regards, Stefan
>From: "Vernon Everett" <v.everett at afgonline.com.au>
>Reply-To: hobbit at hswn.dk
>To: <hobbit at hswn.dk>
>Subject: RE: [hobbit] Status Unavailable - again
>Date: Mon, 11 Jul 2005 16:37:28 +0800
Hi Stefan,
On Mon, Jul 11, 2005 at 09:36:42AM +0000, Stefan Loos wrote:
> It would be great if you can put me in cc. If you want I can try to assist you. I'm at a point where I don't know what to try anymore. I think it isn't easy for Henrik to find this issue - I have no coredumps and nothing in the logfile what could help.
Yes, this is a really nasty problem. Vernon and I thought we had it nailed down by the end of last week, but there's more to it than what we found then.
What kind of external scripts run on your clients, apart from the BB client? The current suspicion is that this is triggered by a status message that is handled badly by Hobbit, causing this lock-up. So I'm trying to see if there might be something in common between your setups.
And what kind of system are you running Hobbit on? If Linux, which distribution? Another suspicion I have is that this might be a problem with the implementation of SysV IPC semaphores.
Regards, Henrik
Hi Henrik,
We have several home-grown scripts (mostly in Perl) that monitor Oracle instances and BEA WebLogic servers. There is one for hardware monitoring of HP (Intel) and Sun servers (prtdiag-based), and some just report the output of an HTTP request to the software running on those WebLogic servers. I have just one server for testing, an HP DL 360. We are running Red Hat Enterprise Server 3, but I've tried it with SuSE 9.3 too.
Regards,
Stefan
>From: henrik at hswn.dk (Henrik Stoerner)
>Reply-To: hobbit at hswn.dk
>To: hobbit at hswn.dk
>Subject: Re: [hobbit] Status Unavailable - again
>Date: Mon, 11 Jul 2005 13:08:59 +0200
participants (3)
- henrik@hswn.dk
- stefan_loos@hotmail.com
- v.everett@afgonline.com.au