This morning, all of a sudden, one of our HOBBIT servers lost the ability to ping other servers.
Everything on the network is up and working fine, but the file: /var/log/hobbit/hobbitlaunch.log says:
2006-03-28 10:19:50 Task bbnet started with PID 10586
2006-03-28 10:19:50 Task bbnet terminated by signal 6

and when I run bbtest-net as the user hobbit, I get the following:

$ ../bin/bbtest-net
2006-03-28 09:40:27 xgetenv: Cannot find value for variable MACHINE
Aborted (core dumped)
Help! I have had to shut down paging because the network test is broken and thinks all the clients are dead!
Thanks for any help you can give - Winn
On Tue, Mar 28, 2006 at 10:44:35AM -0700, Winn Beutler wrote:
> This morning, all of a sudden, one of our HOBBIT servers lost the ability to ping other servers.
> Everything on the network is up and working fine, but the file: /var/log/hobbit/hobbitlaunch.log says:
> 2006-03-28 10:19:50 Task bbnet started with PID 10586
> 2006-03-28 10:19:50 Task bbnet terminated by signal 6
Which version of Hobbit are you running?
What's in the bb-network.log file?
Termination by signal 6 (SIGABRT) indicates a programming error in the bbtest-net tool, which causes it to crash. There should be a core file in the ~hobbit/server/tmp/ directory. Please run it through gdb as described in http://www.hswn.dk/hobbit/help/known-issues.html#bugreport
> and when I run bbtest-net as the user hobbit, I get the following:
> $ ../bin/bbtest-net
> 2006-03-28 09:40:27 xgetenv: Cannot find value for variable MACHINE
> Aborted (core dumped)
What happens if you run it as "bbcmd bbtest-net --debug"? bbcmd defines a bunch of environment settings that bbtest-net needs (hobbitlaunch also defines them when it launches tasks to run automatically).
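To see why the plain invocation aborts, it may help to check whether the variable named in the xgetenv error is present in the current shell. The following is only a rough sketch: missing_vars is a hypothetical helper, not part of Hobbit, and MACHINE is checked only because it is the variable the error message mentions.

```shell
#!/bin/sh
# Hypothetical helper (not part of Hobbit): print the name of every
# listed variable that is unset or empty in the current environment.
missing_vars() {
    for name in "$@"; do
        # Indirect lookup: expand the variable whose name is stored in $name.
        eval "value=\${$name:-}"
        if [ -z "$value" ]; then
            printf '%s\n' "$name"
        fi
    done
}

# MACHINE is the variable named in the xgetenv error message.
missing=$(missing_vars MACHINE)
if [ -n "$missing" ]; then
    echo "Not set in this shell: $missing (try running the tool through bbcmd)"
else
    echo "MACHINE is set in this shell"
fi
```

If MACHINE is reported as missing in a plain login shell for the hobbit user, that would be consistent with the abort seen when bbtest-net is started directly rather than through bbcmd.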
Regards, Henrik