On Thu, 26 Oct 2006, Lily Li wrote:
> we start having a higher rate of lamd hanging problem
on the
> headnode. The lamd will not response to the command
"lamnodes" after
> the LAM is booted and used for couple of days.
This is a bit vague description of the problem. Have you
done anything
to diagnose why the lamd would not respond anymore ? For
example, have
you tried attaching to the "hung" lamd with gdb or
using 'strace -p'
to know what the process is actually doing ?
> do we need to recompile/link the LAM and the
applications after we
> upgrade the linux kernel ?
No. Especially with kernels from an enterprise class Linux
distribution which should not change too much between
updates.
--
Bogdan Costescu
IWR - Interdisziplinaeres Zentrum fuer Wissenschaftliches
Rechnen
Universitaet Heidelberg, INF 368, D-69120 Heidelberg,
GERMANY
Telephone: +49 6221 54 8869, Telefax: +49 6221 54 8868
E-mail: Bogdan.Costescu IWR.Uni-Heidelberg.De
_______________________________________________
This list is archived at http://www.l
am-mpi.org/MailArchives/lam/
|