SCIENTIFIC-LINUX-USERS Archives

June 2007

SCIENTIFIC-LINUX-USERS@LISTSERV.FNAL.GOV

Subject:
From: Connie Sieh <[log in to unmask]>
Reply To: Connie Sieh <[log in to unmask]>
Date: Wed, 13 Jun 2007 10:32:37 -0500
Content-Type: TEXT/PLAIN
Parts/Attachments: TEXT/PLAIN (44 lines)

On Wed, 13 Jun 2007, Pann McCuaig wrote:

> Greetings, all:
>
> I've already contacted SAS technical support, but thought that someone
> on this list might have an idea.
>
> Last weekend someone tripped a breaker (or something) and the dedicated
> 220VAC line to our cluster rack went bye-bye. Monday morning I rebooted
> the cluster nodes one at a time and they all came up OK, although one
> required manual fsck intervention.
>
> After the reboot all seemed to be well until someone tried to run SAS.
>
> Here is my brief description of the problem for SAS technical support:
> ---------------------------------------------------------------------
> Scientific Linux SL release 4.4 (Beryllium)
> 2.6.9-42.0.10.ELsmp
> Sun V40z, 4-Opteron CPUs, 32G RAM
>
> SAS hangs and generates a number of non-kill-able tasks when executed
> either with the GUI (which never shows up) or with -nodms from the
> command line.

You could attach "strace" to the "hung" process to determine whether it
is really hung or doing something else.

   strace -o /tmp/trace.out -p <processid>
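
To find a process id to attach to, something like the following may help.
It is only a sketch, and it assumes the "non-kill-able" tasks are sitting
in uninterruptible sleep (state "D" in ps), which the report does not
actually confirm. The function names filter_dstate and list_dstate are made
up for illustration.

```shell
# Keep only tasks whose STAT column starts with "D" (uninterruptible sleep).
# Expects "PID STAT WCHAN COMMAND" lines on stdin, with a header on line 1.
filter_dstate() {
    awk 'NR > 1 && $2 ~ /^D/ { print $1, $3, $4 }'
}

# Print pid, kernel wait channel, and command name for every such task
# on this machine. Assumption: procps "ps" with -o format support.
list_dstate() {
    ps -eo pid,stat,wchan:32,comm | filter_dstate
}
```

Running list_dstate prints one line per candidate task. The wait channel
column hints at what the kernel is blocked on. If a task really is stuck in
state "D", strace may show no system calls at all, which is itself a useful
answer.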

-Connie Sieh
>
> Installed SAS on a similar platform. Runs fine.
>
> Did a second parallel installation of SAS on the platform that is
> failing. Same symptoms with the clean installation as with the one that
> has been running fine (until recently).
> ---------------------------------------------------------------------
>
> I'm flailing so any and all suggestions will be greatly appreciated.
>
> Cheers,
> Pann
>
