SCIENTIFIC-LINUX-USERS Archives

March 2015

SCIENTIFIC-LINUX-USERS@LISTSERV.FNAL.GOV

Options: Use Monospaced Font
Show Text Part by Default
Show All Mail Headers

Message: [<< First] [< Prev] [Next >] [Last >>]
Topic: [<< First] [< Prev] [Next >] [Last >>]
Author: [<< First] [< Prev] [Next >] [Last >>]

Print Reply
Subject:
From:
Arnau Bria <[log in to unmask]>
Reply To:
Arnau Bria <[log in to unmask]>
Date:
Mon, 2 Mar 2015 09:34:33 +0100
Content-Type:
text/plain
Parts/Attachments:
text/plain (24 lines)
On Mon, 2 Mar 2015 08:33:47 +0100
Andreas Haupt wrote:

> Hi Arnau,

Hi Andreas,
 
> over the weekend we managed to provoke an identical behaviour. Jobs
> crash during the epilog phase when the job's CGroup gets removed.

So UGE 8.2.1 + Kernel is 2.6.32-504.8.1?
what kernel are you running in your production cluster? did you see
this problem there, too? I can' upgrade to newer kernel because many
nodes reboot and we lose many many jobs...

> Did you already open a bug at Univa or somewhere else?
Not yet. I've been downgrading the kernel in many nodes and I had no
time to write to support. I'll do this week.

> Cheers,
> Andreas
Cheers,
Arnau

ATOM RSS1 RSS2