SCIENTIFIC-LINUX-USERS Archives

April 2010

SCIENTIFIC-LINUX-USERS@LISTSERV.FNAL.GOV

Subject:
From:
Aaron van Meerten <[log in to unmask]>
Reply To:
Aaron van Meerten <[log in to unmask]>
Date:
Fri, 23 Apr 2010 11:56:53 -0500
Content-Type:
multipart/signed
Parts/Attachments:
text/plain (2882 bytes) , smime.p7s (1948 bytes)
I am having exactly this problem.  My solution has been to re-run "srvadmin-services.sh start" any time these daemons stop running.  They then run for a short period of time before failing again.

I have only one node that refuses to ever respond to these commands, and the daemons quit without any warning.

I called Dell support about this, and was told basically that since SL5.3 is NOT a supported platform, no one from Linux Services was going to talk to me about it.  He also told me that I'd have to purchase a higher level of support, as the baseline support contract only covers the hardware and doesn't promise anything about the software at all.

So, let me know if you come up with anything resembling a solution to this.  My work-around with srvadmin-services.sh is not elegant, and isn't a sure-fire solution either.
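
For anyone who wants to automate the re-run workaround in the meantime, here is a minimal sketch of a watchdog.  It is only an illustration, not a fix: the daemon names come from the log lines quoted below, the script path is the one given in the quoted message (with the srvadmin-services.sh spelling), and the polling interval, the use of pgrep and logger, and the "--loop" switch are all assumptions.

```shell
#!/bin/bash
# Watchdog sketch for the "re-run srvadmin-services.sh start" workaround.
# Assumptions (not confirmed in the thread): the script path, the 60 second
# interval, and that pgrep/logger are available on the node.

SRVADMIN=${SRVADMIN:-/opt/dell/srvadmin/sbin/srvadmin-services.sh}
DAEMONS=${DAEMONS:-"dsm_sa_datamgrd dsm_sa_eventmgrd"}
INTERVAL=${INTERVAL:-60}   # seconds between checks (arbitrary choice)

# Print the name of every daemon passed as an argument that has no process.
# pgrep -f matches against the full command line; it is used instead of -x
# because long process names can be truncated in the kernel's name field.
# Caveat: -f also matches unrelated commands that mention the name.
missing_daemons() {
    local name
    for name in "$@"; do
        pgrep -f "$name" >/dev/null || echo "$name"
    done
}

# Run one check and restart the services if any daemon is missing.
check_once() {
    local missing
    missing=$(missing_daemons $DAEMONS | tr '\n' ' ')
    if [ -n "$missing" ]; then
        logger -t srvadmin-watchdog "not running: $missing; restarting"
        "$SRVADMIN" start
    fi
}

# Loop only when called with --loop, so the functions can also be sourced.
if [ "${1:-}" = "--loop" ]; then
    while true; do
        check_once
        sleep "$INTERVAL"
    done
fi
```

Started with "--loop" as root, this simply repeats the manual workaround, so it inherits the same limitation: on a node where the restart itself does not help, it will not help either.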

Cheers,

-Aaron van Meerten
MWT2

On Apr 23, 2010, at 11:47 AM, Michael Tiernan wrote:

> Hello everyone.
> 
> I'm running into a problem and I'm wondering if anyone has seen anything very similar or if I'm breaking new ground with it.
> 
> I have a new system, a Dell PE R710 with the new H700 RAID card. I installed SL5.3 (our current version) on it, and after loading the srvadmin tools (directly from the Dell site) I find that the daemons don't keep running. Here's a small snippet from my system log:
> 
> Apr 23 12:33:50 xxxxxxxx Server Administrator: Instrumentation Service EventID: 1001  Server Administrator startup complete
> Apr 23 12:33:51 xxxxxxxx dataeng: dsm_sa_eventmgrd shutdown succeeded
> Apr 23 12:33:58 xxxxxxxx dataeng: dsm_sa_datamgrd shutdown succeeded
> Apr 23 12:33:59 xxxxxxxx instsvcdrv: dell_rbu device driver unloaded
> 
> After the start process finishes (first line), the next three lines show the daemons shutting down and the driver unloading. The system keeps trying to respawn the services, only to have them die again.
> 
> The start sequence is:
> /opt/dell/srvadmin/sbin/srvadmin-services.sh start
> [short pause........]
> omreport storage vdisk
> [a normal output from the omreport about vdisks.]
> [another short pause.....]
> omreport storage vdisk
> [error, can't find any controllers]
> 
> At this point you can see dataeng trying to restart the dsm_sa daemons over and over again. (To clarify: they start, then stop.)
> 
> I built a RHEL5.3 system and the tools don't seem to be having a problem.
> 
> Before I go digging into it much more I thought I'd ask if anyone else has had any experience with this problem.
> 
> Thanks for everyone's time and use of the forum.
> 
> -- 
>  <<  MCT>>    Michael C Tiernan.   xmpp:[log in to unmask]
>  MIT - Laboratory for Nuclear Science - http://www.lns.mit.edu
>  High Perf Research Computing Facility at The Bates Linear Accelerator
>  "Bit-smashing your bits better than anyone can!"
> 
> 
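
The syslog excerpt in the quoted message shows the pattern to look for.  As a rough sketch, assuming log lines laid out exactly like the four quoted above (month, day, time, host, then "dataeng:" and the daemon name), the following counts how often each daemon reported "shutdown succeeded".  The function name count_shutdowns is made up for this example.

```shell
#!/bin/bash
# Count "shutdown succeeded" messages per daemon in syslog text read on stdin.
# Assumes the field layout of the quoted excerpt:
#   month day time host "dataeng:" daemon "shutdown" "succeeded"
count_shutdowns() {
    awk '$5 == "dataeng:" && $7 == "shutdown" && $8 == "succeeded" { n[$6]++ }
         END { for (d in n) print d, n[d] }' | sort
}
```

It reads standard input, so it could be fed the system log, for example "count_shutdowns < /var/log/messages" (that path is the usual default on RHEL-style systems and is an assumption here).  A count that keeps growing between runs would match the start-then-stop cycle described in the message.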


