SCIENTIFIC-LINUX-USERS Archives

June 2005

SCIENTIFIC-LINUX-USERS@LISTSERV.FNAL.GOV

Subject:
From:
Gregory Harris <[log in to unmask]>
Reply To:
Gregory Harris <[log in to unmask]>
Date:
Wed, 8 Jun 2005 16:39:46 -0500
Content-Type:
text/plain
Parts/Attachments:
text/plain (93 lines)
I have been investigating this issue further.  You are right about 
the ethernet drivers.  However, it does not happen only over ssh; it 
also happens at the console, where the process stays stalled until a 
ping packet (or some other network activity) gets it running again.

After disabling the two onboard e1000 network ports and installing a 
3c905 network card, the stalls no longer occur.  So I believe that a 
race condition in the e1000 kernel module is the cause.  I have only 
been able to verify this on the PowerEdge 750.  If anybody else has an 
e1000 card and would like to test this, it would be great, but I would 
wager this problem exists on all systems using the e1000 driver.
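
For anyone who wants to check whether a machine is even a candidate for 
this test, here is a minimal sketch (not a definitive tool) that parses 
lsmod-style text and reports whether the e1000 module is loaded.  The 
function names are made up for illustration, and it assumes the first 
three whitespace-separated fields of each line are the module name, its 
size, and its use count, as in the lsmod listing quoted below.

```python
def loaded_modules(module_text):
    """Parse lsmod- or /proc/modules-style text into {name: (size, use_count)}.

    Assumption: each module line starts with "name size use_count".
    Lines that do not fit (such as the "Module Size Used by" header)
    are skipped.
    """
    modules = {}
    for line in module_text.splitlines():
        fields = line.split()
        if len(fields) < 3:
            continue
        name, size, used = fields[0], fields[1], fields[2]
        if not (size.isdigit() and used.isdigit()):
            continue  # header or malformed line
        modules[name] = (int(size), int(used))
    return modules


def uses_e1000(module_text):
    """Return True if a module named exactly 'e1000' appears in the text."""
    return "e1000" in loaded_modules(module_text)


def check_this_machine(path="/proc/modules"):
    """Read the running kernel's module list (Linux only) and check for e1000."""
    with open(path) as f:
        return uses_e1000(f.read())
```

On a Linux box, calling check_this_machine() reads /proc/modules 
directly.  A True result only means the driver is loaded, not that the 
stall will reproduce.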

I guess for now, I will wait it out with a different network card 
in place of the onboard ports until the upstream vendor releases a 
fixed kernel.

Greg

John Franks wrote:
> Does this happen only when using ssh?  If so, I would suspect issues
> with the ethernet drivers.  There have been problems with the drivers
> for some Broadcom devices.  You might check out redhat bugzilla.  
> 
> On Fri, 2005-05-13 at 09:08 -0500, Gregory Harris wrote:
> 
>>Hi Folks,
>>
>>SL4 works great on all of the Dell Optiplex GX280s I've tried it on. 
>>However, I'm seeing some rather mysterious behavior on a Dell PowerEdge 
>>750.  Dell has said they will start certifying hardware and officially 
>>supporting the vendor's enterprise 4 operating system next month, so it 
>>may not be worth the effort to try to figure this out.  Anyhow, I 
>>thought I'd send this to the list and see if anybody else is having any 
>>issues like this.
>>
>>1.  Sometimes, processes just seem to hang.  I've noticed this over an 
>>SSH session, often while running the yum command.  When I ssh in to the 
>>server again, the process is still in the process table.
>>
>>2.  When I run 'service iptables restart', the system stalls at 
>>"Unloading iptables modules:".  If I go to a different window and ping 
>>the server, the iptables service restart resumes.
>>
>>These don't sound like userland issues to me, although they 
>>very well could be.  Here are the kernel modules the machine is using:
>>
>>Module                  Size  Used by
>>ipt_REJECT             10561  1
>>ipt_state               5825  14
>>ip_conntrack           45701  1 ipt_state
>>iptable_filter          6721  1
>>ip_tables              21441  3 ipt_REJECT,ipt_state,iptable_filter
>>md5                     8001  1
>>ipv6                  238945  16
>>parport_pc             27905  0
>>lp                     15405  0
>>parport                37641  2 parport_pc,lp
>>autofs4                22085  0
>>nfs                   200997  6
>>lockd                  65129  2 nfs
>>sunrpc                137637  9 nfs,lockd
>>dm_mod                 57157  0
>>button                 10449  0
>>battery                12869  0
>>ac                      8773  0
>>uhci_hcd               32473  0
>>ehci_hcd               31813  0
>>e1000                  82253  0
>>floppy                 58065  0
>>ext3                  118473  3
>>jbd                    59481  1 ext3
>>ata_piix               12357  0
>>libata                 44229  1 ata_piix
>>aacraid                44529  4
>>sd_mod                 20545  5
>>scsi_mod              116301  3 libata,aacraid,sd_mod
>>
>>Thanks,
>>
>>Greg
>>
>>

-- 

Gregory Harris
Network Specialist
Department of Mathematics
The University of Kansas
[log in to unmask]
(785) 864-7311
