I've encountered a strange networking problem within a cluster of 4 AMD workstations running SL4.4. Two of the machines are recognized outside of the network and are ssh-able, but the two other machines are not recognized within the domain. More specifically,
[nmoore@honker ~]$ host honker
honker.workstations.winona.edu has address 199.17.152.193
[root@diepholz ~]# host diepholz
Host diepholz not found: 3(NXDOMAIN)
for diepholz, the ifconfig output follows,
[root@diepholz ~]# ifconfig
eth0 Link encap:Ethernet HWaddr 00:50:8D:C1:E3:05
inet addr:199.17.152.97 Bcast:199.17.152.255 Mask:255.255.255.0
inet6 addr: fe80::250:8dff:fec1:e305/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:3657979 errors:0 dropped:0 overruns:0 frame:0
TX packets:2069443 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:93750978 (89.4 MiB) TX bytes:178735590 (170.4 MiB)
Interrupt:217 Base address:0xcc00
lo Link encap:Local Loopback
inet addr:127.0.0.1 Mask:255.0.0.0
inet6 addr: ::1/128 Scope:Host
UP LOOPBACK RUNNING MTU:16436 Metric:1
RX packets:54249 errors:0 dropped:0 overruns:0 frame:0
TX packets:54249 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:0
RX bytes:11863382 (11.3 MiB) TX bytes:11863382 (11.3 MiB)
I've also posted the /etc/hosts file below,
[root@diepholz ~]# cat /etc/hosts
# Do not remove the following line, or various programs
# that require network functionality will fail.
127.0.0.1 diepholz localhost.localdomain localhost
199.17.152.17 runner
199.17.152.135 muscovey
199.17.152.13 pekin
199.17.152.193 honker
199.17.152.188 toulouse
199.17.152.202 sebastopol
So, I understand that the inter-cluster ssh works because of the hand-curated /etc/hosts file quoted above, but I don't understand why the two non-working machines are not broadcasting their IP/hostname pair.
regards,
Nathan Moore
- - - - - - - - - - - - - - - - - - - - - - -
Nathan Moore
Physics
Winona State University