Hi Matevz,
Thanks for helping me dig into this. While following your questions I
realized that the original address on first connection 169.228.130.92
somehow gets transmuted into 169.228.230.92 which is a nonexistent host,
does not have a DNS record, either. I feel somewhat stupid for not spotting
this earlier, sorry :(
How can this happen? Does server tell the client to which IP to connect?
>>> Yes, the server responds with IP addresses for a locate request
>>> (historic stuff). The question is the server responding that way or is
>>> the client mangling the address. This will be one tough cookie to track
>>> down if it's the client. So, try using the old client in the same way
>>> and see if the same thing happens there. If so, it's a server issue. If
>>> not, it's a client issue (assuming a sample of 2 is enough :-)
Andy
> Also, what is the result of:
>
> ]==> netstat -nt | grep 169.228.230.92
>
> when the connections are 'in progress'?
It just shows the connection in established state. Tried several times to
catch some short-lived sockets but had no luck.
For master / fedora 19 client (connection timeout error) I see this
(apparently going through ip6):
matevz@desire matevz> netstat -ntp | pcregrep '169.228.[12]30.92'
tcp6 0 1 132.239.186.42:49331 169.228.230.92:9940 SYN_SENT
23539/xrdfs
tcp6 0 0 132.239.186.42:52532 169.228.130.92:9940
ESTABLISHED 23539/xrdfs
matevz@desire matevz> ll /proc/23539/fd | grep socket
lrwx------ 1 matevz zh 64 Dec 3 14:03 10 -> socket:[6225802]
lrwx------ 1 matevz zh 64 Dec 3 14:03 9 -> socket:[6225797]
matevz@desire matevz> cat /proc/net/tcp6
sl local_address remote_address
st tx_queue rx_queue tr tm->when retrnsmt uid timeout inode
8: 0000000000000000FFFF00002ABAEF84:CC72
0000000000000000FFFF00005C82E4A9:26D4 01 00000000:00000000 00:00000000
00000000 411 0 6225797 1 ffff88058335cd80 20 4 24 10 -1
9: 0000000000000000FFFF00002ABAEF84:BFF1
0000000000000000FFFF00005CE6E4A9:26D4 02 00000001:00000000 01:00000176
00000004 411 0 6225802 2 ffff88058335ae80 1600 0 0 1 5
^
|
!!! how come this is different ??
How could I have missed this :) Apparently 169.228.130.92 (the correct ip)
is somehow changed into 169.228.230.92 (non-existent).
This is also why Andy got confused
For 3.3.3 client on slc5 (no route to host) I only see the established
socket in the fd list. And it's an ip4 socket, there is no /proc/net/tcp6 on
machine at all.
My program that connects to the host several times shows up in ip4,
/proc/net/tcp.
Thanks a lot & cheers,
Matevz
>>> Lukasz
>>>
>>> On 03.12.2013 11:39, Lukasz Janyst wrote:
>>>> Hi Andy,
>>>>
>>>> no, b is not the problem. There was an issue like the one you
>>>> mention, but it was introduced after migration to IPv6 in master only
>>>> and fixed immediately after it was discovered. It was never introduced
>>>> to the stable-3.3.x branch.
>>>>
>>>> Matevz, what you see is really strange. It looks like the system
>>>> won't let you connect to the host twice... A router issue perhaps? Can
>>>> you telnet twice to this host:port from your test box?
>>>>
>>>> Cheers,
>>>> Lukasz
<snip>
########################################################################
Use REPLY-ALL to reply to list
To unsubscribe from the XROOTD-DEV list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1
|