Print

Print


Hello

Below is a forwarded email from Remi (awaits moderator approval for
xrootd-l).

Wilko


I have an ugly problem with xrdcp from today's cvs HEAD when it tries
to create more than one new directory on the server. It crashes in
XrdClientConn::ReadPartialAnswer after creation of the 2nd or higher
directory. Interestingly, sometimes it succeeds.

This worked flawlessly with the cvs version from Nov 12.

I tried bbrprod01 and bbrprod05 with the same result.

Here the output from gdb:

(gdb)  run -d2 KanAccess.cfg
root://bbrprod05:1094////prod/subskim/SPskims/test5/001237/200301/1008/
S0002/V04/
The program being debugged has been started already.
Start it from the beginning? (y or n) y

Starting program:
/afs/slac.stanford.edu/u/br/bbrskim/releases/test-16.0.1a/xrootd/bin/
i386_rhel30/xrdcp -d2 KanAccess.cfg
root://bbrprod05:1094////prod/subskim/SPskims/test5/001237/200301/1008/
S0002/V04/
[New Thread -1221110560 (LWP 27851)]
041118 17:02:34 001 Xrd: main (C) 2004 SLAC INFN xrdcp 0.2 beta
041118 17:02:34 001 Xrd: main KanAccess.cfg -->
root://bbrprod05:1094////prod/subskim/SPskims/test5/001237/200301/1008/
S0002/V04/
041118 17:02:34 001 Xrd:  (C) 2004 SLAC XrdClientAdmin 0.2 beta
041118 17:02:34 001 Xrd: GetDomainToMatch GetHostName(noric02) returned
name=noric02.slac.stanford.edu
041118 17:02:34 001 Xrd: GetDomainToMatch GetDomain(noric02) -->
slac.stanford.edu
041118 17:02:34 001 Xrd: XrdClientUrlSet List of servers to connect to
is [bbrprod05:1094]
041118 17:02:34 001 Xrd: XrdClientUrlSet Remote file to open is
'///prod/subskim/SPskims/test5/001237/200301/1008/S0002/V04/'
041118 17:02:34 001 Xrd: ConvertSingleDNSAlias Resolving bbrprod05:1094
041118 17:02:34 001 Xrd: ConvertSingleDNSAlias Found host
bbrprod05.slac.stanford.edu with addr 134.79.87.115
041118 17:02:34 001 Xrd: ShowUrls The converted URLs count is 1
041118 17:02:34 001 Xrd: ShowUrls URL n.1:
bbrprod05.slac.stanford.edu:1094//.
041118 17:02:34 001 Xrd: GetDomainToMatch
GetHostName(bbrprod05.slac.stanford.edu) returned
name=bbrprod05.slac.stanford.edu
041118 17:02:34 001 Xrd: GetDomainToMatch
GetDomain(bbrprod05.slac.stanford.edu) --> slac.stanford.edu
041118 17:02:34 001 Xrd: CheckHostDomain Resolved
[bbrprod05.slac.stanford.edu]'s domain name into [slac.stanford.edu]
041118 17:02:34 001 Xrd: CheckHostDomain Access granted to the domain
of [bbrprod05.slac.stanford.edu].
041118 17:02:34 001 Xrd: GetDomainToMatch
GetHostName(bbrprod05.slac.stanford.edu) returned
name=bbrprod05.slac.stanford.edu
041118 17:02:34 001 Xrd: GetDomainToMatch
GetDomain(bbrprod05.slac.stanford.edu) --> slac.stanford.edu
041118 17:02:34 001 Xrd: CheckHostDomain Resolved
[bbrprod05.slac.stanford.edu]'s domain name into [slac.stanford.edu]
041118 17:02:34 001 Xrd: CheckHostDomain Access granted to the domain
of [bbrprod05.slac.stanford.edu].
041118 17:02:34 001 Xrd: CreateTXNf Trying to connect to
bbrprod05.slac.stanford.edu:1094. Connect try 1
041118 17:02:34 001 Xrd: ConnectionMgr Explicitly requested not to
start the garbage collector thread. Are you sure?
041118 17:02:34 001 Xrd: Connect Creating a logical connection...
041118 17:02:34 001 Xrd: Connect Physical connection not found.
Creating a new one...
041118 17:02:34 001 Xrd: Connect Connecting to
[bbrprod05.slac.stanford.edu:1094]
041118 17:02:34 001 Xrd: ClientSock::TryConnect Trying to connect
tobbrprod05.slac.stanford.edu(134.79.87.115):1094
041118 17:02:34 001 Xrd: Connect Connected to
[bbrprod05.slac.stanford.edu:1094]
041118 17:02:34 001 Xrd: Connect New physical connection to server
bbrprod05.slac.stanford.edu:1094 succesfully created.
041118 17:02:34 001 Xrd: Connect LogConn: size:1 count: 1PhyConn:
size:1 count: 1
041118 17:02:34 001 Xrd: Connect Connect(bbrprod05.slac.stanford.edu,
1094) returned 0
041118 17:02:34 001 Xrd: CreateTXNf The logical connection id is 0.
This will be the streamid for this client
041118 17:02:34 001 Xrd: CreateTXNf Working url is
bbrprod05.slac.stanford.edu:1094//
041118 17:02:34 001 Xrd: DoHandShake HandShake step 1: Sending 20 bytes
to the server [bbrprod05.slac.stanford.edu:1094]
041118 17:02:34 001 Xrd: DoHandShake HandShake step 2: Reading 4 bytes
from server [bbrprod05.slac.stanford.edu:1094].
041118 17:02:34 001 Xrd: DoHandShake HandShake step 3: Reading 12 bytes
from server [bbrprod05.slac.stanford.edu:1094].
041118 17:02:34 001 Xrd: StartReader Starting reader thread...
[New Thread -1221256272 (LWP 27880)]
041118 17:02:34 000 Xrd: SocketReaderThread Reader Thread starting.
041118 17:02:34 001 Xrd: GetAccessToSrv Ok, the server on
[bbrprod05.slac.stanford.edu:1094] is an xrootd data server.
041118 17:02:34 001 Xrd: DoLogin Logging into the server
[bbrprod05.slac.stanford.edu:1094]. pid=27851 uid=bbrskim
041118 17:02:34 001 Xrd: SendGenCommand Sending command
XTNetconn::doLogin
041118 17:02:34 001 Xrd: ReadPartialAnswer Reading a XrdClientMessage
from the server [bbrprod05.slac.stanford.edu:1094]...
041118 17:02:34 001 Xrd: Create Access to server granted.
041118 17:02:34 001 Xrd: Connect Connected.
041118 17:02:34 001 Xrd: SendGenCommand Sending command Stat
041118 17:02:34 001 Xrd: ReadPartialAnswer Reading a XrdClientMessage
from the server [bbrprod05.slac.stanford.edu:1094]...
041118 17:02:35 001 Xrd: ReadPartialAnswer Server
[bbrprod05.slac.stanford.edu:1094] did not answer OK. Resp status is
[kXR_error]
041118 17:02:35 001 Xrd: Stat Server [bbrprod05.slac.stanford.edu:1094]
did not return OK message for last request.
041118 17:02:35 001 Xrd: SendGenCommand Server declared error
3011:Unable to locate
/prod/subskim/SPskims/test5/001237/200301/1008/S0002/V04/; No such file
or directory
041118 17:02:35 001 Xrd: SendGenCommand Sending command Stat
041118 17:02:35 001 Xrd: ReadPartialAnswer Reading a XrdClientMessage
from the server [bbrprod05.slac.stanford.edu:1094]...
041118 17:02:35 001 Xrd: SendGenCommand Sending command Stat
041118 17:02:35 001 Xrd: ReadPartialAnswer Reading a XrdClientMessage
from the server [bbrprod05.slac.stanford.edu:1094]...
041118 17:02:35 001 Xrd: SendGenCommand Sending command Stat
041118 17:02:35 001 Xrd: ReadPartialAnswer Reading a XrdClientMessage
from the server [bbrprod05.slac.stanford.edu:1094]...
041118 17:02:35 001 Xrd: SendGenCommand Sending command Stat
041118 17:02:35 001 Xrd: ReadPartialAnswer Reading a XrdClientMessage
from the server [bbrprod05.slac.stanford.edu:1094]...

Program received signal SIGSEGV, Segmentation fault.
[Switching to Thread -1221110560 (LWP 27851)]
0xb73f07fd in _int_malloc () from /lib/tls/libc.so.6
(gdb) where
#0  0xb73f07fd in _int_malloc () from /lib/tls/libc.so.6
#1  0xb73efb0d in malloc () from /lib/tls/libc.so.6
#2  0xb75658ae in operator new(unsigned) () from /usr/lib/libstdc++.so.5
#3  0xb7551b93 in std::__default_alloc_template<true,
0>::allocate(unsigned) () from /usr/lib/libstdc++.so.5
#4  0xb7557bf8 in std::string::_Rep::_S_create(unsigned,
std::allocator<char> const&) () from /usr/lib/libstdc++.so.5
#5  0xb7557d29 in std::string::_Rep::_M_clone(std::allocator<char>
const&, unsigned) () from /usr/lib/libstdc++.so.5
#6  0xb7555a96 in std::string::reserve(unsigned) () from
/usr/lib/libstdc++.so.5
#7  0xb754ccb9 in std::basic_stringbuf<char, std::char_traits<char>,
std::allocator<char> >::overflow(int) () from /usr/lib/libstdc++.so.5
#8  0xb7552f64 in std::basic_streambuf<char, std::char_traits<char>
 >::xsputn(char const*, int) () from /usr/lib/libstdc++.so.5
#9  0xb7549961 in std::ostream::write(char const*, int) () from
/usr/lib/libstdc++.so.5
#10 0xb754a076 in std::basic_ostream<char, std::char_traits<char> >&
std::operator<< <std::char_traits<char> >(std::basic_ostream<char,
std::char_traits<char> >&, char const*) () from /usr/lib/libstdc++.so.5
#11 0x08057532 in XrdClientConn::ReadPartialAnswer(XReqErrorType&,
unsigned&, ClientRequest*, bool, void**,
XrdClientConn::EThreeStateReadHandler&) (this=0x8089ec8,
errorType=@0xbfffb204, TotalBlkSize=@0xbfffb20c, req=0xbfffbbb0,
HasToAlloc=false, tmpMoreData=0xbfffb208,
     what_to_do=@0xbfffb1fc) at XrdClientConn.cc:726
#12 0x08054e5c in XrdClientConn::ClientServerCmd(ClientRequest*, void
const*, void**, void*, bool) (this=0x8089ec8, req=0xbfffbbb0,
     reqMoreData=0x8090af0, answMoreDataAllocated=0x0,
answMoreData=0xbfffb3b0, HasToAlloc=false) at XrdClientConn.cc:238
#13 0x08055138 in XrdClientConn::SendGenCommand(ClientRequest*, void
const*, void**, void*, bool, char*, ServerResponseHeader*)
(this=0x8089ec8,
     req=0xbfffbbb0, reqMoreData=0x8090af0, answMoreDataAllocated=0x0,
answMoreData=0xbfffb3b0, HasToAlloc=false, CmdName=0x807ba58 "Stat",
     srh=0x0) at XrdClientConn.cc:300
#14 0x0806d8a1 in XrdClientAdmin::Stat(char*, long&, long&, long&,
long&) (this=0x8089728,
     fname=0x8090af0 "///prod/subskim/SPskims/test5/001237",
id=@0xbfffbe78, size=@0xbfffbe74, flags=@0xbfffbe70,
modtime=@0xbfffbe6c)
     at XrdClientAdmin.cc:250
#15 0x0804c9ba in CreateDestPath_xrd(XrdClientString, bool) (url=
       {data = 0x8087e18
"root://bbrprod05.slac.stanford.edu:1094////prod/subskim/SPskims/test5/
001237/200301/1008/S0002/V04/", size = 99, capacity = 256}, isdir=true)
at Xrdcp.cc:198
#16 0x0804e84b in main (argc=4, argv=0xbfffc144) at Xrdcp.cc:518


Cheers,
		Remi

---------------------------------------------------------------------
If it's green, it's biology. If it stinks, it's chemistry. If it has
numbers, it's math. If it doesn't work, it's technology. (anonymous)

*********************************************************************
Remigius K. Mommsen                 e-mail: [log in to unmask]
University of California, Irvine       URL:    http://cern.ch/mommsen
c/o SLAC                             voice:        ++1 (650) 926-3595
2575 Sand Hill Road #35                fax:        ++1 (650) 926-3882
Menlo Park, CA 94025, US              home:        ++1 (650) 233-9041
*********************************************************************