XROOTD-L Archives

Support use of xrootd by HEP experiments

XROOTD-L@LISTSERV.SLAC.STANFORD.EDU

Subject:
From:
Wilko Kroeger
Date:
Fri, 19 Nov 2004 09:33:52 -0800 (PST)

Hello Fabrizio

I am not sure whether I am seeing the same problem as Remi, but if I run:

xrdcp -d2 test.root root://bbrprod05:1094////prod/s5/s4/s3/s2/s1/wk.root

It works if the file (wk.root) doesn't exist, but if it exists (with
proper permissions) it fails and produces a core:

......
041119 09:25:15 001 Xrd: DoLogin Logging into the server
[bbrprod05.slac.stanford.edu:1094]. pid=23362 uid=wilko
041119 09:25:15 001 Xrd: SendGenCommand Sending command XTNetconn::doLogin
041119 09:25:15 001 Xrd: ReadPartialAnswer Reading a XrdClientMessage from
the server [bbrprod05.slac.stanford.edu:1094]...
041119 09:25:15 001 Xrd: Create Access to server granted.
041119 09:25:15 001 Xrd: Connect Connected.
041119 09:25:15 001 Xrd: SendGenCommand Sending command Stat
041119 09:25:15 001 Xrd: ReadPartialAnswer Reading a XrdClientMessage from
the server [bbrprod05.slac.stanford.edu:1094]...
Segmentation fault (core dumped)

The xrootd log on bbrprod05 shows:

041119 09:29:42 066 XrootdXeq: User logged in as wilko.23913:17@noric04
041119 09:29:42 066 wilko.23913:17@noric04 XrootdProtocol: 0000 req=3017 dlen=30
041119 09:29:42 066 wilko.23913:17@noric04 XrootdProtocol: 0000 rc=0 stat
      /prod/s5/s4/s3/s2/s1/wk.root
041119 09:29:42 066 wilko.23913:17@noric04 XrootdResponse: 0000 sending 35 data bytes; rc=0
041119 09:29:42 066 wilko.23913:17@noric04 XrdPoll: FD 17 detached from poller 0; num=0
041119 09:29:42 066 XrdLink: wilko.23913:17@noric04 disconnected after 0:00:00

gdb showed:

#0  0xb73ed7fd in _int_malloc () from /lib/tls/libc.so.6
#1  0xb73ecb0d in malloc () from /lib/tls/libc.so.6
#2  0xb73ecd32 in realloc () from /lib/tls/libc.so.6
#3  0x080719c9 in SocketReaderThread ()
#4  0x0804f276 in ReaderThread_loc(void*) ()
#5  0x0804ecb9 in ReaderThread_loc(void*) ()
#6  0x0804f348 in ReaderThread_loc(void*) ()
#7  0x0804f502 in ReaderThread_loc(void*) ()
#8  0x0804eea8 in ReaderThread_loc(void*) ()
#9  0x08070b52 in SocketReaderThread ()
#10 0x0804e10f in ReaderThread_loc(void*) ()
#11 0xb7391748 in __libc_start_main () from /lib/tls/libc.so.6
#12 0x0804baf5 in ?? ()


I will be gone until Monday, so I won't be able to do much testing, but
I should be online from time to time.


Cheers,
   Wilko


On Fri, 19 Nov 2004, Fabrizio Furano wrote:

> Hi,
>
>   I am not able to reproduce the problem in bbrprod01.
>
>   I sent a number of times commands like this:
>
> bin/arch_dbg/xrdcp -d2 configure
> root://bbrprod01:1094//prod/test_fabrizio1911-12/subskim/SPskims/test5/001237/200301/1008//a/b/c/d/e/f/g/h/i/arggghhh/a/b/c/d/e/f/g/h/i//j/
>
> As you can see in /kanga/prod all my tests were successful.
> Can you point me to a testbed where things crash?
>
> BTW, why do you usually write ////prod? (I tried this too)
>
> Another thing. Has libXrdOfs on bbrprod01 been updated? The newest
> versions are supposed to create new files with correct permissions with
> the flags XrdClient sends.
>
> Fabrizio
>
> Wilko Kroeger wrote:
> > Hello
> >
> > Below is a forwarded email from Remi (awaits moderator approval for
> > xrootd-l).
> >
> > Wilko
> >
> >
> > I have an ugly problem with xrdcp from today's cvs HEAD when it tries
> > to create more than one new directory on the server. It crashes in
> > XrdClientConn::ReadPartialAnswer after creation of the 2nd or higher
> > directory. Interestingly, sometimes it succeeds.
> >
> > This worked flawlessly with the cvs version from Nov 12.
> >
> > I tried bbrprod01 and bbrprod05 with the same result.
> >
> > Here is the output from gdb:
> >
> > (gdb)  run -d2 KanAccess.cfg
> > root://bbrprod05:1094////prod/subskim/SPskims/test5/001237/200301/1008/
> > S0002/V04/
> > The program being debugged has been started already.
> > Start it from the beginning? (y or n) y
> >
> > Starting program:
> > /afs/slac.stanford.edu/u/br/bbrskim/releases/test-16.0.1a/xrootd/bin/
> > i386_rhel30/xrdcp -d2 KanAccess.cfg
> > root://bbrprod05:1094////prod/subskim/SPskims/test5/001237/200301/1008/
> > S0002/V04/
> > [New Thread -1221110560 (LWP 27851)]
> > 041118 17:02:34 001 Xrd: main (C) 2004 SLAC INFN xrdcp 0.2 beta
> > 041118 17:02:34 001 Xrd: main KanAccess.cfg -->
> > root://bbrprod05:1094////prod/subskim/SPskims/test5/001237/200301/1008/
> > S0002/V04/
> > 041118 17:02:34 001 Xrd:  (C) 2004 SLAC XrdClientAdmin 0.2 beta
> > 041118 17:02:34 001 Xrd: GetDomainToMatch GetHostName(noric02) returned
> > name=noric02.slac.stanford.edu
> > 041118 17:02:34 001 Xrd: GetDomainToMatch GetDomain(noric02) -->
> > slac.stanford.edu
> > 041118 17:02:34 001 Xrd: XrdClientUrlSet List of servers to connect to
> > is [bbrprod05:1094]
> > 041118 17:02:34 001 Xrd: XrdClientUrlSet Remote file to open is
> > '///prod/subskim/SPskims/test5/001237/200301/1008/S0002/V04/'
> > 041118 17:02:34 001 Xrd: ConvertSingleDNSAlias Resolving bbrprod05:1094
> > 041118 17:02:34 001 Xrd: ConvertSingleDNSAlias Found host
> > bbrprod05.slac.stanford.edu with addr 134.79.87.115
> > 041118 17:02:34 001 Xrd: ShowUrls The converted URLs count is 1
> > 041118 17:02:34 001 Xrd: ShowUrls URL n.1:
> > bbrprod05.slac.stanford.edu:1094//.
> > 041118 17:02:34 001 Xrd: GetDomainToMatch
> > GetHostName(bbrprod05.slac.stanford.edu) returned
> > name=bbrprod05.slac.stanford.edu
> > 041118 17:02:34 001 Xrd: GetDomainToMatch
> > GetDomain(bbrprod05.slac.stanford.edu) --> slac.stanford.edu
> > 041118 17:02:34 001 Xrd: CheckHostDomain Resolved
> > [bbrprod05.slac.stanford.edu]'s domain name into [slac.stanford.edu]
> > 041118 17:02:34 001 Xrd: CheckHostDomain Access granted to the domain
> > of [bbrprod05.slac.stanford.edu].
> > 041118 17:02:34 001 Xrd: GetDomainToMatch
> > GetHostName(bbrprod05.slac.stanford.edu) returned
> > name=bbrprod05.slac.stanford.edu
> > 041118 17:02:34 001 Xrd: GetDomainToMatch
> > GetDomain(bbrprod05.slac.stanford.edu) --> slac.stanford.edu
> > 041118 17:02:34 001 Xrd: CheckHostDomain Resolved
> > [bbrprod05.slac.stanford.edu]'s domain name into [slac.stanford.edu]
> > 041118 17:02:34 001 Xrd: CheckHostDomain Access granted to the domain
> > of [bbrprod05.slac.stanford.edu].
> > 041118 17:02:34 001 Xrd: CreateTXNf Trying to connect to
> > bbrprod05.slac.stanford.edu:1094. Connect try 1
> > 041118 17:02:34 001 Xrd: ConnectionMgr Explicitly requested not to
> > start the garbage collector thread. Are you sure?
> > 041118 17:02:34 001 Xrd: Connect Creating a logical connection...
> > 041118 17:02:34 001 Xrd: Connect Physical connection not found.
> > Creating a new one...
> > 041118 17:02:34 001 Xrd: Connect Connecting to
> > [bbrprod05.slac.stanford.edu:1094]
> > 041118 17:02:34 001 Xrd: ClientSock::TryConnect Trying to connect
> > tobbrprod05.slac.stanford.edu(134.79.87.115):1094
> > 041118 17:02:34 001 Xrd: Connect Connected to
> > [bbrprod05.slac.stanford.edu:1094]
> > 041118 17:02:34 001 Xrd: Connect New physical connection to server
> > bbrprod05.slac.stanford.edu:1094 succesfully created.
> > 041118 17:02:34 001 Xrd: Connect LogConn: size:1 count: 1PhyConn:
> > size:1 count: 1
> > 041118 17:02:34 001 Xrd: Connect Connect(bbrprod05.slac.stanford.edu,
> > 1094) returned 0
> > 041118 17:02:34 001 Xrd: CreateTXNf The logical connection id is 0.
> > This will be the streamid for this client
> > 041118 17:02:34 001 Xrd: CreateTXNf Working url is
> > bbrprod05.slac.stanford.edu:1094//
> > 041118 17:02:34 001 Xrd: DoHandShake HandShake step 1: Sending 20 bytes
> > to the server [bbrprod05.slac.stanford.edu:1094]
> > 041118 17:02:34 001 Xrd: DoHandShake HandShake step 2: Reading 4 bytes
> > from server [bbrprod05.slac.stanford.edu:1094].
> > 041118 17:02:34 001 Xrd: DoHandShake HandShake step 3: Reading 12 bytes
> > from server [bbrprod05.slac.stanford.edu:1094].
> > 041118 17:02:34 001 Xrd: StartReader Starting reader thread...
> > [New Thread -1221256272 (LWP 27880)]
> > 041118 17:02:34 000 Xrd: SocketReaderThread Reader Thread starting.
> > 041118 17:02:34 001 Xrd: GetAccessToSrv Ok, the server on
> > [bbrprod05.slac.stanford.edu:1094] is an xrootd data server.
> > 041118 17:02:34 001 Xrd: DoLogin Logging into the server
> > [bbrprod05.slac.stanford.edu:1094]. pid=27851 uid=bbrskim
> > 041118 17:02:34 001 Xrd: SendGenCommand Sending command
> > XTNetconn::doLogin
> > 041118 17:02:34 001 Xrd: ReadPartialAnswer Reading a XrdClientMessage
> > from the server [bbrprod05.slac.stanford.edu:1094]...
> > 041118 17:02:34 001 Xrd: Create Access to server granted.
> > 041118 17:02:34 001 Xrd: Connect Connected.
> > 041118 17:02:34 001 Xrd: SendGenCommand Sending command Stat
> > 041118 17:02:34 001 Xrd: ReadPartialAnswer Reading a XrdClientMessage
> > from the server [bbrprod05.slac.stanford.edu:1094]...
> > 041118 17:02:35 001 Xrd: ReadPartialAnswer Server
> > [bbrprod05.slac.stanford.edu:1094] did not answer OK. Resp status is
> > [kXR_error]
> > 041118 17:02:35 001 Xrd: Stat Server [bbrprod05.slac.stanford.edu:1094]
> > did not return OK message for last request.
> > 041118 17:02:35 001 Xrd: SendGenCommand Server declared error
> > 3011:Unable to locate
> > /prod/subskim/SPskims/test5/001237/200301/1008/S0002/V04/; No such file
> > or directory
> > 041118 17:02:35 001 Xrd: SendGenCommand Sending command Stat
> > 041118 17:02:35 001 Xrd: ReadPartialAnswer Reading a XrdClientMessage
> > from the server [bbrprod05.slac.stanford.edu:1094]...
> > 041118 17:02:35 001 Xrd: SendGenCommand Sending command Stat
> > 041118 17:02:35 001 Xrd: ReadPartialAnswer Reading a XrdClientMessage
> > from the server [bbrprod05.slac.stanford.edu:1094]...
> > 041118 17:02:35 001 Xrd: SendGenCommand Sending command Stat
> > 041118 17:02:35 001 Xrd: ReadPartialAnswer Reading a XrdClientMessage
> > from the server [bbrprod05.slac.stanford.edu:1094]...
> > 041118 17:02:35 001 Xrd: SendGenCommand Sending command Stat
> > 041118 17:02:35 001 Xrd: ReadPartialAnswer Reading a XrdClientMessage
> > from the server [bbrprod05.slac.stanford.edu:1094]...
> >
> > Program received signal SIGSEGV, Segmentation fault.
> > [Switching to Thread -1221110560 (LWP 27851)]
> > 0xb73f07fd in _int_malloc () from /lib/tls/libc.so.6
> > (gdb) where
> > #0  0xb73f07fd in _int_malloc () from /lib/tls/libc.so.6
> > #1  0xb73efb0d in malloc () from /lib/tls/libc.so.6
> > #2  0xb75658ae in operator new(unsigned) () from /usr/lib/libstdc++.so.5
> > #3  0xb7551b93 in std::__default_alloc_template<true,
> > 0>::allocate(unsigned) () from /usr/lib/libstdc++.so.5
> > #4  0xb7557bf8 in std::string::_Rep::_S_create(unsigned,
> > std::allocator<char> const&) () from /usr/lib/libstdc++.so.5
> > #5  0xb7557d29 in std::string::_Rep::_M_clone(std::allocator<char>
> > const&, unsigned) () from /usr/lib/libstdc++.so.5
> > #6  0xb7555a96 in std::string::reserve(unsigned) () from
> > /usr/lib/libstdc++.so.5
> > #7  0xb754ccb9 in std::basic_stringbuf<char, std::char_traits<char>,
> > std::allocator<char> >::overflow(int) () from /usr/lib/libstdc++.so.5
> > #8  0xb7552f64 in std::basic_streambuf<char, std::char_traits<char>
> >  >::xsputn(char const*, int) () from /usr/lib/libstdc++.so.5
> > #9  0xb7549961 in std::ostream::write(char const*, int) () from
> > /usr/lib/libstdc++.so.5
> > #10 0xb754a076 in std::basic_ostream<char, std::char_traits<char> >&
> > std::operator<< <std::char_traits<char> >(std::basic_ostream<char,
> > std::char_traits<char> >&, char const*) () from /usr/lib/libstdc++.so.5
> > #11 0x08057532 in XrdClientConn::ReadPartialAnswer(XReqErrorType&,
> > unsigned&, ClientRequest*, bool, void**,
> > XrdClientConn::EThreeStateReadHandler&) (this=0x8089ec8,
> > errorType=@0xbfffb204, TotalBlkSize=@0xbfffb20c, req=0xbfffbbb0,
> > HasToAlloc=false, tmpMoreData=0xbfffb208,
> >      what_to_do=@0xbfffb1fc) at XrdClientConn.cc:726
> > #12 0x08054e5c in XrdClientConn::ClientServerCmd(ClientRequest*, void
> > const*, void**, void*, bool) (this=0x8089ec8, req=0xbfffbbb0,
> >      reqMoreData=0x8090af0, answMoreDataAllocated=0x0,
> > answMoreData=0xbfffb3b0, HasToAlloc=false) at XrdClientConn.cc:238
> > #13 0x08055138 in XrdClientConn::SendGenCommand(ClientRequest*, void
> > const*, void**, void*, bool, char*, ServerResponseHeader*)
> > (this=0x8089ec8,
> >      req=0xbfffbbb0, reqMoreData=0x8090af0, answMoreDataAllocated=0x0,
> > answMoreData=0xbfffb3b0, HasToAlloc=false, CmdName=0x807ba58 "Stat",
> >      srh=0x0) at XrdClientConn.cc:300
> > #14 0x0806d8a1 in XrdClientAdmin::Stat(char*, long&, long&, long&,
> > long&) (this=0x8089728,
> >      fname=0x8090af0 "///prod/subskim/SPskims/test5/001237",
> > id=@0xbfffbe78, size=@0xbfffbe74, flags=@0xbfffbe70,
> > modtime=@0xbfffbe6c)
> >      at XrdClientAdmin.cc:250
> > #15 0x0804c9ba in CreateDestPath_xrd(XrdClientString, bool) (url=
> >        {data = 0x8087e18
> > "root://bbrprod05.slac.stanford.edu:1094////prod/subskim/SPskims/test5/
> > 001237/200301/1008/S0002/V04/", size = 99, capacity = 256}, isdir=true)
> > at Xrdcp.cc:198
> > #16 0x0804e84b in main (argc=4, argv=0xbfffc144) at Xrdcp.cc:518
> >
> >
> > Cheers,
> > 		Remi
> >
> > ---------------------------------------------------------------------
> > If it's green, it's biology. If it stinks, it's chemistry. If it has
> > numbers, it's math. If it doesn't work, it's technology. (anonymous)
> >
> > *********************************************************************
> > Remigius K. Mommsen                 e-mail: [log in to unmask]
> > University of California, Irvine       URL:    http://cern.ch/mommsen
> > c/o SLAC                             voice:        ++1 (650) 926-3595
> > 2575 Sand Hill Road #35                fax:        ++1 (650) 926-3882
> > Menlo Park, CA 94025, US              home:        ++1 (650) 233-9041
> > *********************************************************************
>
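
Editorial note on Remi's trace: frame #14 shows Stat being called on "///prod/subskim/SPskims/test5/001237", which is only a prefix of the destination passed to CreateDestPath_xrd in frame #15, and the client log shows several Stat commands in a row before the crash. That suggests the destination directories are checked one prefix at a time. A minimal sketch of such a prefix walk follows, assuming empty path components (repeated slashes) are simply skipped. The function path_prefixes is hypothetical and is not taken from Xrdcp.cc; it only illustrates the order in which the prefixes would be visited.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper, NOT the xrdcp implementation: returns every
// directory prefix of `path`, shortest first. Runs of '/' are treated
// as a single separator, so "////prod/a" and "/prod/a" give the same
// result. (Whether xrdcp or the server collapses slashes this way is
// an assumption made for the sketch.)
std::vector<std::string> path_prefixes(const std::string& path) {
    std::vector<std::string> prefixes;
    std::string current;
    std::size_t i = 0;
    while (i < path.size()) {
        // Skip a run of '/' characters.
        while (i < path.size() && path[i] == '/') ++i;
        // Find the end of the next path component.
        std::size_t start = i;
        while (i < path.size() && path[i] != '/') ++i;
        if (i > start) {
            // Extend the running prefix by one component and record it.
            current += '/';
            current.append(path, start, i - start);
            prefixes.push_back(current);
        }
    }
    return prefixes;
}
```

Calling path_prefixes("////prod/s5/s4/s3/s2/s1/") yields /prod, /prod/s5, and so on up to /prod/s5/s4/s3/s2/s1. A client could stat each entry in that order and create the ones that are missing.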


