(Full log at debug level 2 here - http://glidemon.web.cern.ch/glidemon/show.php?log=http://vocms20.cern.ch/mon/cms980/140423_022500_vocms20:bbockelm_crab_xrootd_test_2/job_out.503.2.txt)

I'm seeing a consistent segfault in XrdCl when the open operation times out. Here are the last few debug messages:

[2014-04-23 17:16:39 +0200][Error  ][XRootD            ] [xrootd.rcac.purdue.edu:1094] Impossible to send message . Trying to recover.
[2014-04-23 17:16:39 +0200][Error  ][XRootD            ] [xrootd.rcac.purdue.edu:1094] Handling error while processing : [ERROR] Operation expired.
[2014-04-23 17:16:39 +0200][Error  ][XRootD            ] [xrootd.rcac.purdue.edu:1094] Unable to get the response to request 

The relevant stack frames are:

#8  std::basic_string<char, std::char_traits<char>, std::allocator<char> >::basic_string (this=0x7fbb18f156d0, __str=...) at /build/davidlt/build-BOOTSTRAP_slc6_amd64_gcc481/b/BUILD/slc6_amd64_gcc481/external/gcc/4.8.1/gcc-gcc-4_8-branch-199526/obj/x86_64-redhat-linux-gnu/libstdc++-v3/include/bits/basic_string.tcc:173
#9  0x00007fbb1a06f857 in XrdCl::FileStateHandler::OnOpen(XrdCl::XRootDStatus const*, XrdCl::OpenInfo const*, std::vector<XrdCl::HostInfo, std::allocator<XrdCl::HostInfo> > const*) () from /cvmfs/cms.cern.ch/slc6_amd64_gcc481/cms/cmssw/CMSSW_7_0_0_XROOTD/external/slc6_amd64_gcc481/lib/libXrdCl.so.1
#10 0x00007fbb1a0709fb in ?? () from /cvmfs/cms.cern.ch/slc6_amd64_gcc481/cms/cmssw/CMSSW_7_0_0_XROOTD/external/slc6_amd64_gcc481/lib/libXrdCl.so.1
#11 0x00007fbb1a056a7b in XrdCl::XRootDMsgHandler::HandleResponse() () from /cvmfs/cms.cern.ch/slc6_amd64_gcc481/cms/cmssw/CMSSW_7_0_0_XROOTD/external/slc6_amd64_gcc481/lib/libXrdCl.so.1
#12 0x00007fbb1a057132 in XrdCl::XRootDMsgHandler::HandleError(XrdCl::Status, XrdCl::Message*) () from /cvmfs/cms.cern.ch/slc6_amd64_gcc481/cms/cmssw/CMSSW_7_0_0_XROOTD/external/slc6_amd64_gcc481/lib/libXrdCl.so.1
#13 0x00007fbb1a057755 in XrdCl::XRootDMsgHandler::OnStatusReady(XrdCl::Message const*, XrdCl::Status) () from /cvmfs/cms.cern.ch/slc6_amd64_gcc481/cms/cmssw/CMSSW_7_0_0_XROOTD/external/slc6_amd64_gcc481/lib/libXrdCl.so.1
#14 0x00007fbb1a043e59 in XrdCl::OutQueue::Report(XrdCl::Status) () from /cvmfs/cms.cern.ch/slc6_amd64_gcc481/cms/cmssw/CMSSW_7_0_0_XROOTD/external/slc6_amd64_gcc481/lib/libXrdCl.so.1
#15 0x00007fbb1a03bab5 in XrdCl::Stream::Tick(long) () from /cvmfs/cms.cern.ch/slc6_amd64_gcc481/cms/cmssw/CMSSW_7_0_0_XROOTD/external/slc6_amd64_gcc481/lib/libXrdCl.so.1
#16 0x00007fbb1a03a44f in XrdCl::Channel::Tick(long) () from /cvmfs/cms.cern.ch/slc6_amd64_gcc481/cms/cmssw/CMSSW_7_0_0_XROOTD/external/slc6_amd64_gcc481/lib/libXrdCl.so.1
#17 0x00007fbb1a03a479 in ?? () from /cvmfs/cms.cern.ch/slc6_amd64_gcc481/cms/cmssw/CMSSW_7_0_0_XROOTD/external/slc6_amd64_gcc481/lib/libXrdCl.so.1
#18 0x00007fbb1a04493d in XrdCl::TaskManager::RunTasks() () from /cvmfs/cms.cern.ch/slc6_amd64_gcc481/cms/cmssw/CMSSW_7_0_0_XROOTD/external/slc6_amd64_gcc481/lib/libXrdCl.so.1

I don't have a good way to reproduce this, as I don't know of any handy way to force a server to time out. Looking at the code and the traceback (sorry, no debug symbols are available right now), I'm guessing that something is wrong with the host list; maybe it hasn't been initialized yet?
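
To make that guess concrete, here is a minimal sketch of the kind of guard that would avoid copying a std::string out of a missing or empty host list. Everything in it is hypothetical: HostEntry and LastHostUrl are illustrative stand-ins, not the real XrdCl::HostInfo or anything in FileStateHandler::OnOpen, and whether the host list is actually null or empty on a timeout is an assumption, not something the traceback confirms.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-in for a host-list entry; NOT the real XrdCl::HostInfo.
struct HostEntry {
    std::string url;
};

// Returns the URL of the last host in the list, or an empty string when the
// list pointer is null or the list has no entries. The assumption (unverified)
// is that a timed-out open may hand the callback such a list.
std::string LastHostUrl(const std::vector<HostEntry>* hosts) {
    if (hosts == nullptr || hosts->empty()) {
        return std::string();
    }
    // Safe only because the list was checked above; calling back() on an
    // empty vector is undefined behavior and could crash in the string copy.
    return hosts->back().url;
}
```

A caller would check for the empty result before using the URL, rather than dereferencing the list directly.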
