Print

Print


URL:
  <http://savannah.cern.ch/bugs/?91129>

                 Summary: XrdClient deadlocks on auth failure
                 Project: XROOTD
            Submitted by: bbockelm
            Submitted on: 2012-02-03 01:27
                Severity: 3 - Normal
                Priority: 5 - Normal
                  Status: None
                 Privacy: Public
             Assigned to: None
        Originator Email: 
             Open/Closed: Open
         Discussion Lock: Any
      Fixed by commit(s): 

    _______________________________________________________

Details:

Hi,

I think, in the stack trace below, Thread #1 has caused a deadlock by
disconnecting after an authentication failure.  This happens 100% of the time
in my current tests.  Should be Xrootd 3.1.0 (as shipped with CMSSW).

I've also copied the client session at debug level 1.

Brian


[bbockelm@t3 CMSSW_5_0_1]$ pstack 7484
Thread 3 (Thread 0x4256a940 (LWP 7485)):
#0  0x000000392240d524 in __lll_lock_wait () from /lib64/libpthread.so.0
#1  0x0000003922408e35 in _L_lock_1127 () from /lib64/libpthread.so.0
#2  0x0000003922408d33 in pthread_mutex_lock () from /lib64/libpthread.so.0
#3  0x00002b6f423f8480 in XrdClientConnectionMgr::GarbageCollect() () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libXrdClient.so
#4  0x00002b6f423f8745 in GarbageCollectorThread(void*, XrdClientThread*) ()
from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libXrdClient.so
#5  0x00002b6f42436d37 in XrdSysThread_Xeq () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libXrdClient.so
#6  0x000000392240673d in start_thread () from /lib64/libpthread.so.0
#7  0x0000003921cd44bd in clone () from /lib64/libc.so.6
Thread 2 (Thread 0x4396c940 (LWP 7488)):
#0  0x000000392240d524 in __lll_lock_wait () from /lib64/libpthread.so.0
#1  0x0000003922408e35 in _L_lock_1127 () from /lib64/libpthread.so.0
#2  0x0000003922408d33 in pthread_mutex_lock () from /lib64/libpthread.so.0
#3  0x00002b6f424019b2 in XrdClientPhyConnection::Disconnect() () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libXrdClient.so
#4  0x00002b6f42402722 in XrdClientPhyConnection::ReadRaw(void*, int, int,
int*) () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libXrdClient.so
#5  0x00002b6f423ffc0c in XrdClientMessage::ReadRaw(XrdClientPhyConnection*)
() from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libXrdClient.so
#6  0x00002b6f42401f0a in XrdClientPhyConnection::BuildMessage(bool, bool) ()
from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libXrdClient.so
#7  0x00002b6f4240362a in SocketReaderThread(void*, XrdClientThread*) () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libXrdClient.so
#8  0x00002b6f42436d37 in XrdSysThread_Xeq () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libXrdClient.so
#9  0x000000392240673d in start_thread () from /lib64/libpthread.so.0
#10 0x0000003921cd44bd in clone () from /lib64/libc.so.6
Thread 1 (Thread 0x2b6f410d8f70 (LWP 7484)):
#0  0x000000392240d524 in __lll_lock_wait () from /lib64/libpthread.so.0
#1  0x0000003922408e35 in _L_lock_1127 () from /lib64/libpthread.so.0
#2  0x0000003922408d33 in pthread_mutex_lock () from /lib64/libpthread.so.0
#3  0x00002b6f42400f70 in XrdClientPhyConnection::IsValid() () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libXrdClient.so
#4  0x00002b6f423f81fb in DumpPhyConn(char const*, XrdClientPhyConnection*,
void*) () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libXrdClient.so
#5  0x00002b6f423fba1e in XrdOucHash<XrdClientPhyConnection>::Apply(int
(*)(char const*, XrdClientPhyConnection*, void*), void*) () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libXrdClient.so
#6  0x00002b6f423f84d1 in XrdClientConnectionMgr::GarbageCollect() () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libXrdClient.so
#7  0x00002b6f423fa09c in XrdClientConnectionMgr::Disconnect(int, bool) ()
from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libXrdClient.so
#8  0x00002b6f423e83e0 in XrdClientConn::Disconnect(bool) () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libXrdClient.so
#9  0x00002b6f423f4dfd in XrdClientConn::DoLogin() () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libXrdClient.so
#10 0x00002b6f423f5c78 in XrdClientConn::GetAccessToSrv() () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libXrdClient.so
#11 0x00002b6f423dd153 in XrdClient::Open(unsigned short, unsigned short,
bool) () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libXrdClient.so
#12 0x00002b6f423b9e96 in XrdFile::open(char const*, int, int) () from
/home/cse496/bbockelm/tmp/CMSSW_5_0_1/lib/slc5_amd64_gcc434/libUtilitiesXrdAdaptor.so
#13 0x00002b6f423ba61d in XrdFile::XrdFile(std::basic_string<char,
std::char_traits<char>, std::allocator<char> > const&, int, int) () from
/home/cse496/bbockelm/tmp/CMSSW_5_0_1/lib/slc5_amd64_gcc434/libUtilitiesXrdAdaptor.so
#14 0x00002b6f423b2031 in XrdStorageMaker::open(std::basic_string<char,
std::char_traits<char>, std::allocator<char> > const&,
std::basic_string<char, std::char_traits<char>, std::allocator<char> >
const&, int) () from
/home/cse496/bbockelm/tmp/CMSSW_5_0_1/lib/slc5_amd64_gcc434/pluginUtilitiesXrdAdaptorPlugin.so
#15 0x00002b6f4172a3e1 in StorageFactory::open(std::basic_string<char,
std::char_traits<char>, std::allocator<char> > const&, int) () from
/home/cse496/bbockelm/tmp/CMSSW_5_0_1/lib/slc5_amd64_gcc434/libUtilitiesStorageFactory.so
#16 0x00002b6f416c9a64 in TStorageFactoryFile::TStorageFactoryFile(char
const*, char const*, char const*, int, int, bool) () from
/home/cse496/bbockelm/tmp/CMSSW_5_0_1/lib/slc5_amd64_gcc434/pluginIOPoolTFileAdaptor.so
#17 0x00002b6f416d066f in G__TFileAdaptorLinkDef_220_0_13(G__value*, char
const*, G__param*, int) () from
/home/cse496/bbockelm/tmp/CMSSW_5_0_1/lib/slc5_amd64_gcc434/pluginIOPoolTFileAdaptor.so
#18 0x00002b6f402de5dc in Cint::G__CallFunc::Execute(void*) () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libCint.so
#19 0x00002b6f3fd089d1 in TCint::CallFunc_ExecInt(void*, void*) const () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libCore.so
#20 0x00002b6f3fd36634 in TMethodCall::Execute(void*, long&) () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libCore.so
#21 0x00002b6f3fc98b14 in TPluginHandler::ExecPlugin(int, ...) () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libCore.so
#22 0x00002b6f3f83bbb6 in TFile::Open(char const*, char const*, char const*,
int, int) () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/external/slc5_amd64_gcc434/lib/libRIO.so
#23 0x00002b6f421d33f9 in edm::InputFile::InputFile(char const*, char const*)
() from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/lib/slc5_amd64_gcc434/pluginIOPoolInput.so
#24 0x00002b6f42204043 in edm::RootInputFileSequence::initFile(bool) () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/lib/slc5_amd64_gcc434/pluginIOPoolInput.so
#25 0x00002b6f4220887f in
edm::RootInputFileSequence::RootInputFileSequence(edm::ParameterSet const&,
edm::PoolSource const&, edm::InputFileCatalog const&, edm::PrincipalCache&,
edm::InputType::InputType) () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/lib/slc5_amd64_gcc434/pluginIOPoolInput.so
#26 0x00002b6f421d8b11 in edm::PoolSource::PoolSource(edm::ParameterSet
const&, edm::InputSourceDescription const&) () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/lib/slc5_amd64_gcc434/pluginIOPoolInput.so
#27 0x00002b6f421d5434 in edmplugin::PluginFactory<edm::InputSource*
()(edm::ParameterSet const&, edm::InputSourceDescription
const&)>::PMaker<edm::PoolSource>::create(edm::ParameterSet const&,
edm::InputSourceDescription const&) const () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/lib/slc5_amd64_gcc434/pluginIOPoolInput.so
#28 0x00002b6f3e1f1495 in
edm::InputSourceFactory::makeInputSource(edm::ParameterSet const&,
edm::InputSourceDescription const&) const () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/lib/slc5_amd64_gcc434/libFWCoreFramework.so
#29 0x00002b6f3e1925fb in edm::makeInput(edm::ParameterSet&,
edm::CommonParams const&, edm::ProductRegistry&, edm::PrincipalCache&,
boost::shared_ptr<edm::ActivityRegistry>,
boost::shared_ptr<edm::ProcessConfiguration>) () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/lib/slc5_amd64_gcc434/libFWCoreFramework.so
#30 0x00002b6f3e194a9e in
edm::EventProcessor::init(boost::shared_ptr<edm::ProcessDesc>&,
edm::ServiceToken const&, edm::serviceregistry::ServiceLegacy) () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/lib/slc5_amd64_gcc434/libFWCoreFramework.so
#31 0x00002b6f3e196c9d in
edm::EventProcessor::EventProcessor(boost::shared_ptr<edm::ProcessDesc>&,
edm::ServiceToken const&, edm::serviceregistry::ServiceLegacy) () from
/opt/osg/app/cmssoft/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_5_0_1/lib/slc5_amd64_gcc434/libFWCoreFramework.so
#32 0x000000000040f0a4 in main ()



[bbockelm@t3 CMSSW_5_0_1]$ cmsRun simple.py -j test2.fjr
120202 19:18:16 001 Xrd: CheckErrorStatus: Server [brian-test.unl.edu]
declared: (error code: 3010)
120202 19:18:16 001 Xrd: DoAuthentication: 
XrdSec: No authentication protocols are available.
120202 19:18:16 001 Xrd: Connect: Authentication failure: : unable to get
protocol object
02-Feb-2012 19:18:16 CST  Initiating request to open file
root://brian-test.unl.edu//store/relval/CMSSW_5_0_1/Mu/RECO/GR_R_50_V11_RelVal_mu2010B-v1/0224/6289ACC2-4E4B-E111-B830-002618943971.root
120202 19:18:16 001 Xrd: ShowUrls: The converted URLs count is 1
120202 19:18:16 001 Xrd: ShowUrls: URL n.1:
root://brian-test.unl.edu:1094//store/relval/CMSSW_5_0_1/Mu/RECO/GR_R_50_V11_RelVal_mu2010B-v1/0224/6289ACC2-4E4B-E111-B830-002618943971.root.
120202 19:18:16 001 Xrd: ShowUrls: The converted URLs count is 1
120202 19:18:16 001 Xrd: ShowUrls: URL n.1:
root://brian-test.unl.edu:1094//store/relval/CMSSW_5_0_1/Mu/RECO/GR_R_50_V11_RelVal_mu2010B-v1/0224/6289ACC2-4E4B-E111-B830-002618943971.root.
120202 19:18:16 001 Xrd: CheckErrorStatus: Server [brian-test.unl.edu]
declared: (error code: 3010)
120202 19:18:16 001 Xrd: DoAuthentication: 
XrdSec: No authentication protocols are available.
120202 19:18:16 001 Xrd: CheckErrorStatus: Server [brian-test.unl.edu]
declared: Invalid request; user not authenticated(error code: 3006)
120202 19:18:16 001 Xrd: DumpPhyConn: Phyconn entry,
[log in to unmask]:1094', LogCnt=1 NotValid
120202 19:18:16 001 Xrd: DumpPhyConn: Phyconn entry, key='Trashed
connection', LogCnt=1 NotValid
120202 19:18:16 001 Xrd: DumpPhyConn: Phyconn entry, key='Trashed
connection', LogCnt=0 NotValid
120202 19:18:16 001 Xrd: Open: Access to server failed: error: 3006 (Invalid
request; user not authenticated) - retrying.
120202 19:18:16 001 Xrd: Open: Connection attempt failed. Sleeping 5
seconds.
120202 19:18:16 7484 Xrd: XrdClientMessage::ReadRaw: Failed to read header (8
bytes).
120202 19:18:17 7484 Xrd: XrdClientMessage::ReadRaw: Failed to read header (8
bytes).
120202 19:18:22 001 Xrd: ShowUrls: The converted URLs count is 1
120202 19:18:22 001 Xrd: ShowUrls: URL n.1:
root://brian-test.unl.edu:1094//store/relval/CMSSW_5_0_1/Mu/RECO/GR_R_50_V11_RelVal_mu2010B-v1/0224/6289ACC2-4E4B-E111-B830-002618943971.root.
120202 19:18:22 001 Xrd: CheckErrorStatus: Server [brian-test.unl.edu]
declared: (error code: 3010)
120202 19:18:22 001 Xrd: DoAuthentication: 
XrdSec: No authentication protocols are available.
120202 19:18:22 001 Xrd: CheckErrorStatus: Server [brian-test.unl.edu]
declared: Invalid request; user not authenticated(error code: 3006)





    _______________________________________________________

Reply to this item at:

  <http://savannah.cern.ch/bugs/?91129>

_______________________________________________
  Message sent via/by LCG Savannah
  http://savannah.cern.ch/

########################################################################
Use REPLY-ALL to reply to list

To unsubscribe from the XROOTD-DEV list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1