We have seen several 4.9.1 client crashes (repeated, but not reproducible) with the following stack trace:

[root@p05798818c77523 ~]# gdb /usr/bin/xrdcp '/var/spool/abrt/ccpp-2019-07-06-19:25:02-27695/coredump'
Core was generated by `/usr/bin/xrdcp -f --nopbar --server -DIStreamTimeout 600 -DIRequestTimeout 660'.
Program terminated with signal 11, Segmentation fault.
(gdb) bt
#0  XrdCl::AsyncSocketHandler::OnRead (this=this@entry=0x17ee070) at /usr/src/debug/xrootd/xrootd/src/XrdCl/XrdClAsyncSocketHandler.cc:699
#1  0x00007fe78652420f in XrdCl::AsyncSocketHandler::Event (this=0x17ee070, type=<optimized out>)
    at /usr/src/debug/xrootd/xrootd/src/XrdCl/XrdClAsyncSocketHandler.cc:232
#2  0x00007fe7864b76f7 in (anonymous namespace)::SocketCallBack::Event (this=0x17eee60, chP=<optimized out>, cbArg=<optimized out>, evFlags=<optimized out>)
    at /usr/src/debug/xrootd/xrootd/src/XrdCl/XrdClPollerBuiltIn.cc:82
#3  0x00007fe785fe655d in XrdSys::IOEvents::Poller::CbkXeq (this=this@entry=0x17ec010, cP=cP@entry=0x17eef90, events=1, eNum=0, eTxt=<optimized out>)
    at /usr/src/debug/xrootd/xrootd/src/XrdSys/XrdSysIOEvents.cc:693
#4  0x00007fe785fe76a9 in XrdSys::IOEvents::PollE::Dispatch (this=this@entry=0x17ec010, cP=0x17eef90, pollEv=<optimized out>)
    at /usr/src/debug/xrootd/xrootd/src/XrdSys/XrdSysIOEventsPollE.icc:270
#5  0x00007fe785fe7889 in XrdSys::IOEvents::PollE::Begin (this=0x17ec010, syncsem=<optimized out>, retcode=<optimized out>, eTxt=<optimized out>)
    at /usr/src/debug/xrootd/xrootd/src/XrdSys/XrdSysIOEventsPollE.icc:225
#6  0x00007fe785fe42ad in XrdSys::IOEvents::BootStrap::Start (parg=0x7ffe52a7c6e0) at /usr/src/debug/xrootd/xrootd/src/XrdSys/XrdSysIOEvents.cc:131
#7  0x00007fe785fec927 in XrdSysThread_Xeq (myargs=0x17e8130) at /usr/src/debug/xrootd/xrootd/src/XrdSys/XrdSysPthread.cc:86
#8  0x00007fe784fa1dd5 in start_thread () from /lib64/libpthread.so.0
#9  0x00007fe7854caead in clone () from /lib64/libc.so.6

The code at XrdClAsyncSocketHandler.cc:698-699 is:

698	      st = pIncHandler.first->ReadMessageBody( pIncoming, pSocket->GetFD(),
699	                                               bytesRead );

and it seems that pIncHandler.first points to a memory region that cannot be accessed:

(gdb) x/i $rip
=> 0x7fe786522a2d <XrdCl::AsyncSocketHandler::OnRead()+61>:	mov    (%rdi),%rax
(gdb) p/x $rdi
$1 = 0x47f1d10
(gdb) p pIncHandler
$4 = {first = 0x47f1d10, second = true}
(gdb) x 0x47f1d10
0x47f1d10:	Cannot access memory at address 0x47f1d10
(gdb) info locals 
bytesRead = 32743
st = <optimized out>
log = 0x17e35f0
(gdb) p pIncoming 
$2 = (XrdCl::Message *) 0x7fe778000950
(gdb) p pSocket
$3 = (XrdCl::Socket *) 0x17ee180
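
For readers less familiar with this failure mode, here is a minimal sketch of why the fault would land on that instruction. It assumes ReadMessageBody is a virtual method and that pIncHandler is a pair of a handler pointer and a flag, which is what the gdb output suggests. The names IncomingHandler, FixedSizeHandler, HandlerSlot and DispatchRead are hypothetical stand-ins, not the actual XrdCl classes.

```cpp
#include <cassert>
#include <cstdint>
#include <utility>

// Hypothetical stand-in for an incoming-message handler interface;
// this is NOT the actual XrdCl class.
struct IncomingHandler
{
  virtual ~IncomingHandler() = default;
  virtual int ReadMessageBody( int fd, uint32_t &bytesRead ) = 0;
};

// Hypothetical concrete handler that pretends to read a fixed number of bytes.
struct FixedSizeHandler : IncomingHandler
{
  int ReadMessageBody( int /*fd*/, uint32_t &bytesRead ) override
  {
    bytesRead = 16;
    return 0;
  }
};

// Same shape as the value gdb printed: {first = <pointer>, second = <bool>}.
using HandlerSlot = std::pair<IncomingHandler*, bool>;

// Virtual call through the stored pointer. On x86-64 Linux the first step
// of such a call is loading the vtable pointer from the start of the object,
// i.e. a read from *slot.first. If slot.first is stale, that read is the
// first access that can fault.
int DispatchRead( HandlerSlot &slot, int fd, uint32_t &bytesRead )
{
  // A null check catches an empty slot, but it cannot detect a non-null
  // pointer to an object that has already been destroyed.
  assert( slot.first != nullptr );
  return slot.first->ReadMessageBody( fd, bytesRead );
}
```

With a live handler the call simply dispatches to the override. In the core dump above, `mov (%rdi),%rax` with `$rdi` equal to pIncHandler.first is consistent with that initial vtable load failing, which would point at a dangling or corrupted handler pointer rather than at a problem inside ReadMessageBody itself.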

This was with xrdcp, but we have also seen EOS crashes (with the server acting as a client).

Version: xrootd-client-4.9.1-1.el7.x86_64



