we have seen several (repeated, but not reproducible) 4.9.1 client crashes with the following stacktrace: ``` [root@p05798818c77523 ~]# gdb /usr/bin/xrdcp '/var/spool/abrt/ccpp-2019-07-06-19:25:02-27695/coredump' Core was generated by `/usr/bin/xrdcp -f --nopbar --server -DIStreamTimeout 600 -DIRequestTimeout 660'. Program terminated with signal 11, Segmentation fault. (gdb) bt #0 XrdCl::AsyncSocketHandler::OnRead (this=this@entry=0x17ee070) at /usr/src/debug/xrootd/xrootd/src/XrdCl/XrdClAsyncSocketHandler.cc:699 #1 0x00007fe78652420f in XrdCl::AsyncSocketHandler::Event (this=0x17ee070, type=<optimized out>) at /usr/src/debug/xrootd/xrootd/src/XrdCl/XrdClAsyncSocketHandler.cc:232 #2 0x00007fe7864b76f7 in (anonymous namespace)::SocketCallBack::Event (this=0x17eee60, chP=<optimized out>, cbArg=<optimized out>, evFlags=<optimized out>) at /usr/src/debug/xrootd/xrootd/src/XrdCl/XrdClPollerBuiltIn.cc:82 #3 0x00007fe785fe655d in XrdSys::IOEvents::Poller::CbkXeq (this=this@entry=0x17ec010, cP=cP@entry=0x17eef90, events=1, eNum=0, eTxt=<optimized out>) at /usr/src/debug/xrootd/xrootd/src/XrdSys/XrdSysIOEvents.cc:693 #4 0x00007fe785fe76a9 in XrdSys::IOEvents::PollE::Dispatch (this=this@entry=0x17ec010, cP=0x17eef90, pollEv=<optimized out>) at /usr/src/debug/xrootd/xrootd/src/XrdSys/XrdSysIOEventsPollE.icc:270 #5 0x00007fe785fe7889 in XrdSys::IOEvents::PollE::Begin (this=0x17ec010, syncsem=<optimized out>, retcode=<optimized out>, eTxt=<optimized out>) at /usr/src/debug/xrootd/xrootd/src/XrdSys/XrdSysIOEventsPollE.icc:225 #6 0x00007fe785fe42ad in XrdSys::IOEvents::BootStrap::Start (parg=0x7ffe52a7c6e0) at /usr/src/debug/xrootd/xrootd/src/XrdSys/XrdSysIOEvents.cc:131 #7 0x00007fe785fec927 in XrdSysThread_Xeq (myargs=0x17e8130) at /usr/src/debug/xrootd/xrootd/src/XrdSys/XrdSysPthread.cc:86 #8 0x00007fe784fa1dd5 in start_thread () from /lib64/libpthread.so.0 #9 0x00007fe7854caead in clone () from /lib64/libc.so.6 ``` Code is ``` 698 st = pIncHandler.first->ReadMessageBody( pIncoming, pSocket->GetFD(), 699 bytesRead ); ``` and it seems that `pIncHandler.first` points to some unhappy memory region: ``` (gdb) x/i $rip => 0x7fe786522a2d <XrdCl::AsyncSocketHandler::OnRead()+61>: mov (%rdi),%rax (gdb) p/x $rdi $1 = 0x47f1d10 (gdb) p pIncHandler $4 = {first = 0x47f1d10, second = true} (gdb) x 0x47f1d10 0x47f1d10: Cannot access memory at address 0x47f1d10 (gdb) info locals bytesRead = 32743 st = <optimized out> log = 0x17e35f0 (gdb) p pIncoming $2 = (XrdCl::Message *) 0x7fe778000950 (gdb) p pSocket $3 = (XrdCl::Socket *) 0x17ee180 ``` This was with `xrdcp`, but we've seen also EOS crashes (server acting as a client). Version: xrootd-client-4.9.1-1.el7.x86_64 -- You are receiving this because you are subscribed to this thread. Reply to this email directly or view it on GitHub: https://github.com/xrootd/xrootd/issues/1021 ######################################################################## Use REPLY-ALL to reply to list To unsubscribe from the XROOTD-DEV list, click the following link: https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1