Print

Print


Seen with "eoscp", but backtrace seems to be "generic" xrootd code. We have seen this so far on 3 machines.

Recent EOS core dumps on lxfsrf06b03.cern.ch [ eosatlas ]!
-rw------- 1 root daemon 61853696 Jan 27 03:00
/var/spool/eos/core/fst/core.1688
Core was generated by `eoscp -u 2 -g 2 -R -n -p -O
/var/eos/auth/.c0fd78e8-86f6-11e3-9078-000f53100350'.
Program terminated with signal 6, Aborted.
#0  0x0000003934030285 in raise () from /lib64/libc.so.6
#0  0x0000003934030285 in raise () from /lib64/libc.so.6
#1  0x0000003934031d30 in abort () from /lib64/libc.so.6
#2  0x00000039368bed94 in __gnu_cxx::__verbose_terminate_handler() ()
   from /usr/lib64/libstdc++.so.6
#3  0x00000039368bce46 in ?? () from /usr/lib64/libstdc++.so.6
#4  0x00000039368bce73 in std::terminate() () from /usr/lib64/libstdc++.so.6
#5  0x00000039368bcf71 in __cxa_throw () from /usr/lib64/libstdc++.so.6
#6  0x0000003849e23aad in Post (this=0xf210610, syncsem=<value optimized
out>, 
    retcode=<value optimized out>, eTxt=<value optimized out>)
    at /usr/src/debug/xrootd/xrootd/src/XrdSys/XrdSysPthread.hh:342
#7  XrdSys::IOEvents::PollE::Begin (this=0xf210610, 
    syncsem=<value optimized out>, retcode=<value optimized out>, 
    eTxt=<value optimized out>)
    at /usr/src/debug/xrootd/xrootd/src/XrdSys/XrdSysIOEventsPollE.icc:193
#8  0x0000003849e218c4 in XrdSys::IOEvents::BootStrap::Start (
    parg=0x7fff407d99e0)
    at /usr/src/debug/xrootd/xrootd/src/XrdSys/XrdSysIOEvents.cc:130
#9  0x0000003849e2019f in XrdSysThread_Xeq (myargs=<value optimized out>)
    at /usr/src/debug/xrootd/xrootd/src/XrdSys/XrdSysPthread.cc:86
#10 0x000000393480683d in start_thread () from /lib64/libpthread.so.0
#11 0x00000039340d4fad in clone () from /lib64/libc.so.6
(gdb) f 6
#6  0x0000003849e23aad in Post (this=0xf210610, syncsem=<value optimized out>, retcode=<value optimized out>, 
    eTxt=<value optimized out>) at /usr/src/debug/xrootd/xrootd/src/XrdSys/XrdSysPthread.hh:342
342                        {throw "sem_post() failed";}
(gdb) up
#7  XrdSys::IOEvents::PollE::Begin (this=0xf210610, syncsem=<value optimized out>, retcode=<value optimized out>, 
    eTxt=<value optimized out>) at /usr/src/debug/xrootd/xrootd/src/XrdSys/XrdSysIOEventsPollE.icc:193
193    syncsem->Post();
(gdb) quit

Output is just "terminate called after throwing an instance of 'char const*'"

Please let me know if you'd need access to the core files.

As per Lukasz, this might be an instance of the generic Linux glibc semaphore bug/race condition (https://bugzilla.redhat.com/show_bug.cgi?id=977550) that has needed workarounds elsewhere.

internal tracking: https://savannah.cern.ch/bugs/?103736


Reply to this email directly or view it on GitHub.



Use REPLY-ALL to reply to list

To unsubscribe from the XROOTD-DEV list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1