Hi Andy,

Okay, we got another one. Here's the info.
The core file is in /var/spool/xrootd/se/:

-sh-4.2$ ls -aos core.62610 
4715952 -rw-------. 1 xrootd 7023529984 Mar 23 23:10 core.62610

And the gdb output:

----
[...]
[New LWP 51791]
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib64/libthread_db.so.1".
Core was generated by `/usr/bin/xrootd -l /var/log/xrootd/xrootd.log -c /etc/xrootd/xrootd-se.cfg -k f'.
Program terminated with signal 11, Segmentation fault.
#0  0x00007fd1e2f8ff9a in XrdOucCallBack::Reply (
    this=this@entry=0x7fcf0c02e030, retVal=0, eValue=0, eText=<optimized out>, 
    Path=0x7fd08c0108e8 "061089e9-c6a2-42c1-9271-97d007970cfd")
    at /usr/src/debug/xrootd/xrootd/src/XrdOuc/XrdOucCallBack.cc:101
101        objCB->Done(retVal, &cbInfo, Path);
Missing separate debuginfos, use: debuginfo-install bzip2-libs-1.0.6-13.el7.x86_64 elfutils-libelf-0.163-3.el7.x86_64 elfutils-libs-0.163-3.el7.x86_64 expat-2.1.0-10.el7_3.x86_64 glibc-2.17-196.el7_4.2.x86_64 keyutils-libs-1.5.8-3.el7.x86_64 krb5-libs-1.14.1-27.el7_3.x86_64 libattr-2.4.46-12.el7.x86_64 libcap-2.22-9.el7.x86_64 libcom_err-1.42.9-9.el7.x86_64 libgcc-4.8.5-4.el7.x86_64 libgcrypt-1.5.3-12.el7_1.1.x86_64 libgpg-error-1.12-3.el7.x86_64 libselinux-2.5-6.el7.x86_64 libstdc++-4.8.5-4.el7.x86_64 libuuid-2.23.2-33.el7.x86_64 libxml2-2.9.1-6.el7_2.3.x86_64 lz4-1.7.5-2.el7.x86_64 openssl-libs-1.0.2k-8.el7.x86_64 pcre-8.32-15.el7_2.1.x86_64 sssd-client-1.13.0-40.el7_2.12.x86_64 systemd-libs-219-62.el7_6.2.x86_64 voms-2.0.14-1.4.osg34.el7.x86_64 xrootd-voms-plugin-0.6.0-2.osg34.el7.x86_64 xz-libs-5.2.2-1.el7.x86_64 zlib-1.2.7-17.el7.x86_64
(gdb) where
#0  0x00007fd1e2f8ff9a in XrdOucCallBack::Reply (
    this=this@entry=0x7fcf0c02e030, retVal=0, eValue=0, eText=<optimized out>, 
    Path=0x7fd08c0108e8 "061089e9-c6a2-42c1-9271-97d007970cfd")
    at /usr/src/debug/xrootd/xrootd/src/XrdOuc/XrdOucCallBack.cc:101
#1  0x00007fd1e3260e03 in XrdOfsTPCInfo::Reply (
    this=this@entry=0x7fcf0c0f2708, rC=rC@entry=0, eC=eC@entry=0, 
    eMsg=eMsg@entry=0x7fd1e3296511 "", mP=mP@entry=0x0)
    at /usr/src/debug/xrootd/xrootd/src/XrdOfs/XrdOfsTPCInfo.cc:145
#2  0x00007fd1e326099f in XrdOfsTPCJob::Done (this=0x7fd0740e5110, 
    pgmP=pgmP@entry=0xe084e0, eTxt=eTxt@entry=0xe087e0 "", rc=<optimized out>)
    at /usr/src/debug/xrootd/xrootd/src/XrdOfs/XrdOfsTPCJob.cc:130
#3  0x00007fd1e3261b8c in XrdOfsTPCProg::Run (this=0xe084e0)
    at /usr/src/debug/xrootd/xrootd/src/XrdOfs/XrdOfsTPCProg.cc:196
#4  0x00007fd1e3261bd9 in XrdOfsTPCProgRun (pp=<optimized out>)
    at /usr/src/debug/xrootd/xrootd/src/XrdOfs/XrdOfsTPCProg.cc:80
#5  0x00007fd1e2f86b47 in XrdSysThread_Xeq (myargs=0x7fd0b402eaf0)
    at /usr/src/debug/xrootd/xrootd/src/XrdSys/XrdSysPthread.cc:86
#6  0x00007fd1e2b3ae25 in start_thread () from /lib64/libpthread.so.0
#7  0x00007fd1e1e4034d in clone () from /lib64/libc.so.6
(gdb) quit
----

Does this help? Let us know what else we can do to debug further.

xrootd seems to keep running fine, and there haven't been many
of these 'already opened by 1 writer' errors (only 5 to 10 per day),
so that's good.

Thanks,

	Horst

########################################################################
Use REPLY-ALL to reply to list

To unsubscribe from the XROOTD-DEV list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1