Print

Print


URL:
  <http://savannah.cern.ch/bugs/?88259>

                 Summary: seg fault with xrdcp extreme copy
                 Project: XROOTD
            Submitted by: bdouglas
            Submitted on: 2011-10-28 16:39
                Severity: 3 - Normal
                Priority: 5 - Normal
                  Status: None
                 Privacy: Public
             Assigned to: None
        Originator Email: 
             Open/Closed: Open
         Discussion Lock: Any
      Fixed by commit(s): 

    _______________________________________________________

Details:

Yesterday,

  I used xprep to trigger frm to copy files with xrdcp extreme copy
from various sites across the ATLAS xrootd federation. 

I got a lot of seg faults with xrdcp .

Here is some of the information that I collected -

>From - syslog -
Oct 27 18:10:29 wrk2prv kernel: xrdcp[1912]: segfault at 00000000454d99d0 rip
00000035a5e0bd2d rsp 00007fff72c91360 error 4
Oct 27 18:22:45 wrk2prv kernel: xrdcp[2625]: segfault at 00000000463319d0 rip
00000035a5e0bd2d rsp 00007fff518d9ac0 error 4
Oct 27 18:22:51 wrk2prv kernel: xrdcp[2646]: segfault at 0000000045aa39d0 rip
00000035a5e0bd2d rsp 00007fffac126ae0 error 4
Oct 27 18:28:20 wrk2prv kernel: xrdcp[2912]: segfault at 00000000469fa9d0 rip
00000035a5e0bd2d rsp 00007fff90d716e0 error 4
Oct 27 18:31:02 wrk2prv kernel: xrdcp[3058]: segfault at 0000000044bd29d0 rip
00000035a5e0bd2d rsp 00007fff8af27950 error 4
Oct 27 18:35:25 wrk2prv kernel: xrdcp[3288]: segfault at 0000000044e099d0 rip
00000035a5e0bd2d rsp 00007fff4eb53fa0 error 4
Oct 27 19:05:03 wrk2prv kernel: xrdcp[4731]: segfault at 00000000456de9d0 rip
00000035a5e0bd2d rsp 00007ffff97347e0 error 4
Oct 27 19:10:04 wrk2prv kernel: xrdcp[5083]: segfault at 0000000046d019d0 rip
00000035a5e0bd2d rsp 00007fff17483d90 error 4
Oct 27 19:23:53 wrk2prv kernel: xrdcp[5862]: segfault at 0000000046e2d9d0 rip
00000035a5e0bd2d rsp 00007fffdaee0450 error 4
Oct 27 19:27:15 wrk2prv kernel: xrdcp[6041]: segfault at 0000000046c019d0 rip
00000035a5e0bd2d rsp 00007ffff72801e0 error 4
Oct 27 19:30:54 wrk2prv kernel: xrdcp[6251]: segfault at 00000000456f99d0 rip
00000035a5e0bd2d rsp 00007fffc9dc4f00 error 4
Oct 27 19:36:07 wrk2prv kernel: xrdcp[6461]: segfault at 0000000044ad79d0 rip
00000035a5e0bd2d rsp 00007fff6a833be0 error 4
Oct 27 19:37:37 wrk2prv kernel: xrdcp[6542]: segfault at 0000000045ac19d0 rip
00000035a5e0bd2d rsp 00007fff351a8460 error 4
Oct 27 19:38:22 wrk2prv kernel: xrdcp[6597]: segfault at 00000000472499d0 rip
00000035a5e0bd2d rsp 00007fff9e90f670 error 4
Oct 27 19:40:36 wrk2prv kernel: xrdcp[6719]: segfault at 000000004639e9d0 rip
00000035a5e0bd2d rsp 00007fff6eb67e90 error 4
Oct 27 19:43:11 wrk2prv kernel: xrdcp[6845]: segfault at 00000000472669d0 rip
00000035a5e0bd2d rsp 00007fff9122ce80 error 4
Oct 27 19:45:10 wrk2prv kernel: xrdcp[6933]: segfault at 00000000446399d0 rip
00000035a5e0bd2d rsp 00007fff22607ce0 error 4
Oct 27 19:45:43 wrk2prv kernel: xrdcp[6993]: segfault at 0000000045f029d0 rip
00000035a5e0bd2d rsp 00007fffbd3ec220 error 4


 - Note there are many other failures -

>From frm log -

/etc/xrootd/new-stagein.sh: line 63:  4731 Segmentation fault     
${bindir}/xrdcp -x -f -s ${rfn} ${tfn} >> $TMPFILE 2>&1
111027 19:05:03 13534 xfr_Run: `/tmp/new-stagein.sh.jK4712' ->
`/local/xrootd/a/atlas/dq2/data11_7TeV/NTUP_TOP/r2276_p516_p523_p530_p577/data11_7TeV.00178109.physics_Egamma.merge.NTUP_TOP.r2276_p516_p523_p530_p577_tid367183_00/NTUP_TOP.367183._000044.root.1.fail'
111027 19:05:03 13534 xfr_Run: removed `/tmp/new-stagein.sh.jK4712'
111027 19:05:03 13534 xfr_Run: /etc/xrootd/new-stagein.sh ended with status
5


>From - log of copy script

failed_file name =
/local/xrootd/a/atlas/dq2/data11_7TeV/NTUP_TOP/r2276_p516_p523_p530_p577/data11_7TeV.00178109.physics_Egamma.merge.NTUP_TOP.r2276_p516_p523_p530_p577_tid367183_00/NTUP_TOP.367183._000044.root.1.fail

lfn:
/atlas/dq2/data11_7TeV/NTUP_TOP/r2276_p516_p523_p530_p577/data11_7TeV.00178109.physics_Egamma.merge.NTUP_TOP.r2276_p516_p523_p530_p577_tid367183_00/NTUP_TOP.367183._000044.root.1
pfn:
/local/xrootd/a/atlas/dq2/data11_7TeV/NTUP_TOP/r2276_p516_p523_p530_p577/data11_7TeV.00178109.physics_Egamma.merge.NTUP_TOP.r2276_p516_p523_p530_p577_tid367183_00/NTUP_TOP.367183._000044.root.1
rfn:
root://glrd.usatlas.org:1094//atlas/dq2/data11_7TeV/NTUP_TOP/r2276_p516_p523_p530_p577/data11_7TeV.00178109.physics_Egamma.merge.NTUP_TOP.r2276_p516_p523_p530_p577_tid367183_00/NTUP_TOP.367183._000044.root.1?tried=+1213headprv.hep.anl.gov
tfn:
root://wrk2prv.hep.anl.gov:1094//atlas/dq2/data11_7TeV/NTUP_TOP/r2276_p516_p523_p530_p577/data11_7TeV.00178109.physics_Egamma.merge.NTUP_TOP.r2276_p516_p523_p530_p577_tid367183_00/NTUP_TOP.367183._000044.root.1
copy command: /usr/bin/xrdcp -x -f -s
root://glrd.usatlas.org:1094//atlas/dq2/data11_7TeV/NTUP_TOP/r2276_p516_p523_p530_p577/data11_7TeV.00178109.physics_Egamma.merge.NTUP_TOP.r2276_p516_p523_p530_p577_tid367183_00/NTUP_TOP.367183._000044.root.1?tried=+1213headprv.hep.anl.gov
root://wrk2prv.hep.anl.gov:1094//atlas/dq2/data11_7TeV/NTUP_TOP/r2276_p516_p523_p530_p577/data11_7TeV.00178109.physics_Egamma.merge.NTUP_TOP.r2276_p516_p523_p530_p577_tid367183_00/NTUP_TOP.367183._000044.root.1

Extreme Copy enabled. 
Source #1
root://192.17.18.38:1094//atlas/dq2/data11_7TeV/NTUP_TOP/r2276_p516_p523_p530_p577/data11_7TeV.00178109.physics_Egamma.merge.NTUP_TOP.r2276_p516_p523_p530_p577_tid367183_00/NTUP_TOP.367183._000044.root.1?tried=+1213headprv.hep.anl.gov
Source #2
root://192.17.18.40:1094//atlas/dq2/data11_7TeV/NTUP_TOP/r2276_p516_p523_p530_p577/data11_7TeV.00178109.physics_Egamma.merge.NTUP_TOP.r2276_p516_p523_p530_p577_tid367183_00/NTUP_TOP.367183._000044.root.1?tried=+1213headprv.hep.anl.gov
Source #3
root://128.135.158.186:1094//atlas/dq2/data11_7TeV/NTUP_TOP/r2276_p516_p523_p530_p577/data11_7TeV.00178109.physics_Egamma.merge.NTUP_TOP.r2276_p516_p523_p530_p577_tid367183_00/NTUP_TOP.367183._000044.root.1?tried=+1213headprv.hep.anl.gov
[




    _______________________________________________________

Reply to this item at:

  <http://savannah.cern.ch/bugs/?88259>

_______________________________________________
  Message sent via/by LCG Savannah
  http://savannah.cern.ch/