Hi Horst, Other thna those core files nothing that would interest you. Once we see what the problem is maybe an upgrade would be a good thing. Andy On Sat, 21 Mar 2020, Horst Severini wrote: > Hi Andy, > > thanks. I just checked, and that's already installed on se1. > We probably did that a while ago when we were looking at other issues. > > We're running 4.11.1 everywhere on those storage nodes. Should we upgrade > to 4.11.2, or is there nothing in the .2 update that's applicable for us > right now? > > Thanks, > > Horst > > Andrew Hanushevsky <[log in to unmask]> wrote: > >> Hi Horst, >> >> Please make sure to install the debug RPM so that we can get actual >> statement numbers. It's called xrootd-debuginfo-4.11.2-1.el7.x86_64.rpm >> (assuming you are running Cent7). They should be readily available from >> OSG (though we have them too). >> >> Andy >> >> On Sat, 21 Mar 2020, Horst Severini wrote: >> >>> Thanks Andy, >>> >>> will do! >>> >>> Horst >>> >>> Andrew Hanushevsky <[log in to unmask]> wrote: >>> >>>> Hi Horst, >>>> >>>> Indeed, one should never see a core dump and if one does appear we >>>> definitely want to know about it. When you do see one, here is the first >>>> dump of information that would be helpful before we start digging deeper: >>>> >>>> gdb >executable> <corefile> >>>> where >>>> quit >>>> >>>> Cut and past the output into a mail file or posting. We may ask for a >>>> detailed traceback of every thread. I think I'll put that process on the >>>> xroot web page. For now, at least we will know where it went bonkers. >>>> >>>> Andy >>>> >>>> >>>> On Sat, 21 Mar 2020, Horst Severini wrote: >>>> >>>>> Hi Wei, >>>>> >>>>> yes, we have enough space for a few core dumps in /var/. It's just that there >>>>> were 5 or 6 in the last week or two, and that filled /var/ up completely. >>>>> I'll keep a closer eye on it for now. >>>>> >>>>> Thanks, >>>>> >>>>> Horst >>>>> >>>>> On 3/21/20 4:39 PM, Yang, Wei wrote: >>>>>> Also make sure you have enough space to hold a core dump. It can something >>>>>> be 10GB+ >>>>>> >>>>>> -- >>>>>> Wei Yang [log in to unmask] | 650-926-3338(O) >>>>>> >>>>>> ???-----Original Message----- >>>>>> From:<[log in to unmask]> on behalf of Horst >>>>>> Severini<[log in to unmask]> >>>>>> Date: Saturday, March 21, 2020 at 1:30 PM >>>>>> To: xrootd-dev<[log in to unmask]>,"[log in to unmask]" >>>>>> <[log in to unmask]> >>>>>> Subject: Re: XrootD smoke test report for 2020-03-21 10:01:46 GMT >>>>>> >>>>>> Thanks Wei, >>>>>> I'll send you the next one I get!:) >>>>>> Cheers, >>>>>> Horst >>>>>> On 3/21/20 2:55 PM, Yang, Wei wrote: >>>>>> > Indeed a core dump is usually the thing we need. >>>>>> > >>>>>> > -- >>>>>> > Wei Yang [log in to unmask] | 650-926-3338(O) >>>>>> > >>>>>> > On 3/21/20, 12:19 PM,"[log in to unmask] on behalf of >>>>>> Horst Severini" <[log in to unmask] on behalf of [log in to unmask]> >>>>>> wrote: >>>>>> > >>>>>> > We're running 4.11.1 here at OU. >>>>>> > >>>>>> > Cheers, >>>>>> > >>>>>> > Horst >>>>>> > >>>>>> > Albert Rossi<[log in to unmask]> wrote: >>>>>> > >>>>>> > > Hi Horst, >>>>>> > > >>>>>> > > actually, if you notice, all endpoints failed on today's >>>>>> test. So it was not just OU. >>>>>> > > >>>>>> > > The Stanford developers may want you to run a few commands >>>>>> over the core file from gdb once you have it in hand. >>>>>> > > >>>>>> > > What version of xrootd are you running, just out of >>>>>> curiosity? Is it bleeding-edge, or a stable release? >>>>>> > > >>>>>> > > Cheers, Al >>>>>> > > >>>>>> > > ________________________________________________ >>>>>> > > Albert L. Rossi >>>>>> > > Application Developer & Systems Analyst III >>>>>> > > Scientific Computing Division, Data Movement Development >>>>>> > > FCC 229A >>>>>> > > Mail Station 369 (FCC 2W) >>>>>> > > Fermi National Accelerator Laboratory >>>>>> > > Batavia, IL 60510 >>>>>> > > (630) 840-3023 >>>>>> > > ________________________________ >>>>>> > > From: Horst Severini<[log in to unmask]> >>>>>> > > Sent: Saturday, March 21, 2020 1:18 PM >>>>>> > >To:[log in to unmask] >>>>>> <[log in to unmask]>;[log in to unmask]<[log in to unmask]>; >>>>>> Albert Rossi<[log in to unmask]> >>>>>> > > Subject: Re: XrootD smoke test report for 2020-03-21 10:01:46 >>>>>> GMT >>>>>> > > >>>>>> > > Hi Al, >>>>>> > > >>>>>> > > thanks, good idea. I'll save the next core file. >>>>>> > > >>>>>> > > I'm pretty sure the authentication failures simply came >>>>>> because >>>>>> > > that partition was full and no new proxies or what not could >>>>>> be >>>>>> > > created, so I wouldn't worry about that. >>>>>> > > >>>>>> > > Thanks, >>>>>> > > >>>>>> > > Horst >>>>>> > > >>>>>> > > Albert Rossi<[log in to unmask]> wrote: >>>>>> > > >>>>>> > >> Hi Horst, >>>>>> > >> >>>>>> > >> I would definitely report >>>>>> [log in to unmask] >>>>>> > >> >>>>>> > >> As for why the massive authentication failure, I've seen >>>>>> this before, it might have to do with CA cert issues. >>>>>> > >> >>>>>> > >> Cheers, Al >>>>>> > >> >>>>>> > >> ________________________________________________ >>>>>> > >> Albert L. Rossi >>>>>> > >> Application Developer & Systems Analyst III >>>>>> > >> Scientific Computing Division, Data Movement Development >>>>>> > >> FCC 229A >>>>>> > >> Mail Station 369 (FCC 2W) >>>>>> > >> Fermi National Accelerator Laboratory >>>>>> > >> Batavia, IL 60510 >>>>>> > >> (630) 840-3023 >>>>>> > >> ________________________________ >>>>>> > >> From: Horst Severini<[log in to unmask]> >>>>>> > >> Sent: Saturday, March 21, 2020 11:19 AM >>>>>> > >>To:[log in to unmask] <[log in to unmask]> >>>>>> > >> Subject: Re: XrootD smoke test report for 2020-03-21 >>>>>> 10:01:46 GMT >>>>>> > >> >>>>>> > >> Hi all, >>>>>> > >> >>>>>> > >> our/var/ partition had filled up because of too many >>>>>> xrootd core dumps. >>>>>> > >> I cleared those up and restarted xrootd, and things look >>>>>> better again now. >>>>>> > >> >>>>>> > >> Not sure why we keep getting core dumps, though. >>>>>> > >> >>>>>> > >> Cheers, >>>>>> > >> >>>>>> > >> Horst >>>>>> > >> >>>>>> > >> On 3/21/20 5:01 AM,[log in to unmask] wrote: >>>>>> > >>> XROOTD SMOKE TEST SUMMARY >>>>>> > >>> 2020-03-21 10:01:46 GMT >>>>>> > >>> >>>>>> > >>> Client: bogus6.fnal.gov >>>>>> > >>> >>>>>> > >>> XrootD version: v4.11.2 >>>>>> > >>> >>>>>> > >>> Reference server: CERN-TRUNK >>>>>> > >>> >>>>>> > >>> Credential delegation: ON >>>>>> > >>> >>>>>> > >>> Checksum: -C adler32 >>>>>> > >>> >>>>>> > >>> Total number of round-trip tests: 21 >>>>>> > >>> >>>>>> > >>> --------------------------------SOUND >>>>>> ENDPOINTS--------------------------------- >>>>>> > >>> >>>>>> > >>> SCORE ENDPT TYPE UP SRC >>>>>> DST DN >>>>>> > >>> >>>>>> -------------------------------------------------------------------------------- >>>>>> > >>> >>>>>> > >>> -----------------------------PROBLEMATIC >>>>>> ENDPOINTS------------------------------ >>>>>> > >>> >>>>>> > >>> SCORE ENDPT TYPE UP SRC >>>>>> DST DN >>>>>> > >>> >>>>>> -------------------------------------------------------------------------------- >>>>>> > >>> 19 BRUSSELS dCache F - >>>>>> F F 0/4 >>>>>> > >>> 19 CERN-EOS EOS F - >>>>>> F F 0/4 >>>>>> > >>> 19 CERN-TRUNK DPM - F >>>>>> - F 0/2 >>>>>> > >>> 19 DESY-PROM dCache F - >>>>>> F F 0/4 >>>>>> > >>> 19 FNAL dCache F - >>>>>> F F 0/4 >>>>>> > >>> 19 IN2P3-DOMA xrootd F - >>>>>> F F 0/4 >>>>>> > >>> 19 PRAGUE DPM F - >>>>>> F F 0/4 >>>>>> > >>> 19 RAL-CEPH CEPH F - >>>>>> F F 0/4 >>>>>> > >>> 19 RAL-LCG2 Echo F - >>>>>> F F 0/4 >>>>>> > >>> 19 SLAC XrootD F - >>>>>> F F 0/4 >>>>>> > >>> 19 TRIUMF dCache F - >>>>>> F F 0/4 >>>>>> > >>> 19 UKI-BRUNEL DPM F - >>>>>> F F 0/4 >>>>>> > >>> 19 UKI-LANC DPM F - >>>>>> F F 0/4 >>>>>> > >>> 19 UKI-MAN1 DPM F - >>>>>> F F 0/4 >>>>>> > >>> 19 UKI-MAN2 DPM F - >>>>>> F F 0/4 >>>>>> > >>> 19 UNI-BONN CephFS F - >>>>>> F F 0/4 >>>>>> > >>> 18 OU XrootD F - >>>>>> F F 0/4 >>>>>> > >>> 13 BNL dCache F - >>>>>> F F 0/4 >>>>>> > >>> 7 AGLT2 dCache F - >>>>>> F F 0/4 >>>>>> > >>> 0 CALTECH HDFS F - >>>>>> F F 0/4 >>>>>> > >>> 0 TRIUMF-PROD dCache F - >>>>>> F F 0/4 >>>>> >>> > ######################################################################## Use REPLY-ALL to reply to list To unsubscribe from the XROOTD-DEV list, click the following link: https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1