Print

Print


It has been raised at the xrootd-l mailing list. This should fix it:

https://github.com/xrootd/xrootd/commit/5341b44841d8ef7d77bdb1eb8826da94055e78f1

Cheers,
    Lukasz

On 10/21/2014 03:42 PM, Brian Bockelman wrote:
> Hi folks,
>
> Ignore the thread, but take a look at the link from Monday (SEGV on log
> rotation).
>
> Has the bug been reported anywhere besides the WLCG meeting?  I haven’t
> seen it in the tracker...
>
> Brian
>
>> Begin forwarded message:
>>
>> *Date: *October 21, 2014 at 5:46:20 AM CDT
>> *To: *<[log in to unmask] <mailto:[log in to unmask]>>
>> *From: *Nicolo Magini <[log in to unmask]
>> <mailto:[log in to unmask]>>
>> *Subject: **Re: Caltech xrootd failures*
>>
>>
>> *** Discussion title: WAN Data Access
>>
>> Hi all,
>>
>> FWIW, a Tier-1 also reported to WLCG Ops an issue in xrootd-4.0.3:
>>
>> https://twiki.cern.ch/twiki/bin/view/LCG/WLCGDailyMeetingsWeek141020#Monday
>>
>> "xrootd v.4.0.3 in EPEL7 contains a serious problem that leads to memory
>> corruption and eventual daemon crash. Discovered at logrotate time. The
>> script running at 3am systematically coincided (entailed?) the crash."
>>
>> I'll let Marian and the experts comment if it's better to upgrade to
>> 4.0.3 anyway, or to wait for the next release instead.
>>
>> Cheers
>> N.
>>
>> On 10/21/2014 04:36 AM, Marian Zvada wrote:
>>>
>>> *** Discussion title: WAN Data Access
>>>
>>> Hi Samir,
>>>
>>> v4.0.0 version has indeed several issues (IPv6 especially) and it's
>>> recommended go for 4.0.3. This is last production release as of xrootd
>>> developers can tell:
>>>
>>> http://xrootd.org/dload.html
>>>
>>> Currently only in osg-testing repo, but it doesn't bring higher risk
>>> than stay on 4.0.0.:
>>> http://repo.grid.iu.edu/osg/3.2/el6/testing/x86_64/
>>>
>>> So I'd recommend reconsider.
>>>
>>> My 2c,
>>> Marian
>>>
>>> On 10/20/14, 2:14 PM, Samir Cury wrote:
>>>>
>>>> *** Discussion title: WAN Data Access
>>>>
>>>> Hi Brian,
>>>>
>>>> I upgraded one of the servers last week, with the  (hopefully) latest
>>>> packages in OSG 3.2 stable :
>>>>
>>>> xrootd4-libs-4.0.0-1.9.osg32.el6.x86_64
>>>> xrootd-lcmaps-0.0.7-7.osg32.el6.x86_64
>>>> xrootd4-client-libs-4.0.0-1.9.osg32.el6.x86_64
>>>> xrootd4-4.0.0-1.9.osg32.el6.x86_64
>>>> xrootd-hdfs-1.8.4-2.osg32.el6.x86_64
>>>> xrootd4-server-libs-4.0.0-1.9.osg32.el6.x86_64
>>>> xrootd-cmstfc-1.5.1-8.osg32.el6.x86_64
>>>>
>>>> And this is the server which presented the problem.
>>>>
>>>> I'm not really considering upgrading to testing versions for
>>>> stability reasons.
>>>>
>>>> Good news is that this error didn't appear again so far.
>>>>
>>>> Thanks,
>>>> Samir
>>>>
>>>> On Sun, Oct 19, 2014 at 7:05 PM, Brian Bockelman
>>>> <[log in to unmask]> wrote:
>>>>>
>>>>> *** Discussion title: WAN Data Access
>>>>>
>>>>> Hi Samir,
>>>>>
>>>>> What version of Xrootd are you using?  There are some known IPv6
>>>>> issues prior to 4.0.3.
>>>>>
>>>>> Brian
>>>>>
>>>>>> On Oct 18, 2014, at 3:02 PM, Samir Cury
>>>>>> <[log in to unmask]> wrote:
>>>>>>
>>>>>>
>>>>>> *** Discussion title: WAN Data Access
>>>>>>
>>>>>> Interesting, looking at CMSD logs :
>>>>>>
>>>>>> 141018 12:49:01 15811 Login: xrootd.unl.edu login failed; timed out
>>>>>> 141018 12:49:01 15811 Manager: manager.0:[log in to unmask] removed;
>>>>>> lost connection
>>>>>> 141018 12:49:13 15811 XrdSetIF: Skipping duplicate public interface
>>>>>> [2600:900:6:1101:5054:ff:fe00:70cb]
>>>>>> 141018 12:50:13 15811 Login: xrootd.unl.edu login failed; timed out
>>>>>> 141018 12:50:13 15811 Manager: manager.0:[log in to unmask] removed;
>>>>>> lost connection
>>>>>> 141018 12:50:25 15811 XrdSetIF: Skipping duplicate public interface
>>>>>> [2600:900:6:1101:5054:ff:fe00:70cb]
>>>>>> 141018 12:51:25 15811 Login: xrootd.unl.edu login failed; timed out
>>>>>> 141018 12:51:25 15811 Manager: manager.0:[log in to unmask] removed;
>>>>>> lost connection
>>>>>> 141018 12:51:37 15811 XrdSetIF: Skipping duplicate public interface
>>>>>> [2600:900:6:1101:5054:ff:fe00:70cb]
>>>>>> 141018 12:52:37 15811 Login: xrootd.unl.edu login failed; timed out
>>>>>>
>>>>>> Looks like cmsd has been given an IPv6 service to talk to? We're using
>>>>>> the FNAL central redirector as recommended.
>>>>>>
>>>>>> Configuration was fine, everything worked after restart :
>>>>>>
>>>>>> 141018 12:57:17 001 Xrd: main:
>>>>>> root://cmsxrootd.fnal.gov:1094//store/test/xrootd/T2_US_Caltech/store/mc/SAM/GenericTTbar/GEN-SIM-RECO/CMSSW_5_3_1_START53_V5-v1/0013/CE4D66EB-5AAE-E111-96D6-003048D37524.root
>>>>>> --> /tmp/teste2.root
>>>>>> 141018 12:57:17 8658 Xrd: Read: Hole in the cache: offs=0, len=8388608
>>>>>> ^Crootd] Total 492.25 MB |==========>.........| 53.63 % [18.7 MB/s]
>>>>>> -bash-4.1$ nslookup xrootd.unl.edu
>>>>>>
>>>>>> Cheers,
>>>>>> Samir
>
>
>
> ------------------------------------------------------------------------
>
> Use REPLY-ALL to reply to list
>
> To unsubscribe from the XROOTD-DEV list, click the following link:
> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1
>

########################################################################
Use REPLY-ALL to reply to list

To unsubscribe from the XROOTD-DEV list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1