Print

Print


Hi folks,

Ignore the thread, but take a look at the link from Monday (SEGV on log rotation).

Has the bug been reported anywhere besides the WLCG meeting?  I haven’t seen it in the tracker...

Brian

> Begin forwarded message:
> 
> Date: October 21, 2014 at 5:46:20 AM CDT
> To: <[log in to unmask]>
> From: Nicolo Magini <[log in to unmask]>
> Subject: Re: Caltech xrootd failures
> 
> 
> *** Discussion title: WAN Data Access
> 
> Hi all,
> 
> FWIW, a Tier-1 also reported to WLCG Ops an issue in xrootd-4.0.3:
> 
> https://twiki.cern.ch/twiki/bin/view/LCG/WLCGDailyMeetingsWeek141020#Monday
> 
> "xrootd v.4.0.3 in EPEL7 contains a serious problem that leads to memory
> corruption and eventual daemon crash. Discovered at logrotate time. The
> script running at 3am systematically coincided (entailed?) the crash."
> 
> I'll let Marian and the experts comment if it's better to upgrade to
> 4.0.3 anyway, or to wait for the next release instead.
> 
> Cheers
> N.
> 
> On 10/21/2014 04:36 AM, Marian Zvada wrote:
>> 
>> *** Discussion title: WAN Data Access
>> 
>> Hi Samir,
>> 
>> v4.0.0 version has indeed several issues (IPv6 especially) and it's 
>> recommended go for 4.0.3. This is last production release as of xrootd 
>> developers can tell:
>> 
>> http://xrootd.org/dload.html
>> 
>> Currently only in osg-testing repo, but it doesn't bring higher risk 
>> than stay on 4.0.0.:
>> http://repo.grid.iu.edu/osg/3.2/el6/testing/x86_64/
>> 
>> So I'd recommend reconsider.
>> 
>> My 2c,
>> Marian
>> 
>> On 10/20/14, 2:14 PM, Samir Cury wrote:
>>> 
>>> *** Discussion title: WAN Data Access
>>> 
>>> Hi Brian,
>>> 
>>> I upgraded one of the servers last week, with the  (hopefully) latest
>>> packages in OSG 3.2 stable :
>>> 
>>> xrootd4-libs-4.0.0-1.9.osg32.el6.x86_64
>>> xrootd-lcmaps-0.0.7-7.osg32.el6.x86_64
>>> xrootd4-client-libs-4.0.0-1.9.osg32.el6.x86_64
>>> xrootd4-4.0.0-1.9.osg32.el6.x86_64
>>> xrootd-hdfs-1.8.4-2.osg32.el6.x86_64
>>> xrootd4-server-libs-4.0.0-1.9.osg32.el6.x86_64
>>> xrootd-cmstfc-1.5.1-8.osg32.el6.x86_64
>>> 
>>> And this is the server which presented the problem.
>>> 
>>> I'm not really considering upgrading to testing versions for stability reasons.
>>> 
>>> Good news is that this error didn't appear again so far.
>>> 
>>> Thanks,
>>> Samir
>>> 
>>> On Sun, Oct 19, 2014 at 7:05 PM, Brian Bockelman <[log in to unmask]> wrote:
>>>> 
>>>> *** Discussion title: WAN Data Access
>>>> 
>>>> Hi Samir,
>>>> 
>>>> What version of Xrootd are you using?  There are some known IPv6 issues prior to 4.0.3.
>>>> 
>>>> Brian
>>>> 
>>>>> On Oct 18, 2014, at 3:02 PM, Samir Cury <[log in to unmask]> wrote:
>>>>> 
>>>>> 
>>>>> *** Discussion title: WAN Data Access
>>>>> 
>>>>> Interesting, looking at CMSD logs :
>>>>> 
>>>>> 141018 12:49:01 15811 Login: xrootd.unl.edu login failed; timed out
>>>>> 141018 12:49:01 15811 Manager: manager.0:[log in to unmask] removed;
>>>>> lost connection
>>>>> 141018 12:49:13 15811 XrdSetIF: Skipping duplicate public interface
>>>>> [2600:900:6:1101:5054:ff:fe00:70cb]
>>>>> 141018 12:50:13 15811 Login: xrootd.unl.edu login failed; timed out
>>>>> 141018 12:50:13 15811 Manager: manager.0:[log in to unmask] removed;
>>>>> lost connection
>>>>> 141018 12:50:25 15811 XrdSetIF: Skipping duplicate public interface
>>>>> [2600:900:6:1101:5054:ff:fe00:70cb]
>>>>> 141018 12:51:25 15811 Login: xrootd.unl.edu login failed; timed out
>>>>> 141018 12:51:25 15811 Manager: manager.0:[log in to unmask] removed;
>>>>> lost connection
>>>>> 141018 12:51:37 15811 XrdSetIF: Skipping duplicate public interface
>>>>> [2600:900:6:1101:5054:ff:fe00:70cb]
>>>>> 141018 12:52:37 15811 Login: xrootd.unl.edu login failed; timed out
>>>>> 
>>>>> Looks like cmsd has been given an IPv6 service to talk to? We're using
>>>>> the FNAL central redirector as recommended.
>>>>> 
>>>>> Configuration was fine, everything worked after restart :
>>>>> 
>>>>> 141018 12:57:17 001 Xrd: main:
>>>>> root://cmsxrootd.fnal.gov:1094//store/test/xrootd/T2_US_Caltech/store/mc/SAM/GenericTTbar/GEN-SIM-RECO/CMSSW_5_3_1_START53_V5-v1/0013/CE4D66EB-5AAE-E111-96D6-003048D37524.root
>>>>> --> /tmp/teste2.root
>>>>> 141018 12:57:17 8658 Xrd: Read: Hole in the cache: offs=0, len=8388608
>>>>> ^Crootd] Total 492.25 MB |==========>.........| 53.63 % [18.7 MB/s]
>>>>> -bash-4.1$ nslookup xrootd.unl.edu
>>>>> 
>>>>> Cheers,
>>>>> Samir



########################################################################
Use REPLY-ALL to reply to list

To unsubscribe from the XROOTD-DEV list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1