Print

Print


On Friday, March 28, 2014, Andrew Hanushevsky <[log in to unmask]> wrote:

> Hi Tommaso & Marian,
>
> I apologize that we didn't get a prompt response back. Somehow this
> slipped through the cracks. So, let me intersperse my answers in the text
> below.
>
> On Thu, 27 Mar 2014, Tommaso Boccali wrote:
>
>  ok, but what happens if a site restarts its cmsd when the redirector
>> points
>> just to one redirector (because the other is off)?
>> when the second redirector comes back in the DNS list, will the site pick
>> it up or not?
>>
> When a site restarts it's cmsd when one of the redirectors is off.
>

uhm, not sure I was explaining correctly myself in first place.

Let's say we prepare 2 regional redirectors, 1.1.1.1 and 2.2.2.2, and we
punt them in the DNS as xrootd.infn.it (no round robin: "host xrootd.infn.it"
will return 2 IP addresses).
Since we want to use xrootd.infn.it as fallback, we plan to have a nagios
test which checks 1.1.1.1 and 2.2.2.2 periodically, and in case one is NOT
ok, it is removed from the DNS.

So, question was:
let's say that site ABCD needs to restart the xrootd local servers, which
are configured as

all.manager meta all *xrootd.infn.it <http://xrootd.infn.it>*+ 1213

what happens if AT THE RESTART MOMENT xrootd.infn.it only resolves  to
1.1.1.1 (since eventually 2.2.2.2 is broken)? And even more, what if 2
hours later 2.2.2.2 enters again the DNS resolution for
xrootd.infn.it(since it recovered and the nagios realized)? Has the
cmsd running at site
ABCD a way to recognize that the # of IPs served by xrootd.infn.it has
changed, and hence consider also 2.2.2.2 as a manager?


> When all is used, all traffic marked to go to the one that is offline is
> shifted to a particular redirector that is still working. When that
> redirector comes back online the traffic is shifted back as it would have
> been had the redirector been online at the time of restart.
>
> When any is used, all traffic marked to go to that redirector, if any, is
> shifted to another working redirector. When that redirector comes back
> online the traffic is normally shifted back to that redirector if it was
> the redirector of choice (i.e. the one originally chosen to receive all of
> the traffic).
>
>  On Thu, Mar 27, 2014 at 4:08 PM, Marian Zvada <[log in to unmask]>
>> wrote:
>>
>>  we have very same setup for CMS T2 and currently in commissioning
>>> process.
>>> We have DNS round-robin alias between FNAL and UNL and use the following
>>> in
>>> the config:
>>>
>>
>  all.manager meta all cmsxrootd.fnal.gov+ 1213
>>> (xrootd1.fnal.gov and xrootd.unl.edu behind that)
>>>
>> Yes, this is the correct way of doing this. XRootD will take over the
> parcelling out of traffic as DNS round-robbin is (and always has been)
> problematic for the way XRootD needs to load balance requests.
>
>  => I think this should be adapted also for your setup as well I believe,
>>> though pointing to different aliased host.
>>>
>> Yes.
>
>  For the question of "all" as parameter I was already asking in developers
>>> list what's beneficial, all || any, but didn't get response so far. Here
>>> it's documented but I didn't get full understanding what's recommended
>>> from
>>> operation point of view while using DNS-RR setup:
>>> http://xrootd.org/doc/prod/cms_config.htm#_Toc339484334
>>>
>> Indeed, we should have had a better explanation. Could either of you post
> a problem ticket that requests that the documentation discuss why you would
> choose one over the other so it doesn't fall through the cracks?
>
> In absence of that let me explain.
>
> Choosing any is suitable when your interaction rate with xrootd is not
> overtaxing the system. You may be overtaxing the system if the CPU usage of
> the cmsd is getting high or it's memory starts getting out of hand (sort of
> c,sd using more than 10% of the CPU or sitting on more than 2GB of memory
> -- though we haven' really benchmarked that). With all, all traffic goes to
> a single cmsd and the other redirectors listed are merely used as a
> fallback if the primary cmsd fails.
>
> Choosing all is suitable when you interaction rate with xrootd is very
> high and you wish to distribute that load across all of the cmsd's that are
> available. This is suitable when all of the cmsd's are roughly of the same
> power (if not you will get rather odd skew effects in responsiveness).
> Here, requests are parcelled out using a determinstic algorithm so that
> each request relative to a particular file always goes to the same cmsd,
> thus maximizing the use of the routing cache.
>
>
>  However, we stick so far with:
>>> all.manager meta all cmsxrootd.fnal.gov+ 1213
>>>
>>> Also, currently each of top redirectors in the DNS-RR alias subscribes to
>>> global redirector:
>>> all.manager meta any cms-xrd-global.cern.ch+ 1098
>>> (see 'any' in this case)
>>>
>> Yes, I see that. So is there any reason why you used any here while you
> used all elsewhere? Not that it likely matters but I am curious why the
> difference.
>
> Andy
>
> ########################################################################
> Use REPLY-ALL to reply to list
>
> To unsubscribe from the XROOTD-L list, click the following link:
> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1
>


-- 
Tommaso Boccali
INFN Pisa

########################################################################
Use REPLY-ALL to reply to list

To unsubscribe from the XROOTD-L list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1