I guess Matevz has this set up at UCSD; maybe he could comment?
-Marian
On 9/3/15 9:51 AM, Tommaso Boccali wrote:
> ciao, any advice?
>
> thanks
>
> tom
>
> On Fri, Aug 28, 2015 at 2:23 PM, Tommaso Boccali wrote:
>
> ciao,
> the supervisor was started by hand with
> 'cmsd -d -l /var/log/xrootd/superv.log -c /etc/xrootd/xrootd-redir-cms-superv.cfg -k 7' and
> 'xrootd -d -l /var/log/xrootd/superv-x.log -c /etc/xrootd/xrootd-redir-cms-superv.cfg -k 7'
>
> Its config file is below (*).
>
> in the manager cmsd.log I just see:
>
> 150828 14:08:52 28267 XrdSched: running main accept inq=0
> 150828 14:08:52 27343 ?:80@xrootd XrdPoll: FD 80 attached to poller 1; num=22
> 150828 14:08:52 27343 Protocol: Primary supervisor.28781:80@xrootd:33530 logged in suspended.
> =====> Routing for xrootd.ba.infn.it: local pub4 prv4
> =====> Route all4: xrootd.ba.infn.it Dest=[::90.147.66.75]:33530
>
>
> while in the supervisor cmsd log I see
>
> 150828 14:08:52 28799 Pander supervisor services to xrootd.ba.infn.it:1213
> 150828 14:08:52 28799 Pander trying to connect to lvl 0 xrootd.ba.infn.it:1213
> 150828 14:08:52 28799 XrdInet: Connected to xrootd.ba.infn.it:1213
> 150828 14:08:52 28820 XrdXeq: Worker thread started
> 150828 14:08:52 28799 Add xrootd.ba.infn.it to manager config; id=0
>
> which seems good, but then a series of:
>
> ...
> 150828 14:08:54 28799 manager.0:25@xrootd do_StateFWD: *Path find failed for state* /store/test/xrootd/T2_MY_UPM_BIRUNI/store/mc/HC/GenericTTbar/GEN-SIM-RECO/CMSSW_7_0_4_START70_V7-v1/00000/D00E55FF-F6CC-E311-9B51-02163E00E88E.root
> 150828 14:08:54 28799 Dispatch manager.0:25@xrootd for state dlen=148
> 150828 14:08:54 28799 manager.0:25@xrootd do_State: /store/test/xrootd/T2_MY_UPM_BIRUNI/store/mc/HC/GenericTTbar/GEN-SIM-RECO/CMSSW_7_0_4_START70_V7-v1/00000/4463C61D-03CD-E311-AF1A-02163E00F338.root
> 150828 14:08:54 28799 manager.0:25@xrootd do_StateFWD: *Path find failed for state* /store/test/xrootd/T2_MY_UPM_BIRUNI/store/mc/HC/GenericTTbar/GEN-SIM-RECO/CMSSW_7_0_4_START70_V7-v1/00000/4463C61D-03CD-E311-AF1A-02163E00F338.root
> ...
>
>
> and nothing else.
>
> So I have a few questions
>
> 0 - very naive: for the supervisor do I need to start cmsd AND
> xrootd? If I do not do that, I see no effect at all
> 1 - is the error expected with my config?
> 2 - I did not explicitly set the port numbers for the supervisor, and
> they of course cannot be 1094/1213 since those are already taken by
> the manager. I just have
> xrd.port any
>
> is that enough?
>
> 3 - should I see some servers being "moved" by the manager to the
> supervisor?
> 4 - atm I have not opened any additional port on the firewall, since
> with 'any' I do not know which port will be used. Should I open
> something?
>
>
> thanks a lot and sorry for all the questions
>
>
>
> *:
>
> [root@xrootd xrootd]# cat /etc/xrootd/xrootd-redir-cms-superv.cfg
>
> xrd.port any
> all.role supervisor
> # The known managers
> all.manager xrootd.ba.infn.it 1213
>
> # Allow any path to be exported; this is further refined in the authfile.
> all.export / r/w
>
> # Hosts allowed to use this xrootd cluster
> cms.allow host *
> # Logging verbosity
> xrootd.trace emsg login stall redirect
> ofs.trace all -debug
> xrd.trace all -debug
> cms.trace all -debug
>
>
>
> On Thu, Aug 27, 2015 at 5:26 PM, Marian Zvada wrote:
>
> Hi Tom,
>
> I haven't tried it myself, but it looks good to me.
>
> One more thing: I think we should no longer hard-code the thread
> limit as of some v4.x.x version. I believe this is taken care of by
> recent fixes. I don't recall the details right now, but I can
> search for them if needed.
>
> So, feel free to remove the line "xrd.sched maxt 16000".
>
> -Marian
>
> On 8/27/15 3:49 AM, Tommaso Boccali wrote:
>
> Ciao, I am trying to see if there is the need for a
> supervisor on one of
> our CMS EU redirs.
> In the logs, I never really see anything like 'If you
> suspect this,
> check the manager’s log. It will contain warnings about
> orphaned data
> servers'
>
> so I am not sure we have a problem, but we are very
> close to the
> 64 limit, so it is better to be proactive.
>
>
> What I want to do as step #1 is to run the supervisor as an
> additional
> daemon on the redirector (it is a test, I really want to see
> what
> happens first, and the machine is big so should not be an issue)
>
> I looked at the documentation below, but I have to admit it
> is a bit
> obscure (to me).
>
> So, I have a cmsd/xrootd (the eu redir) running on ports
> 1213/1094, and
> redirecting "up" if needed (to the global redirector).
> Starting from their config, I just wanted to prepare a
> config for the
> supervisor.
>
> The minimal one I am trying to guess from the documentation
> would be
> ===
> xrd.port any
> all.role supervisor
> # The known managers
> all.manager xrootd.ba.infn.it 1213
>
> # Allow any path to be exported; this is further refined in the authfile.
> all.export / r/w
>
> # Hosts allowed to use this xrootd cluster
> cms.allow host *
> # Logging verbosity
> xrootd.trace emsg login stall redirect
> ofs.trace all -debug
> xrd.trace all -debug
> cms.trace all -debug
>
> cms.fxhold 8h
> xrd.sched maxt 16000
> ===
>
> but again, this is a sort of guess... Do you have an
> example of a
> standalone cfg file for a supervisor?
>
>
> thanks
>
> tom
>
> http://xrootd.org/doc/dev42/cms_config.htm#_Toc405927050
>
> --
> Tommaso Boccali
> INFN Pisa
>
> ------------------------------------------------------------------------
>
> Use REPLY-ALL to reply to list
>
> To unsubscribe from the XROOTD-L list, click the following link:
> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1
>
>
>
>
> --
> Tommaso Boccali
> INFN Pisa
>
>
>
>
> --
> Tommaso Boccali
> INFN Pisa