I guess Matevz have this up at UCSD, maybe he could comment? -Marian On 9/3/15 9:51 AM, Tommaso Boccali wrote: > ciao, any advice? > > thanks > > tom > > On Fri, Aug 28, 2015 at 2:23 PM, Tommaso Boccali > <[log in to unmask] <mailto:[log in to unmask]>> wrote: > > ciao, > the supervisor (started by hand with > 'cmsd -d -l /var/log/xrootd/superv.log -c > /etc/xrootd/xrootd-redir-cms-superv.cfg -k 7' and > 'xrootd -d -l /var/log/xrootd/superv-x.log -c > /etc/xrootd/xrootd-redir-cms-superv.cfg -k 7' ) > > fileis below * > > in the manager cmsd.log I just see: > > 150828 14:08:52 28267 XrdSched: running main accept inq=0 > 150828 14:08:52 27343 ?:80@xrootd XrdPoll: FD 80 attached to poller > 1; num=22 > 150828 14:08:52 27343 Protocol: Primary > supervisor.28781:80@xrootd:33530 logged in suspended. > =====> Routing for xrootd.ba.infn.it <http://xrootd.ba.infn.it>: > local pub4 prv4 > =====> Route all4: xrootd.ba.infn.it <http://xrootd.ba.infn.it> > Dest=[::90.147.66.75]:33530 > > > while in the supervisor cmsd log I see > > 150828 14:08:52 28799 Pander supervisor services to > xrootd.ba.infn.it:1213 <http://xrootd.ba.infn.it:1213> > 150828 14:08:52 28799 Pander trying to connect to lvl 0 > xrootd.ba.infn.it:1213 <http://xrootd.ba.infn.it:1213> > 150828 14:08:52 28799 XrdInet: Connected to xrootd.ba.infn.it:1213 > <http://xrootd.ba.infn.it:1213> > 150828 14:08:52 28820 XrdXeq: Worker thread started > 150828 14:08:52 28799 Add xrootd.ba.infn.it > <http://xrootd.ba.infn.it> to manager config; id=0 > > which seems good, but then a series of: > > ... > 150828 14:08:54 28799 manager.0:25@xrootd do_StateFWD: *Path find > failed for state* > /store/test/xrootd/T2_MY_UPM_BIRUNI/store/mc/HC/GenericTTbar/GEN-SIM-RECO/CMSSW_7_0_4_START70_V7-v1/00000/D00E55FF-F6CC-E311-9B51-02163E00E88E.root > 150828 14:08:54 28799 Dispatch manager.0:25@xrootd for state dlen=148 > 150828 14:08:54 28799 manager.0:25@xrootd do_State: > /store/test/xrootd/T2_MY_UPM_BIRUNI/store/mc/HC/GenericTTbar/GEN-SIM-RECO/CMSSW_7_0_4_START70_V7-v1/00000/4463C61D-03CD-E311-AF1A-02163E00F338.root > 150828 14:08:54 28799 manager.0:25@xrootd do_StateFWD: *Path find > failed for state* > /store/test/xrootd/T2_MY_UPM_BIRUNI/store/mc/HC/GenericTTbar/GEN-SIM-RECO/CMSSW_7_0_4_START70_V7-v1/00000/4463C61D-03CD-E311-AF1A-02163E00F338.root > ... > > > and nothing else. > > So I have a few questions > > 0 - very naive: for the supervisor do I need to start cmsd AND > xrootd? If I do not do that, I see no effect at all > 1 - is the erorr expected with my config ? > 2 - I did not set explicitly the port numbers to the supervisor, and > they of course cannot be 1094/1213 since they are already taken by > the manager. I just have > xrd.port any > > is that enough? > > 3 - should I see some servers being "moved" by the manager to the > supervisor? > 4 - atm I have not opened any additional port on the firewall, since > with 'any' I do not know which port will be used. Should I open > something? > > > thanks a lot and sorry for all the questions > > > > *: > > [root@xrootd xrootd]# cat /etc/xrootd/xrootd-redir-cms-superv.cfg > > xrd.port any > all.role supervisor > # The known managers > all.manager xrootd.ba.infn.it <http://xrootd.ba.infn.it> 1213 > > # Allow any path to be exported; this is further refined in the > authfile. > all.export / r/w > > # Hosts allowed to use this xrootd cluster > cms.allow host * > # Logging verbosity > xrootd.trace emsg login stall redirect > ofs.trace all -debug > xrd.trace all -debug > cms.trace all -debug > > > > On Thu, Aug 27, 2015 at 5:26 PM, Marian Zvada <[log in to unmask] > <mailto:[log in to unmask]>> wrote: > > Hi Tom, > > I haven't tried it though, but looks good to me. > > One more thing, we should not use thread limit by hard from > whatever v4.x.x version, I think. This is well cared of within > recent fixes I believe. I don't recall details right now but can > search through if needed. > > So, feel free remove the line "xrd.sched maxt 16000". > > -Marian > > On 8/27/15 3:49 AM, Tommaso Boccali wrote: > > Ciao, I am trying to see if there is the need for a > supervisor on one of > our CMS EU redirs. > In the logs, I never really see anything like 'If you > suspect this, > check the manager’s log. It will contain warnings about > orphaned data > servers' > > so I am not sure we have a problem, but still we a re very > close to the > 64 limit so better to be proactive. > > > What I want to do as step #1 is to run the supervisor as an > additional > daemon on the redirector (it is a test, I really want to see > what > happens first, and the machine is big so should not be an issue) > > I looked at the documentation below, but I have to admit it > is a bit > obscure (to me). > > So, I have a cmsd/xrootd (the eu redir) running on ports > 1213/1094, and > redirecting "up" if needed (to the global redirector). > Starting from their config, I just wanted to prepare a > config for the > supervisor. > > The minimal one I am trying to guess from the documentation > would be > === > xrd.port any > all.role supervisor > # The known managers > all.manager xrootd.ba.infn.it <http://xrootd.ba.infn.it> > <http://xrootd.ba.infn.it> 1213 > > # Allow any path to be exported; this is further refined in > the authfile. > all.export / r/w > > # Hosts allowed to use this xrootd cluster > cms.allow host * > # Logging verbosity > xrootd.trace emsg login stall redirect > ofs.trace all -debug > xrd.trace all -debug > cms.trace all -debug > > cms.fxhold 8h > xrd.sched maxt 16000 > === > > but again , this is a sort of guess .... Do you have an > example of a > standalone cfg file for a supervisor? > > > thanks > > tom > > http://xrootd.org/doc/dev42/cms_config.htm#_Toc405927050 > > -- > Tommaso Boccali > INFN Pisa > > ------------------------------------------------------------------------ > > Use REPLY-ALL to reply to list > > To unsubscribe from the XROOTD-L list, click the following link: > https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1 > > > > > -- > Tommaso Boccali > INFN Pisa > > > > > -- > Tommaso Boccali > INFN Pisa ######################################################################## Use REPLY-ALL to reply to list To unsubscribe from the XROOTD-L list, click the following link: https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1