Print

Print


Hi Andy,

This is what I tried:

-) A new redirector (`xrootdtest.hep.wisc.edu`) is created and I installed new RPMs as you suggested.
-) Changed the xrootd.cfg file (see attached) in the redirector, one of the supervisors (`s15n01.hep.wisc.edu`) and one data server (`g10n02.hep.wisc.edu`) .
-) Xrdcp for a file using the redirector succeeds, but it doesn't test the supervisor. It directly goes through the data servers. 
-) It seems the supervisor is still in the same suspend + nostaging state.

Let me know if you want to try anything else ?

-Tapas


![screenshot 2015-04-14 16 08 12](https://cloud.githubusercontent.com/assets/1997192/7147424/8239b71c-e2c0-11e4-82d8-e6f60959c70f.png)

Redirector cmsd logs:
```
------ [log in to unmask] phase 2 manager initialization completed.
------ cmsd [log in to unmask]:1213 initialization completed.
150414 15:52:35 5979 Protocol: redirector.5865:18@xrootdtest logged in.
150414 15:52:35 5979 Admit_Redirector redirector.5865:18@xrootdtest assigned slot 1
150414 15:52:37 5994 AddNode srv supervisor.26915:19@s15n01:31094 cluster 1213xrootdtest.hep.wisc.edu mask=1 anum=0
150414 15:52:37 5994 Add supervisor.26915:19@s15n01:31094 to cluster anon-u 1213xrootdtest.hep.wisc.edu slot 0.2 (nodecnt=1 supn=1)
150414 15:52:37 5994 Update Counts Parm1=0 Parm2=0
150414 15:52:37 5994 Admit s15n01 TSpace=1GB NumFS=0 FSpace=0MB MinFR=0 MB Util=0 Share=100 TZone=-6
150414 15:52:37 5994 Admit s15n01 adding path: w /
150414 15:52:37 5994 supervisor.26915:19@s15n01:31094 do_Space: 0MB free; 0% util
150414 15:52:37 5994 Protocol: Primary supervisor.26915:19@s15n01:31094 logged in suspended.
=====> Routing for s15n01.hep.wisc.edu: local pub4 prv4 pub6 prv6
=====> Route all4: s15n01.hep.wisc.edu Dest=[::144.92.181.127]:31094
=====> Route all6: s15n01.hep.wisc.edu Dest=[2607:f388:101c:1000::335]:31094
150414 15:52:42 5978 Update Stage Parm1=-1 Parm2=0
150414 15:52:42 5978 Update Active Parm1=-1 Parm2=0
150414 15:52:42 5978 Config: manager service enabled.
150414 15:52:42 5993 State: Status changed to suspended + nostaging
150414 15:52:42 5993 Send status to redirector.5865:18@xrootdtest
150414 15:57:31 5996 AddNode srv server.17917:21@g10n02:31094 cluster 1213xrootdtest.hep.wisc.edu mask=3 anum=0
150414 15:57:31 5996 Add server.17917:21@g10n02:31094 to cluster anon-s 1213xrootdtest.hep.wisc.edu slot 1.3 (nodecnt=2 supn=1)
150414 15:57:31 5996 Update Counts Parm1=1 Parm2=0
150414 15:57:31 5996 Admit g10n02 TSpace=1GB NumFS=0 FSpace=0MB MinFR=0 MB Util=0 Share=100 TZone=-6
150414 15:57:31 5996 Admit g10n02 adding path: w /
150414 15:57:31 5996 server.17917:21@g10n02:31094 do_Space: 0MB free; 0% util
150414 15:57:31 5996 Protocol: Primary server.17917:21@g10n02:31094 logged in.
=====> Routing for g10n02.hep.wisc.edu: local pub4 prv4 pub6 prv6
=====> Route all4: g10n02.hep.wisc.edu Dest=[::144.92.180.226]:31094
=====> Route all6: g10n02.hep.wisc.edu Dest=[2607:f388:101c:1000::6]:31094
150414 15:57:31 5996 Dispatch server.17917:21@g10n02:31094 for status dlen=0
150414 15:57:31 5996 server.17917:21@g10n02:31094 do_Status: resume nostage 
150414 15:57:31 5993 State: Status changed to active
150414 15:57:31 5993 Send status to redirector.5865:18@xrootdtest
150414 15:58:45 5979 Dispatch redirector.5865:18@xrootdtest for select dlen=62
150414 15:58:45 6047 tapas.3244:7@login05 do_Select:  /store/user/tapas/file.list
150414 15:58:45 6047 SelNode g10n02.hep.wisc.edu serving /store/user/tapas/file.list
150414 15:58:45 6047 tapas.3244:7@login05 do_Select: Redirect -> g10n02.hep.wisc.edu:31094 for /store/user/tapas/file.list
150414 15:58:58 5979 Dispatch redirector.5865:18@xrootdtest for select dlen=63
150414 15:58:58 5978 tapas.3339:23@login05 do_Select:  /store/user/tapas/file.list
150414 15:58:58 5978 SelNode g10n02.hep.wisc.edu serving /store/user/tapas/file.list
150414 15:58:58 5978 tapas.3339:23@login05 do_Select: Redirect -> g10n02.hep.wisc.edu:31094 for /store/user/tapas/file.list
150414 15:59:44 5994 Dispatch supervisor.26915:19@s15n01:31094 for status dlen=0
150414 15:59:44 5994 supervisor.26915:19@s15n01:31094 do_Status: suspend 
150414 15:59:47 5994 Update Counts Parm1=0 Parm2=0
150414 15:59:47 5994 RemNode man supervisor.26915:19@s15n01:31094 cluster 1213xrootdtest.hep.wisc.edu mask=2 anum=0 n/p
150414 15:59:47 5994 Remove_Node supervisor.26915:19@s15n01:31094 node 0.2
150414 15:59:47 5994 Protocol: supervisor.26915:19@s15n01 logged out; request read failed
150414 16:00:00 5999 Add Reconnect supervisor.26915:19@s15n01:31094 to cluster anon-u 1213xrootdtest.hep.wisc.edu slot 0.3 (nodecnt=2 supn=1)
150414 16:00:00 5999 Update Counts Parm1=0 Parm2=0
150414 16:00:00 5999 Admit s15n01 TSpace=1GB NumFS=0 FSpace=0MB MinFR=0 MB Util=0 Share=100 TZone=-6
150414 16:00:00 5999 Admit s15n01 adding path: w /
150414 16:00:00 5999 supervisor.26915:19@s15n01:31094 do_Space: 0MB free; 0% util
150414 16:00:00 5999 Protocol: Primary supervisor.26915:19@s15n01:31094 logged in suspended.
=====> Routing for s15n01.hep.wisc.edu: local pub4 prv4 pub6 prv6
=====> Route all4: s15n01.hep.wisc.edu Dest=[::144.92.181.127]:31094
=====> Route all6: s15n01.hep.wisc.edu Dest=[2607:f388:101c:1000::335]:31094
150414 16:00:04 5999 Dispatch supervisor.26915:19@s15n01:31094 for status dlen=0
150414 16:00:04 5999 supervisor.26915:19@s15n01:31094 do_Status: suspend nostage 
150414 16:02:32 5999 Dispatch supervisor.26915:19@s15n01:31094 for load dlen=12
150414 16:02:32 5999 supervisor.26915:19@s15n01:31094 do_Load: cpu=0 net=0 xeq=0 mem=0 pag=0 dsk=0% 0MB load=0 mass=0
```



---
Reply to this email directly or view it on GitHub:
https://github.com/xrootd/xrootd/issues/227#issuecomment-93063862

########################################################################
Use REPLY-ALL to reply to list

To unsubscribe from the XROOTD-DEV list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1