Print

Print


URL:
  <http://savannah.cern.ch/bugs/?81584>

                 Summary: XrdCnsd looses connection and stop exporting log
files.
                 Project: XROOTD
            Submitted by: abh
            Submitted on: 2011-04-27 22:35
                Severity: 4 - Important
                Priority: 7 - High
                  Status: None
                 Privacy: Public
             Assigned to: abh
        Originator Email: 
             Open/Closed: Open
         Discussion Lock: Any
      Fixed by commit(s): 

    _______________________________________________________

Details:

Here is a history of the issue...


Thanks Alex for shedding more light on this. We’ve been having problem off
and on and now it looks like the real underlying problem is the way XrdCnsd
uses the xrootd client to copy files into the redirector. One idea we’ve
been considering is to get rid of that part altogther and just let you
specify how the copy-in is to be done. That way, you can use you favorite
program (scp, xrdcp, cp to nfs, etc). This would solve the underlying
problem. Additionally, if you choose anything other than xrdcp you would not
need to run an extra xrootd on the redirector node.
 
Andy
 
From: Alex Bogert 
Sent: Tuesday, April 26, 2011 11:32 AM
Subject: Xrootd CNSD bugged
 
Hi Everyone, 
 
I recently found an error in our xrootd setup. The CNS daemons are no longer
reporting correctly to the redirector's inventory. When I noticed files
missing in the Inventory I rebooted the xrootd system, and saw some strange
behavior. Each of the worker nodes connect's to the redirector and begins
updating their inventory, but after about 100 - 200 updates, the local
XrdCNSd reports a trashed connection to the redirector. Also, If i disable
xrootdfs about 100 extra updates will get through before the connection is
trashed. I am stumped as to the cause of this problem. I've attached the logs
from the redirector and from one worker node after a restart. 
 
I would appreciate any advice on how to resolve this.
 
Cheers,
Alex
 
 
Current xroot processes running on redirector:
 
$ ps aux | grep xroot
xrootd    8975  0.0  0.0  45424  4104 ?        Sl   Apr22   0:00
/opt/osg-v1.2/xrootd/bin//xrootd -l
/opt/osg-v1.2/xrootd/var/logs/xrdlog.atlas01.ucsc.edu -c
/opt/osg-v1.2/xrootd/etc/xrootd.cfg
xrootd    8976  0.0  0.0  41084  3856 ?        Sl   Apr22   0:00
/opt/osg-v1.2/xrootd/bin//xrootd -n cns -l
/opt/osg-v1.2/xrootd/var/logs/xrdlog.atlas01.ucsc.edu -c
/opt/osg-v1.2/xrootd/etc/xrootd.cfg
xrootd    9060  0.0  0.0 121032  4236 ?        Sl   Apr22   0:00
/opt/osg-v1.2/xrootd/bin//cmsd -l
/opt/osg-v1.2/xrootd/var/logs/cmslog.atlas01.ucsc.edu -c
/opt/osg-v1.2/xrootd/etc/xrootd.cfg
xrootd    9206  0.0  0.0 368432 12228 ?        Ssl  Apr22   0:01
/opt/osg-v1.2/xrootdfs/bin/xrootdfsd /xrootdfs/atlas -o
allow_other,fsname=xrootdfs,max_write=131072,attr_timeout=10,entry_timeout=10
 
Current xroot processing on worker (other workers look the same).
 
$ ps aux | grep xroot
1060     14469  0.0  0.0  61148   760 pts/1    S+   11:19   0:00 grep xroot
xrootd   19697  0.0  0.0  49684  4412 ?        Sl   Apr22   0:00
/opt/osg-v1.2/xrootd/bin//xrootd -l
/opt/osg-v1.2/xrootd/var/logs/xrdlog.wrk2prv.ucsc.edu -c
/opt/osg-v1.2/xrootd/etc/xrootd.cfg
xrootd   19716  0.0  0.0 123740 12672 ?        Sl   Apr22   0:00
/opt/osg-v1.2/xrootd/bin/XrdCnsd -d -D 2 -i 90 -b
atlas01.ucsc.edu:1095:/atlas/inventory
xrootd   19778  0.0  0.0  46020  3864 ?        Sl   Apr22   0:00
/opt/osg-v1.2/xrootd/bin//cmsd -l
/opt/osg-v1.2/xrootd/var/logs/cmslog.wrk2prv.ucsc.edu -c
/opt/osg-v1.2/xrootd/etc/xrootd.cfg





    _______________________________________________________

Reply to this item at:

  <http://savannah.cern.ch/bugs/?81584>

_______________________________________________
  Message sent via/by LCG Savannah
  http://savannah.cern.ch/