URL:
<http://savannah.cern.ch/bugs/?81584>
Summary: XrdCnsd loses connection and stops exporting log
files.
Project: XROOTD
Submitted by: abh
Submitted on: 2011-04-27 22:35
Severity: 4 - Important
Priority: 7 - High
Status: None
Privacy: Public
Assigned to: abh
Originator Email:
Open/Closed: Open
Discussion Lock: Any
Fixed by commit(s):
_______________________________________________________
Details:
Here is a history of the issue...
Thanks Alex for shedding more light on this. We’ve been having problems off
and on, and now it looks like the real underlying problem is the way XrdCnsd
uses the xrootd client to copy files into the redirector. One idea we’ve
been considering is to get rid of that part altogether and just let you
specify how the copy-in is to be done. That way, you can use your favorite
program (scp, xrdcp, cp to NFS, etc.). This would solve the underlying
problem. Additionally, if you choose anything other than xrdcp you would not
need to run an extra xrootd on the redirector node.
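As a rough illustration of the pluggable copy-in idea only (this is not how XrdCnsd is implemented or configured), the sketch below runs a user-supplied copy command with a few retries. The function name `copy_in`, the `{src}`/`{dst}` placeholders, and the retry parameters are all hypothetical.

```python
import shlex
import subprocess
import time


def copy_in(template, src, dst, retries=3, delay=1.0):
    """Run a user-chosen copy command; return True on success.

    `template` is a command line such as "cp {src} {dst}". The
    {src}/{dst} placeholder convention is an assumption made for
    this sketch, not an existing XrdCnsd option.
    """
    # Split the template first, then substitute, so that paths
    # containing spaces stay a single argument.
    argv = [part.format(src=src, dst=dst) for part in shlex.split(template)]
    for attempt in range(1, retries + 1):
        result = subprocess.run(argv)
        if result.returncode == 0:
            return True
        if attempt < retries:
            time.sleep(delay)  # brief pause before trying again
    return False
```

With something along these lines, switching from a local `cp` onto an NFS mount to `scp` or `xrdcp` would only mean changing the template string, and a failed transfer would be retried rather than leaving the connection in a bad state.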
Andy
From: Alex Bogert
Sent: Tuesday, April 26, 2011 11:32 AM
Subject: Xrootd CNSD bugged
Hi Everyone,
I recently found an error in our xrootd setup. The CNS daemons are no longer
reporting correctly to the redirector's inventory. When I noticed files
missing in the inventory I rebooted the xrootd system and saw some strange
behavior. Each of the worker nodes connects to the redirector and begins
updating its inventory, but after about 100 - 200 updates, the local
XrdCnsd reports a trashed connection to the redirector. Also, if I disable
xrootdfs, about 100 extra updates will get through before the connection is
trashed. I am stumped as to the cause of this problem. I've attached the logs
from the redirector and from one worker node after a restart.
I would appreciate any advice on how to resolve this.
Cheers,
Alex
Current xroot processes running on redirector:
$ ps aux | grep xroot
xrootd 8975 0.0 0.0 45424 4104 ? Sl Apr22 0:00 /opt/osg-v1.2/xrootd/bin//xrootd -l /opt/osg-v1.2/xrootd/var/logs/xrdlog.atlas01.ucsc.edu -c /opt/osg-v1.2/xrootd/etc/xrootd.cfg
xrootd 8976 0.0 0.0 41084 3856 ? Sl Apr22 0:00 /opt/osg-v1.2/xrootd/bin//xrootd -n cns -l /opt/osg-v1.2/xrootd/var/logs/xrdlog.atlas01.ucsc.edu -c /opt/osg-v1.2/xrootd/etc/xrootd.cfg
xrootd 9060 0.0 0.0 121032 4236 ? Sl Apr22 0:00 /opt/osg-v1.2/xrootd/bin//cmsd -l /opt/osg-v1.2/xrootd/var/logs/cmslog.atlas01.ucsc.edu -c /opt/osg-v1.2/xrootd/etc/xrootd.cfg
xrootd 9206 0.0 0.0 368432 12228 ? Ssl Apr22 0:01 /opt/osg-v1.2/xrootdfs/bin/xrootdfsd /xrootdfs/atlas -o allow_other,fsname=xrootdfs,max_write=131072,attr_timeout=10,entry_timeout=10
Current xroot processes running on a worker (other workers look the same):
$ ps aux | grep xroot
1060 14469 0.0 0.0 61148 760 pts/1 S+ 11:19 0:00 grep xroot
xrootd 19697 0.0 0.0 49684 4412 ? Sl Apr22 0:00 /opt/osg-v1.2/xrootd/bin//xrootd -l /opt/osg-v1.2/xrootd/var/logs/xrdlog.wrk2prv.ucsc.edu -c /opt/osg-v1.2/xrootd/etc/xrootd.cfg
xrootd 19716 0.0 0.0 123740 12672 ? Sl Apr22 0:00 /opt/osg-v1.2/xrootd/bin/XrdCnsd -d -D 2 -i 90 -b atlas01.ucsc.edu:1095:/atlas/inventory
xrootd 19778 0.0 0.0 46020 3864 ? Sl Apr22 0:00 /opt/osg-v1.2/xrootd/bin//cmsd -l /opt/osg-v1.2/xrootd/var/logs/cmslog.wrk2prv.ucsc.edu -c /opt/osg-v1.2/xrootd/etc/xrootd.cfg