Print

Print


  Hi Fulvio,

On Mon, Apr 04, 2005 at 11:55:06AM +0200, Peter Elmer wrote:
> On Mon, Apr 04, 2005 at 11:40:40AM +0200, Fulvio Galeazzi wrote:
> >     Enrica started xrootd/olbd on a new disk server last friday and 
> > noticed the following funny message in the redirector log file:
> > 
> > 050404 10:24:50 24196 olb_Manager: Server 
> > bbr-stk01.cr.cnaf.infn.it:34459 logged in.
> > 
> >   Any idea why the olbd is not using the usual 1094 port? We checked in 
> > the configuration files and could not find anything different wrt other 
> > datamovers.
> 
>   The olbd doesn't (by default) run on port 1094. It is the xrootd that
> (by default) would run on port 1094. Normally you would tell it which
> port to use in the config file with the "olb.port" config directive.
> 
>   Hmm, I just checked on that machine and the _xrootd_ is definitely
> running on port 1094 and not port 34459, so something strange happened.
> 
>   I'm in the process of screwing around with things on bbr-stk1, so
> don't be surprised...

  Something strange is going on. There must be some small bug or race
condition in reporting the dataserver xrootd port number to the redirector 
olbd. (I restarted the the olbd on rdr01 and now it has the proper 1094
port number for bbr-stk01, but rdr02 still has a screwy port number.) When 
Andy arrives this afternoon we'll take a look at it.

  (BTW, I see that this machine is running SL3. This means that you have
RH73, RH9, SL3, SLC3 _and_ RHEL3 running on these machines, no? Or was
there also a machine with RH8?)  

>                                    Pete
> 
> >   Sure enough, I cannot access collection on bbr-stk01...
> > 
> > 
> > galeazzi@bbr-fe02 ~> KanCollUtil 
> > /store/PR/R14/AllEvents/0004/55/14.3.2/AllEvents_00045581_14.3.2V00
> > 2005-04-04 11:13:07 18611 Err : TUnixSystem::GetServiceByName  - no 
> > service "rootd" with protocol "tcp"
> > 
> > 2005-04-04 11:13:07 18611 SysError: TUnixSystem::UnixTcpConnect    - 
> > connect (bbr-stk01.cr.cnaf.infn.it:34459) (Connection refused)
> > 2005-04-04 11:13:07 18611 Err : TXPhyConnection::Connect       - can't 
> > open connection to xrootd/rootd on host [bbr-stk01.cr.cnaf.infn.it:34459]
> > 2005-04-04 11:13:07 18611 Err : TXNetConn::TXNetFile           - Error 
> > creating logical connection with [bbr-stk01.cr.cnaf.infn.it:34459]
> > 2005-04-04 11:13:07 18611 Err : TXNetConn::GoToAnotherServer   - Error 
> > connecting to [bbr-stk01.cr.cnaf.infn.it:34459]
> > 
> >   In the xrootd logfile on bbr-stk01 i read:
> > 
> > ...
> > 050404 10:24:40 001 Prep log directory not specified; prepare tracking 
> > disabled.
> > 050404 10:24:40 001 Exporting /store
> > 050404 10:24:40 001 XRootd protocol version 2.3.0 build 20050328-0656 
> > successfully loaded.
> > 050404 10:24:40 001 [log in to unmask]:1094 initialization 
> > completed.
> > 050404 10:24:50 1553 odc_olb: Connected to olb via /tmp/.olb/olbd.admin
> > 
> >   whereas on another datamover, where things seem to be working, I get 
> > an error/warning message:
> > 
> > ...
> > 050404 11:12:52 001 Exporting /store
> > 050404 11:12:52 001 XRootd protocol version 2.3.0 build 20050328-0656 
> > successfully loaded.
> > 050404 11:12:52 001 [log in to unmask]:1094 
> > initialization completed.
> > 050404 11:12:52 25036 odc_Open: Unable to connect socket to 
> > /tmp/.olb/olbd.admin; connection refused
> > 
> >   I cannot remember whether I should expect to see that warning, nor I 
> > remember what the meaning is, can you please remind me?
> > 
> > 
> >   Thanks!
> >   Ciao ciao
> > 
> > 				Fulvio
> 
> 
> 
> -------------------------------------------------------------------------
> Peter Elmer     E-mail: [log in to unmask]      Phone: +41 (22) 767-4644
> Address: CERN Division PPE, Bat. 32 2C-14, CH-1211 Geneva 23, Switzerland
> -------------------------------------------------------------------------


                                   Pete

-------------------------------------------------------------------------
Peter Elmer     E-mail: [log in to unmask]      Phone: +41 (22) 767-4644
Address: CERN Division PPE, Bat. 32 2C-14, CH-1211 Geneva 23, Switzerland
-------------------------------------------------------------------------