Hi Manfred, On Thu, Nov 11, 2004 at 09:52:28AM +0100, Manfred Alef wrote: > the redirector server was upgraded to SL 3.03. Now we could > start olbd from xrootd's RHEL RPM without any problem. Ok, that is strange. The only guess I have is that it could have been the xinetd stuff interfering with starting by hand (if that is what you did). In any case, I looked at the logs for the xrootd/olbd on babar2 and there is still a problem. Normally the xrootd should connect to the olbd on the same machine, but there are errors in the xrootd log file: 041110 09:43:54 3349 odc_Manager: Connected to l01-001-122 041110 09:43:54 3349 odc_GetLine: Unable to reading request ; connection reset b y peer 041110 09:43:54 3349 odc_Manager: Unable to receive msg from l01-001-122; connec tion reset by peer and in the olbd log file: 041110 09:43:54 3361 olb_Accept: Unable to accept connection from l01-001-122.gr idka.de; permission denied Looking at the config file, I see: olb.allow host l01-001-122 # babar2.fzk.de olb.allow host f01-001-121 olb.allow host f01-001-122 I think you may need to specify the full hostname, including domain, i.e. olb.allow host l01-001-122.gridka.de # babar2.fzk.de olb.allow host f01-001-121.gridka.de olb.allow host f01-001-122.gridka.de Does that work? Pete > Peter Elmer wrote: > > Hi Manfred and Rolf, > > > > Sorry for the late reply. (You picked a somewhat awkward time to try this > > since Andy is away and I'm just back from vacation in a series of > > meetings/transits this past week!) > > > > I'll give this a try to see if I can reproduce it. I see, however, that > > you restarted things yesterday: > > > > root 3349 3082 0 Nov10 pts/0 00:00:11 /opt/xrootd/bin/xrootd -r -l /var/log/babar2.xrdlog -c /opt/xrootd/etc/redirector.cf > > root 3361 3082 0 Nov10 pts/0 00:00:11 /opt/xrootd/bin/olbd -m -l /var/log/babar2.olblog -c /opt/xrootd/etc/redirector.cf > > > > and I don't see anything in the log files about "Unable to bind socket; > > address already in use". There are other problems related to the dataservers > > connecting to the redirector, I think, but I'll look at those now. > > > > One thing that I recall is that Jos and I looked at setting up xinitd > > style restarts of the server. That wasn't still there, was it? (I don't > > see it now, but presumably it would have interfered with separate attempts > > to start the daemons by hand.) > > > > Pete > > > > On Fri, Nov 05, 2004 at 01:00:36PM +0100, Manfred Alef wrote: > > > >>Hi Pete, > >> > >>the config files are from http://xrootd.slac.stanford.edu/ > >>examples/multserver/index.html. > >> > >>Best regards > >>Manfred > >> > >>babar2 # cat redirector.cf > >># > >># redirector.cf > >># > >># xrootd > >>#+xrootd.fslib /opt/xrootd/lib/libXrdOfs.so > >>xrootd.fslib /usr/local/xrootd/lib/i386_linux24/libXrdOfs.so > >>xrootd.export /data > >>odc.manager l01-001-122 3121 > >>odc.trace redirect > >># olbd > >>olb.port 3121 > >>#+olb.allow host kanrdr.slac.stanford.edu > >>#+olb.allow host kan001.slac.stanford.edu > >>#+olb.allow host kan002.slac.stanford.edu > >>olb.allow host l01-001-122 # babar2.fzk.de > >>olb.allow host f01-001-121 > >>olb.allow host f01-001-122 > >>babar2 # > >> > >>[root@f01-001-122 etc]# cat dataserver.cf > >># > >># dataserver.cf > >># > >># xrootd > >>#+xrootd.fslib /opt/xrootd/lib/libXrdOfs.so > >>xrootd.fslib /usr/local/xrootd/lib/i386_linux24/libXrdOfs.so > >>xrootd.export /data > >>oss.readonly > >>odc.manager l01-001-122 3121 > >># olbd > >>olb.port 3121 > >>olb.subscribe l01-001-122 3121 > >>[root@f01-001-122 etc]# > >> > >> > >> > >>Peter Elmer wrote: > >> > >>> [CC the xrootd mailing list] > >>> > >>> Hi Rolf, > >>> > >>> Do you have the config files you are using to try to start xrootd and > >>>the olbd (on the redirector and the file servers)? > >>> > >>> Pete > >>> > >>>On Fri, Nov 05, 2004 at 11:33:47AM +0100, Manfred Alef wrote: > >>> > >>> > >>>>Hi Pete, > >>>> > >>>>I am sitting here at GridKa together with Manfred Alef and we are trzing > >>>>to install xrootd on two of the fileservers and on babar2, a login > >>>>mashine which will also be the redirector. > >>>>We use the current production versin and had no problems starting xrootd > >>>>and albd on one of the fileservers. However, when we trz to start the > >>>>olbd on the redirector, it exits with exit code 1. The logfile is > >>>>attached. We made sure nothing else is going on on the mashine (reboot) > >>>>and also removed anz old socket we could find in /tmp/.olb/ > >>>>Do you have an idea what could go wrong or what else we could try? > >>>> > >>>>Cheers, Rolf > >>>> > >>>>--------------------------------------------------------------- > >>>>41105 10:44:31 32156 olb_Config: (c) 2004 SLAC olbd version > >>>>20040907-0403 initializing as Manager > >>>>041105 10:44:31 32156 olb_Bind: Unable to bind socket; address already > >>>>in use > >>>>041105 10:44:31 32156 olb_Config: Manager initialization failed. > >>>>041105 10:46:15 32191 olb_Config: (c) 2004 SLAC olbd version > >>>>20040907-0403 initializing as Manager > >>>>041105 10:46:15 32191 olb_Bind: Unable to bind socket; address already > >>>>in use > >>>>041105 10:46:15 32191 olb_Config: Manager initialization failed. > >>>>041105 10:48:49 32248 Schedule scheduling midnight runner in 47471 seconds > >>>>041105 10:48:49 32248 olb_Config: (c) 2004 SLAC olbd version > >>>>20040907-0403 initializing as Manager > >>>>041105 10:48:49 32248 olb_Bind: Unable to bind socket; address already > >>>>in use > >>>>041105 10:48:49 32248 olb_Config: Manager initialization failed. > >>>>041105 11:10:37 3175 olb_Config: (c) 2004 SLAC olbd version > >>>>20040907-0403 initializing as Manager > >>>>041105 11:10:37 3175 olb_Bind: Unable to bind socket; address already in use > >>>>041105 11:10:37 3175 olb_Config: Manager initialization failed. > >>>>041105 11:18:16 3332 olb_Config: (c) 2004 SLAC olbd version > >>>>20040907-0403 initializing as Manager > >>>>041105 11:18:16 3332 olb_Bind: Unable to bind socket; address already in use > >>>>041105 11:18:16 3332 olb_Config: Manager initialization failed. > >>>>---------------------------------------------------------------- > >>> > >>> > >>> > >>> > >>>------------------------------------------------------------------------- > >>>Peter Elmer E-mail: [log in to unmask] Phone: +41 (22) 767-4644 > >>>Address: CERN Division PPE, Bat. 32 2C-14, CH-1211 Geneva 23, Switzerland > >>>------------------------------------------------------------------------- > >>> > >> > >> > >>-- > >> > >>Mit freundlichen Gruessen > >>Manfred Alef > >> > >>________________________________________________________________ > >>Manfred Alef > >>Forschungszentrum Karlsruhe GmbH > >>Institut f. Wissenschaftliches Rechnen (IWR) > >>Hermann-von-Helmholtz-Platz 1, D-76344 Eggenstein-Leopoldshafen > >>Tel.: (07247) 82-5732, Fax: (07247) 82-4972 > >>Email: [log in to unmask] > > > > > > > > > > ------------------------------------------------------------------------- > > Peter Elmer E-mail: [log in to unmask] Phone: +41 (22) 767-4644 > > Address: CERN Division PPE, Bat. 32 2C-14, CH-1211 Geneva 23, Switzerland > > ------------------------------------------------------------------------- > > > > > -- > > Mit freundlichen Gruessen > Manfred Alef > > ________________________________________________________________ > Manfred Alef > Forschungszentrum Karlsruhe GmbH > Institut f. Wissenschaftliches Rechnen (IWR) > Hermann-von-Helmholtz-Platz 1, D-76344 Eggenstein-Leopoldshafen > Tel.: (07247) 82-5732, Fax: (07247) 82-4972 > Email: [log in to unmask] > ------------------------------------------------------------------------- Peter Elmer E-mail: [log in to unmask] Phone: +41 (22) 767-4644 Address: CERN Division PPE, Bat. 32 2C-14, CH-1211 Geneva 23, Switzerland -------------------------------------------------------------------------