Hi! Any news on this? Sorry for thread bumping but i have an storage server waiting to be put online and spurious test error with "no space left on disk" and i don't know what to do... Can i help the debugging process somehow? Thank you! Adrian On 10/12/2013 10:20 AM, Adrian Sevcenco wrote: > On 10/12/2013 12:44 AM, Andrew Hanushevsky wrote: >> Hi Adrian, > Hi! > >> Very odd. You must have a log somewhere that includes xrootd >> initialization. I'd like to see that to find out why it thinks the >> config is not what appears to be in the config file. Could you send me >> the snippet that just includes the lines from start-up to >> "initialization completed". > below the outputs with cmslog and xrdlog (preview : cms see corect size > in xrd:: storage system initialization i have double inclusion of the > storage filesystems) > > Thanks!!! > Adrian > > in cmslog : > Copr. 2004-2012 Stanford University, xrd version v3.3.2 > ++++++ cmsd [log in to unmask] initialization started. > Config using configuration file > /home/aliprod/xrdserver/etc/xrootd/server/xrootd.cf > =====> xrd.protocol xrootd * > =====> all.adminpath /home/aliprod/xrdserver/admin > Config warning: ignoring unknown xrd directive 'pidpath'. > =====> xrd.pidpath > =====> xrd.port 1094 > =====> xrd.sched mint 32 maxt 2048 avlt 512 idle 780 > =====> xrd.network buffsz 0 nodnr > Config maximum number of connections restricted to 65000 > Copr. 2007 Stanford University/SLAC cmsd. > ++++++ [log in to unmask] phase 1 initialization started. > =====> all.export / nolock r/w nocheck norcreate > =====> all.role server > =====> all.manager rd.spacescience.ro 3122 > =====> all.adminpath /home/aliprod/xrdserver/admin > =====> cms.pidpath /home/aliprod/xrdserver/admin > =====> cms.sched cpu 10 io 10 space 80 > =====> oss.defaults nomig nodread nocheck norcreate nolock > =====> oss.namelib > /home/aliprod/xrdserver/lib64/libXrdAggregatingName2Name.so > /home/aliprod/data > =====> oss.localroot /storage01/xrdnamespace > =====> cms.space min 1g 1g > The following paths are available to the redirector: > w / > > ------ [log in to unmask] phase 1 server initialization > completed. > ++++++ [log in to unmask] phase 2 server initialization > started. > 131012 10:07:18 13529 Configure2 Global System Identification: anon-s > 3122rd.spacescience.ro > Plugin loaded unversioned XrdOucgetName2Name from namelib > /home/aliprod/xrdserver/lib64/libXrdAggregatingName2Name.so > ++++++ XrdAggregatingN2N initializing. Local lfn prefix > '/home/aliprod/data'. > ++++++ XrdAggregatingN2N initializing. Remote root is null > ++++++ Storage system initialization started. > =====> all.export / nolock r/w nocheck norcreate > =====> oss.defaults nomig nodread nocheck norcreate nolock > =====> oss.alloc 512M 2 0 > =====> oss.fdlimit * max > =====> oss.space public /storage01/xrddata > =====> oss.space public /storage02/xrddata > =====> oss.space public /storage03/xrddata > =====> oss.namelib > /home/aliprod/xrdserver/lib64/libXrdAggregatingName2Name.so > /home/aliprod/data > =====> oss.localroot /storage01/xrdnamespace > Plugin loaded unversioned XrdOucgetName2Name from namelib > /home/aliprod/xrdserver/lib64/libXrdAggregatingName2Name.so > ++++++ XrdAggregatingN2N initializing. Local lfn prefix > '/home/aliprod/data'. > ++++++ XrdAggregatingN2N initializing. Remote root is null > ++++++ Configuring standalone mode . . . > 131012 10:07:18 13529 oss_AioInit: started AIO read signal thread; > tid=3135912256 > 131012 10:07:18 13529 oss_AioInit: started AIO write signal thread; > tid=3136964928 > XrdAggregatingN2N processing. buff='/' > XrdAggregatingN2N processing. buff='/home/aliprod/data/' > XrdAggregatingN2N processing. buff='/' > XrdAggregatingN2N processing. buff='/home/aliprod/data/' > Config effective /home/aliprod/xrdserver/etc/xrootd/server/xrootd.cf oss > configuration: > oss.alloc 536870912 2 0 > oss.cachescan 600 > oss.fdlimit 32500 65000 > oss.maxsize 0 > oss.namelib > /home/aliprod/xrdserver/lib64/libXrdAggregatingName2Name.so > oss.localroot /storage01/xrdnamespace > oss.trace fff > oss.xfr 1 deny 10800 keep 1200 > oss.memfile off max 2071848960 > oss.space public /storage01/xrddata > oss.space public /storage02/xrddata > oss.space public /storage03/xrddata > oss.defaults r/w nocheck nodread nomig norcreate nopurge > nostage xattr > oss.path / r/w nocheck nodread nomig norcreate nopurge nostage > xattr > ------ Storage system initialization completed. > 131012 10:07:18 13529 Start Srv=0 dfs=0 lcl=0 Pre=1 dmLife=0 0 > 131012 10:07:18 13529 Start Lim=0 0 fix=0 Qmax=1 > 131012 10:07:18 13529 calcSpace New fs info; maxfree=6473MB utilized=100% > 131012 10:07:18 13529 Meter: Found 3 filesystem(s); 39TB total (100% > util); 18GB free (6GB max) > ------ [log in to unmask] phase 2 server initialization > completed. > > > ################################################################## > > xrdlog: > > > 131012 10:07:18 13496 Scalla is starting. . . > Copr. 2004-2012 Stanford University, xrd version v3.3.2 > > ++++++ xrootd [log in to unmask] initialization started. > Config using configuration file > /home/aliprod/xrdserver/etc/xrootd/server/xrootd.cf > =====> xrd.protocol xrootd * > =====> all.adminpath /home/aliprod/xrdserver/admin > Config warning: ignoring unknown xrd directive 'pidpath'. > =====> xrd.pidpath > =====> xrd.port 1094 > =====> xrd.sched mint 32 maxt 2048 avlt 512 idle 780 > =====> xrd.network buffsz 0 nodnr > Config maximum number of connections restricted to 65000 > Copr. 2012 Stanford University, xrootd protocol 2.9.7 version v3.3.2 > ++++++ xrootd protocol initialization started. > 131012 10:07:18 13496 XrootdConfig: non-absolute export path - server > =====> all.export / nolock > =====> xrootd.async off > =====> xrootd.fslib /home/aliprod/xrdserver/lib64/libXrdxFtsOfs.so > =====> xrootd.seclib /home/aliprod/xrdserver/lib64/libXrdSec.so > Config exporting / > Plugin loaded > > > ++++++ Authentication system initialization started. > sec_PM: Loading unix protocol object from > /home/aliprod/xrdserver/lib64/libXrdSecunix.so > Plugin loaded > =====> sec.protocol /home/aliprod/xrdserver/lib64 unix > Config 1 authentication directives processed in > /home/aliprod/xrdserver/etc/xrootd/server/xrootd.cf > 131012 10:07:18 13496 sec_ProtBind_Complete: Default sectoken built: > '&P=unix' > ------ Authentication system initialization completed. > Plugin loaded unversioned XrdSfsGetFileSystem from fslib > /home/aliprod/xrdserver/lib64/libXrdxFtsOfs.so > ++++++ (c) 2012 CERN/IT-DSS v 2.0 > > > ++++++ Storage system initialization started. > =====> all.export / nolock r/w nocheck norcreate > =====> oss.defaults nomig nodread nocheck norcreate nolock > =====> oss.alloc 512M 2 0 > =====> oss.fdlimit * max > =====> oss.space public /storage01/xrddata > =====> oss.space public /storage02/xrddata > =====> oss.space public /storage03/xrddata > =====> oss.namelib > /home/aliprod/xrdserver/lib64/libXrdAggregatingName2Name.so > /home/aliprod/data > =====> oss.localroot /storage01/xrdnamespace > Plugin loaded unversioned XrdOucgetName2Name from namelib > /home/aliprod/xrdserver/lib64/libXrdAggregatingName2Name.so > ++++++ XrdAggregatingN2N initializing. Local lfn prefix > '/home/aliprod/data'. > ++++++ XrdAggregatingN2N initializing. Remote root is null > 131012 10:07:18 13496 oss_AioInit: started AIO read signal thread; > tid=52988224 > 131012 10:07:18 13496 oss_AioInit: started AIO write signal thread; > tid=54040896 > XrdAggregatingN2N processing. buff='/' > XrdAggregatingN2N processing. buff='/home/aliprod/data/' > XrdAggregatingN2N processing. buff='/' > XrdAggregatingN2N processing. buff='/home/aliprod/data/' > Config effective /home/aliprod/xrdserver/etc/xrootd/server/xrootd.cf oss > configuration: > oss.alloc 536870912 2 0 > oss.cachescan 600 > oss.fdlimit 32500 65000 > oss.maxsize 0 > oss.namelib > /home/aliprod/xrdserver/lib64/libXrdAggregatingName2Name.so > oss.localroot /storage01/xrdnamespace > oss.trace fff > oss.xfr 1 deny 10800 keep 1200 > oss.memfile off max 2071848960 > oss.space public /storage01/xrddata > oss.space public /storage02/xrddata > oss.space public /storage03/xrddata > oss.defaults r/w nocheck nodread nomig norcreate nopurge > nostage xattr > oss.path / r/w nocheck nodread nomig norcreate nopurge nostage > xattr > ------ Storage system initialization completed. > > > =====> ftsofs.thirdparty: yes > =====> ftsofs.thirdparty.slots: 20 > =====> ftsofs.thirdparty.rate: 50 Mb/s > =====> ftsofs.thirdparty.statedirectory: /home/aliprod/xrdserver/admin > ++++++ File system initialization started. > =====> all.role server > =====> ofs.trace open > =====> ofs.authlib /home/aliprod/xrdserver/lib64/libXrdAliceTokenAcc.so > =====> ofs.authorize > Plugin loaded unversioned XrdAccAuthorizeObject from authlib > /home/aliprod/xrdserver/lib64/libXrdAliceTokenAcc.so > ++++++ (c) 2008 CERN/IT-DM-SMD AliceTokenAcc (Alice Token Access > Authorization) v 1.0 > =====> XrdAliceTokenAcc: No Authorizationfile set via environment > variable 'TTOKENAUTHZ_AUTHORIZATIONFILE' > =====> XrdAliceTokenAcc: No Authorizationfile like > '/etc/grid-security/xrootd/TkAuthz.Authorization' found > =====> XrdAliceTokenAcc: No Authorizationfile like > '/home/aliprod/.globus/xrootd/TkAuthz.Authorization' found > =====> XrdAliceTokenAcc: Using Authorizationfile > '/home/aliprod/.authz/xrootd/TkAuthz.Authorization'! > ------ AliceTokenAcc initialization completed > > > ++++++ Storage system initialization started. > =====> all.export / nolock r/w nocheck norcreate > =====> oss.defaults nomig nodread nocheck norcreate nolock > =====> oss.alloc 512M 2 0 > =====> oss.fdlimit * max > =====> oss.space public /storage01/xrddata > =====> oss.space public /storage02/xrddata > =====> oss.space public /storage03/xrddata > =====> oss.namelib > /home/aliprod/xrdserver/lib64/libXrdAggregatingName2Name.so > /home/aliprod/data > =====> oss.localroot /storage01/xrdnamespace > Plugin loaded unversioned XrdOucgetName2Name from namelib > /home/aliprod/xrdserver/lib64/libXrdAggregatingName2Name.so > > ++++++ XrdAggregatingN2N initializing. Local lfn prefix > '/home/aliprod/data'. > ++++++ XrdAggregatingN2N initializing. Remote root is null > 131012 10:07:18 13496 oss_AioInit: started AIO read signal thread; > tid=63666496 > 131012 10:07:18 13496 oss_AioInit: started AIO write signal thread; > tid=64719168 > XrdAggregatingN2N processing. buff='/' > XrdAggregatingN2N processing. buff='/home/aliprod/data/' > XrdAggregatingN2N processing. buff='/' > XrdAggregatingN2N processing. buff='/home/aliprod/data/' > Config effective /home/aliprod/xrdserver/etc/xrootd/server/xrootd.cf oss > configuration: > oss.alloc 536870912 2 0 > oss.cachescan 600 > oss.fdlimit 32500 65000 > oss.maxsize 0 > oss.namelib > /home/aliprod/xrdserver/lib64/libXrdAggregatingName2Name.so > oss.localroot /storage01/xrdnamespace > oss.trace fff > oss.xfr 1 deny 10800 keep 1200 > oss.memfile off max 2071848960 > oss.space public /storage01/xrddata > oss.space public /storage02/xrddata > oss.space public /storage03/xrddata > oss.space public /storage01/xrddata > oss.space public /storage02/xrddata > oss.space public /storage03/xrddata > oss.defaults r/w nocheck nodread nomig norcreate nopurge > nostage xattr > oss.path / r/w nocheck nodread nomig norcreate nopurge nostage > xattr > ------ Storage system initialization completed. > > > ++++++ Configuring server role. . . > =====> all.manager rd.spacescience.ro 3122 > =====> all.adminpath /home/aliprod/xrdserver/admin > 131012 10:07:18 13496 Configure Global System Identification: anon-s > 3122rd.spacescience.ro > Config effective /home/aliprod/xrdserver/etc/xrootd/server/xrootd.cf ofs > configuration: > ofs.role server > ofs.authorize > ofs.authlib /home/aliprod/xrdserver/lib64/libXrdAliceTokenAcc.so > ofs.maxdelay 60 > ofs.persist manual hold 600 logdir > /home/aliprod/xrdserver/admin/.ofs/posc.log > ofs.trace 4 > ------ File system server initialization completed. > 131012 10:07:18 13496 none ftsofs_configure: Configured > Config warning: asynchronous I/O has been disabled! > Config warning: 'xrootd.prepare logdir' not specified; prepare tracking > disabled. > 131012 10:07:18 13523 cms_Open: Unable to connect socket to > /home/aliprod/xrdserver/admin/.olb/olbd.admin; connection refused > ------ xrootd protocol initialization completed. > ------ xrootd [log in to unmask]:1094 initialization completed. > > > > > > > >> >> Andy >> >> On Fri, 11 Oct 2013, Adrian Sevcenco wrote: >> >>> On 10/11/2013 12:57 AM, Andrew Hanushevsky wrote: >>>> Hi Adrian, >>> Hi! >>> >>>> Looks to me that the double space problem comes from the fact that >>>> /storage01 has been included twice in the configuration. Additionally, >>> in configuration i have nothing twice .. >>> >>>> it would appear that /storage02 and /storage03 have not been >>>> included in >>>> the configuration. Could you send me your config file? I suspect that >>>> this is the problem here. >>> all config files attached .. xrootd.xrootd.cf is (re) created based on >>> system.cnf file >>> >>> Thanks for helping me with this! >>> Adrian >>> >>>> >>>> Andy >>>> >>>> -----Original Message----- From: Adrian Sevcenco >>>> Sent: Thursday, October 10, 2013 12:04 AM >>>> To: Andrew Hanushevsky >>>> Cc: [log in to unmask] >>>> Subject: Re: xrootd :: server 3.3.2 bug :: double size reporting >>>> >>>> On 10/10/2013 08:52 AM, Adrian Sevcenco wrote: >>>>> On 10/10/2013 03:26 AM, Andrew Hanushevsky wrote: >>>>>> Hi Adrian, >>>>> Hi! >>>>> >>>>>> Could you try to use xrdfs (the new client based xrd replacement) >>>>>> to see >>>>>> what you get there? >>>>> it reports double space also: >>>>> aliprod@storage03: ~ $ xrdfs localhost statvfs / >>>>> Path: / >>>>> Nodes with RW space: 1 >>>>> Size of RW space (MB): 7978 >>>> this is not so informative it seems .. it reports only ~8 Gb space! >>>> >>>> aliprod@storage03: ~ $ xrdfs localhost query space / >>>> oss.cgroup=public&oss.space=86636386320384&oss.free=11850958060&oss.maxf=2089872388&oss.used=10885940771&oss.quota=-1 >>>> >>>> >>>> >>>> >>>> aliprod@storage03: ~ $ xrdfs localhost query stats total >>>> <statistics tod="1381388316" ver="v3.3.2" >>>> src="storage03.spacescience.ro:1094" tos="1381231940" pgm="xrootd" >>>> ins="anon" pid="12987" site=""><stats >>>> id="info"><host>storage03.spacescience.ro</host><port>1094</port><name>anon</name></stats><stats >>>> >>>> >>>> >>>> id="buff"><reqs>105006</reqs><mem>137090048</mem><buffs>180</buffs><adj>0</adj></stats><stats >>>> >>>> >>>> >>>> id="link"><num>10</num><maxn>73</maxn><tot>28885</tot><in>18096047509</in><out>1510999236517</out><ctime>1995692</ctime><tmo>45893</tmo><stall>0</stall><sfps>0</sfps></stats><stats >>>> >>>> >>>> >>>> id="poll"><att>10</att><en>49466</en><ev>45888</ev><int>0</int></stats><stats >>>> >>>> >>>> >>>> id="proc"><usr><s>40</s><u>428853</u></usr><sys><s>2509</s><u>239537</u></sys></stats><stats >>>> >>>> >>>> >>>> id="xrootd"><num>28884</num><ops><open>40772</open><rf>0</rf><rd>19695905</rd><pr>0</pr><rv>931313</rv><rs>10051282</rs><wr>4169</wr><sync>0</sync><getf>0</getf><putf>0</putf><misc>86049</misc></ops><aio><num>0</num><max>0</max><rej>0</rej></aio><err>201</err><rdr>0</rdr><dly>0</dly><lgn><num>28882</num><af>0</af><au>28880</au><ua>0</ua></lgn></stats><stats >>>> >>>> >>>> >>>> id="ofs"><role>server</role><opr>3</opr><opw>0</opw><opp>0</opp><ups>0</ups><han>3</han><rdr>0</rdr><bxq>0</bxq><rep>0</rep><err>0</err><dly>0</dly><sok>0</sok><ser>0</ser><tpc><grnt>0</grnt><deny>0</deny><err>0</err><exp>0</exp></tpc></stats><stats >>>> >>>> >>>> >>>> id="oss" v="2"><paths>2<stats >>>> id="0"><lp>"/"</lp><rp>"/storage01/xrdnamespace/home/aliprod/data/"</rp><tot>13959964628</tot><free>7213124</free><ino>1772814336</ino><ifr>1769450005</ifr></stats><stats >>>> >>>> >>>> >>>> id="1"><lp>"/"</lp><rp>"/storage01/xrdnamespace/home/aliprod/data/"</rp><tot>13959964628</tot><free>7213124</free><ino>1772814336</ino><ifr>1769450005</ifr></stats></paths><space>2<stats >>>> >>>> >>>> >>>> id="0"><name>public</name><tot>84605846016</tot><free>2010157</free><maxf>443841</maxf><fsn>6</fsn><usg>10693215</usg></stats></space></stats><stats >>>> >>>> >>>> >>>> id="sched"><jobs>78587</jobs><inq>0</inq><maxinq>2</maxinq><threads>28</threads><idle>27</idle><tcr>28</tcr><tde>0</tde><tlimr>0</tlimr></stats><stats >>>> >>>> >>>> >>>> id="sgen"><as>1</as><et>9</et><toe>1381388316</toe></stats></statistics> >>>> >>>> >>>> the problem is more clear in this output.. >>>> >>>> Thanks! >>>> Adrian >>>> >>>> >>>> >>>> >>>>> Utilization of RW space (%): c >>>>> Nodes with staging space: 0 >>>>> Size of staging space (MB): 0 >>>>> Utilization of staging space (%): >>>>> >>>>> >>>>> Thanks! >>>>> Adrian >>>>> >>>>> >>>>>> >>>>>> Andy >>>>>> >>>>>> -----Original Message----- From: Adrian Sevcenco >>>>>> Sent: Tuesday, October 08, 2013 4:58 AM >>>>>> To: [log in to unmask] >>>>>> Subject: xrootd :: server 3.3.2 bug :: double size reporting >>>>>> >>>>>> Hi! I have a nagging problem with the reporting of size in xrootd: >>>>>> >>>>>> aliprod@storage03: ~ $ echo exit | ~/xrdserver/bin/xrd localhost >>>>>> queryspace / - >>>>>> Disk space approximations (MB): >>>>>> Total : 82622896 >>>>>> Free : 39192 >>>>>> Used : 0 >>>>>> Largest chunk : 7190 >>>>>> >>>>>> aliprod@storage03: ~ $ df -B M | grep storage >>>>>> /dev/sdc1 13632778M 13626433M 6346M 100% /storage01 >>>>>> /dev/sdc2 13632778M 13625588M 7191M 100% /storage02 >>>>>> /dev/sdc3 14045893M 14039832M 6061M 100% /storage03 >>>>>> >>>>>> aliprod@storage03: ~ $ df -BM | grep storage | awk 'BEGIN {total=0} >>>>>> {gsub("M",""); total+= $2;} END {print total}' >>>>>> 41311449 >>>>>> >>>>>> given that the problem is the queryspace result, i imagine that is a >>>>>> problem internal to xrd. ( it seems that the 3.2.6 version is >>>>>> ok.)(this >>>>>> are ALICE packaged versions) >>>>>> >>>>>> Any idea about the problem and how can i investigate more? >>>>>> Thanks a lot! >>>>>> Adrian >>>>>> >>>>>> >>>>>> ######################################################################## >>>>>> >>>>>> >>>>>> Use REPLY-ALL to reply to list >>>>>> >>>>>> To unsubscribe from the XROOTD-L list, click the following link: >>>>>> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1 >>>>> >>>>> >>>>> >>>>> ######################################################################## >>>>> >>>>> >>>>> Use REPLY-ALL to reply to list >>>>> >>>>> To unsubscribe from the XROOTD-L list, click the following link: >>>>> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1 >>>>> >>>> >>>> >>> >>> > > > > ######################################################################## > Use REPLY-ALL to reply to list > > To unsubscribe from the XROOTD-L list, click the following link: > https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1 ######################################################################## Use REPLY-ALL to reply to list To unsubscribe from the XROOTD-L list, click the following link: https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1