So it may be the proxy. CCing the xrootd developers list. Lukasz On 31.05.2013 15:44, Mol, Xavier (SCC) wrote: > Hi Lukasz, > > from the perspective of dCache, today either a transfer for /atlas/dq2/user/ilijav/HCtest/user.ilijav.HCtest.1/group.test.hc.NTUP_SMWZ.root (which equals /pnfs/gridka.de/atlas/disk-only/atlasdatadisk/rucio/user/ivukotic/a6/57/group.test.hc.NTUP_SMWZ.root which equals 0000000BDA41933C4E8A8EC629636A4F3624 for dCache) succeeded regularly, or this error occurred: > > "No connection from Xrootd client after 300 seconds. Giving up." > > Ciao, > Xavier. > >> -----Original Message----- >> From: Lukasz Janyst [mailto:[log in to unmask]] >> Sent: Friday, May 31, 2013 3:29 PM >> To: Mol, Xavier (SCC) >> Cc: Ilija Vukotic; Petzold, Andreas (SCC); Lukasz Janyst; [log in to unmask]; Lincoln Bryant; Wei Yang; dcache- >> [log in to unmask] >> Subject: Re: [dcache-admin] Re: default ReadCacheSize value and other questions >> >> Hi Xavier, >> >> I don't thinks so. I have no experience with dCache and this kind of >> setups, but looking at your configs it's either a problem in the proxy >> code (Wei, could you test it somehow?) or in dCache itself. >> >> Cheers, >> Lukasz >> >> On 31.05.2013 15:07, Mol, Xavier (SCC) wrote: >>> Hi Lukasz, >>> >>> this setup is following the instructions for beeing a proxy: https://twiki.cern.ch/twiki/bin/view/Atlas/FAXDECloud. As far as I know, >> we only deviated from that guide with regards to the GSI security configuration - we tried to grant dteam users (read) access >> additionally to ATLAS clients (https://twiki.cern.ch/twiki/bin/view/Atlas/XrootdVOMSsecPlugin). This should be unrelated to your >> issues, right? >>> >>> Ciao, >>> Xavier. >>> >>>> -----Original Message----- >>>> From: Lukasz Janyst [mailto:[log in to unmask]] >>>> Sent: Friday, May 31, 2013 2:40 PM >>>> To: Ilija Vukotic >>>> Cc: Petzold, Andreas (SCC); Lukasz Janyst; [log in to unmask]; Lincoln Bryant; Wei Yang; dcache- >>>> [log in to unmask] >>>> Subject: [dcache-admin] Re: default ReadCacheSize value and other questions >>>> >>>> Hi Andreas, >>>> >>>> The server behaves incorrectly and erratically when presented with a >>>> request for a chunk that exceeds the file boundary. Also, it sometimes >>>> refuses to open a file that it has stated correctly. Please see the >>>> attached test case. What kind of setup are you running on that box? Is >>>> it a proxy, standard xrootd, something else? >>>> >>>> With the attached test program, I get the following on lxplus: >>>> >>>> ]==> ./a.out >>>> File size: 797152257 >>>> Read 16777216 bytes at offset 788529152 >>>> Unable to read: [ERROR] Server responded with an error: [3011] Unable to >>>> read >>>> /atlas/dq2/user/ilijav/HCtest/user.ilijav.HCtest.1/group.test.hc.NTUP_SMWZ.root; >>>> no such file or directory >>>> >>>> ]==> ./a.out >>>> File size: 797152257 >>>> Read 16777216 bytes at offset 788529152 >>>> Status of all operations: OK >>>> Received bytes: 8388608 >>>> Should have received: 8623105 >>>> >>>> ]==> ./a.out >>>> File size: 797152257 >>>> Unable to open: [ERROR] Server responded with an error: [3005] Unable to >>>> open >>>> /atlas/dq2/user/ilijav/HCtest/user.ilijav.HCtest.1/group.test.hc.NTUP_SMWZ.root; >>>> operation canceled >>>> >>>> In the case where read returned the OK response but incorrect chunk, >>>> xrdcopy should have been able to detect the error. I will make sure it does. >>>> >>>> Cheers, >>>> Lukasz >>>> >>>> >>>> On 30.05.2013 20:35, Ilija Vukotic wrote: >>>>> Hi Andreas, >>>>> >>>>> would you mind trying the same command from lxplus? >>>>> >>>>> Ilija >>>>> >>>>> -- >>>>> Ilija Vukotic • Skype ivukotic • +1 872 230 6435 • University of Chicago >>>>> >>>>> >>>>> >>>>> On May 30, 2013, at 13:32 , Andreas Petzold <[log in to unmask]> >>>>> wrote: >>>>> >>>>>> Hi Ilija, >>>>>> >>>>>> yes, in both cases I was using 3.3.2. >>>>>> >>>>>> Andreas >>>>>> >>>>>> >>>>>> On 05/30/2013 08:25 PM, Ilija Vukotic wrote: >>>>>>> Hi, >>>>>>> Was the client 3.3.2? >>>>>>> >>>>>>> ilija >>>>>>> >>>>>>> Thumb typed >>>>>>> >>>>>>> ----- Reply message ----- >>>>>>> From: "Andreas Petzold" <[log in to unmask]> >>>>>>> To: "Ilija Vukotic" <[log in to unmask]> >>>>>>> Cc: "Lukasz Janyst" <[log in to unmask]>, >>>>>>> "[log in to unmask]" >>>>>>> <[log in to unmask]>, "Lincoln Bryant" >>>>>>> <[log in to unmask]>, "Wei Yang" <[log in to unmask]> >>>>>>> Subject: default ReadCacheSize value and other questions >>>>>>> Date: Thu, May 30, 2013 11:54 >>>>>>> >>>>>>> >>>>>>> >>>>>>> Hi Ilja, >>>>>>> >>>>>>> FYI, I'm able to run xrdcopy successfully for your file on my desktop at >>>>>>> KIT and on lxplus. >>>>>>> >>>>>>> Cheers, >>>>>>> >>>>>>> Andreas >>>>>>> >>>>>>> On 05/30/2013 06:04 PM, Ilija Vukotic wrote: >>>>>>>> Hi, >>>>>>>> >>>>>>>> I add Andreas and Guenter. Lincoln attempted to xrdcopy file form FZK today between 17:00 and 18:00 (your time) and failed. >>>> We would like to know what is in the servers log. Would you mind looking it up? >>>>>>>> (ordinary xrdcp works ok, but xrdcopy always fails from your site.) >>>>>>>> >>>>>>>> Thanks, >>>>>>>> Ilija >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> On May 30, 2013, at 10:57 , Lukasz Janyst <[log in to unmask]> >>>>>>>> wrote: >>>>>>>> >>>>>>>>> Yes, this is a message that the client gets from the server. Can you actually check what is happening server-side? >>>>>>>>> >>>>>>>>> Cheers, >>>>>>>>> Lukasz >>>>>>>>> >>>>>>>>> On 30.05.2013 17:55, Lincoln Bryant wrote: >>>>>>>>>> Hi all, >>>>>>>>>> >>>>>>>>>> I also get this error. >>>>>>>>>> >>>>>>>>>> [lincolnb@uct2-int ~]$ xrdcopy --streams 1 root://f01-140-115- >>>> e.gridka.de:1094//atlas/dq2/user/ilijav/HCtest/user.ilijav.HCtest.1/group.test.hc.NTUP_SMWZ.root - > /dev/null >>>>>>>>>> [ 98%][=================================================>] [752M/760M] >>>>>>>>>> Run: [ERROR] Server responded with an error: [3011] Unable to read >>>> /atlas/dq2/user/ilijav/HCtest/user.ilijav.HCtest.1/group.test.hc.NTUP_SMWZ.root; no such file or directory >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> [lincolnb@uct2-int ~]$ rpm -q xrootd-client >>>>>>>>>> xrootd-client-3.3.2-1.el5 >>>>>>>>>> >>>>>>>>>> --Lincoln >>>>>>>>>> >>>>>>>>>> On May 23, 2013, at 9:36 AM, Ilija Vukotic wrote: >>>>>>>>>> >>>>>>>>>>> Hi, >>>>>>>>>>> >>>>>>>>>>> I added Wei and Lincoln to this mail so they can try too. >>>>>>>>>>> >>>>>>>>>>> I've just retried with both xrdcp and xrdcopy in a virgin shell on lxplus and still get the same behavior: >>>>>>>>>>> >>>>>>>>>>> ~ >xrdcopy --streams 1 root://f01-140-115- >>>> e.gridka.de:1094//atlas/dq2/user/ilijav/HCtest/user.ilijav.HCtest.1/group.test.hc.NTUP_SMWZ.root - > /dev/null >>>>>>>>>>> [ 98%][=================================================>] [752M/760M] >>>>>>>>>>> Run: [ERROR] Server responded with an error: [3011] Unable to read >>>> /atlas/dq2/user/ilijav/HCtest/user.ilijav.HCtest.1/group.test.hc.NTUP_SMWZ.root; no such file or directory >>>>>>>>>>> >>>>>>>>>>> and >>>>>>>>>>> >>>>>>>>>>> xrdcp --streams 1 root://f01-140-115- >>>> e.gridka.de:1094//atlas/dq2/user/ilijav/HCtest/user.ilijav.HCtest.1/group.test.hc.NTUP_SMWZ.root - > /dev/null >>>>>>>>>>> >>>>>>>>>>> Cheers, >>>>>>>>>>> Ilija >>>>>>>>>>> >>>>>>>>>>> -- >>>>>>>>>>> Nullius in verba >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> On May 23, 2013, at 2:07 , Lukasz Janyst <[log in to unmask]> wrote: >>>>>>>>>>> >>>>>>>>>>>> I get this error with both clients. >>>>>>>>>>>> >>>>>>>>>>>> Lukasz >>>>>>>>>>>> >>>>>>>>>>>> On 23.05.2013 09:05, Lukasz Janyst wrote: >>>>>>>>>>>>> Hi Ilija, >>>>>>>>>>>>> >>>>>>>>>>>>> I constantly get: >>>>>>>>>>>>> >>>>>>>>>>>>> ]==> xrdcopy --streams 1 >>>>>>>>>>>>> root://f01-140-115- >> e.gridka.de:1094//atlas/dq2/user/ilijav/HCtest/user.ilijav.HCtest.1/group.test.hc.NTUP_SMWZ.root >>>>>>>>>>>>> - > /dev/null >>>>>>>>>>>>> [100%][==================================================] [0/0] >>>>>>>>>>>>> Run: [ERROR] Server responded with an error: [3005] Unable to open >>>>>>>>>>>>> /atlas/dq2/user/ilijav/HCtest/user.ilijav.HCtest.1/group.test.hc.NTUP_SMWZ.root; >>>>>>>>>>>>> operation canceled >>>>>>>>>>>>> >>>>>>>>>>>>> Are you sure the server config is OK? >>>>>>>>>>>>> >>>>>>>>>>>>> Lukasz >>>>>>>>>>>>> >>>>>>>>>>>>> On 23.05.2013 02:20, Ilija Vukotic wrote: >>>>>>>>>>>>>> Hi Lukasz, >>>>>>>>>>>>>> >>>>>>>>>>>>>> when doing this: >>>>>>>>>>>>>> xrdcopy --streams 1 >>>>>>>>>>>>>> root://f01-140-115- >> e.gridka.de:1094//atlas/dq2/user/ilijav/HCtest/user.ilijav.HCtest.1/group.test.hc.NTUP_SMWZ.root >>>>>>>>>>>>>> - > /dev/null >>>>>>>>>>>>>> from lxplus it always (!) fails between 94% and 98% in one of the two >>>>>>>>>>>>>> ways. >>>>>>>>>>>>>> >>>>>>>>>>>>>> [ 96%][================================================> ] [736M/760M] >>>>>>>>>>>>>> Run: [ERROR] Server responded with an error: [3011] Unable to read >>>>>>>>>>>>>> /atlas/dq2/user/ilijav/HCtest/user.ilijav.HCtest.1/group.test.hc.NTUP_SMWZ.root; >>>>>>>>>>>>>> no such file or directory >>>>>>>>>>>>>> [ 98%][=================================================>] [748M/760M] >>>>>>>>>>>>>> Run: [ERROR] Internal error >>>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>>> at the same time standard xrdcp works as expected. >>>>>>>>>>>>>> >>>>>>>>>>>>>> Security is off at that server so please feel free to try and replicate. >>>>>>>>>>>>>> >>>>>>>>>>>>>> Cheers, >>>>>>>>>>>>>> Ilija >>>>>>>>>>>>>> >>>>>>>>>>>>>> -- >>>>>>>>>>>>>> Nullius in verba >>>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>>> On May 22, 2013, at 10:47 , Ilija Vukotic <[log in to unmask]> wrote: >>>>>>>>>>>>>> >>>>>>>>>>>>>>> Hi, >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> I was using standard xrdcp you get at lxplus (which is 3.3.2) >>>>>>>>>>>>>>> Now I'll repeat it all with xrdcopy. Will let you know what I measure. >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> Thanks, >>>>>>>>>>>>>>> Ilija >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> -- >>>>>>>>>>>>>>> Nullius in verba >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> On May 22, 2013, at 8:32 , Lukasz Janyst <[log in to unmask]> >>>>>>>>>>>>>>> wrote: >>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> Andreas has now done some testing as well: >>>>>>>>>>>>>>>> https://github.com/xrootd/xrootd/issues/20 >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> Lukasz >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> On 22.05.2013 14:34, Lukasz Janyst wrote: >>>>>>>>>>>>>>>>> Hi Ilija, >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> which version of xrootd are you using? Would it be possible for you >>>>>>>>>>>>>>>>> to try the new client (xrdcopy) instead of the old one (xrdcp)? At >>>>>>>>>>>>>>>>> least >>>>>>>>>>>>>>>>> from version 3.3.2? We were able to get around 750MB/s with just one >>>>>>>>>>>>>>>>> client on 10GB link. >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> The default for the old client is indeed one stream, but if you >>>>>>>>>>>>>>>>> don't disable the cache completely it will send many requests >>>>>>>>>>>>>>>>> synchronously. >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> Cheers, >>>>>>>>>>>>>>>>> Lukasz >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> On 21.05.2013 09:00, Ilija Vukotic wrote: >>>>>>>>>>>>>>>>>> Hi Lukasz, >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> I was trying to find out why xrdcp is not using the full available >>>>>>>>>>>>>>>>>> bandwidth at one 10Gb WAN link and so was playing with parameters >>>>>>>>>>>>>>>>>> like >>>>>>>>>>>>>>>>>> number of streams, size of ReadCache etc. >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> What I don't understand is that whatever value I set for >>>>>>>>>>>>>>>>>> ReadCacheSize I >>>>>>>>>>>>>>>>>> get performance roughly 10 times worse than if I set nothing. >>>>>>>>>>>>>>>>>> Strange thing here is that performance is 10x worse even when I >>>>>>>>>>>>>>>>>> set it >>>>>>>>>>>>>>>>>> to it's default value: 4000000. >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> Also one more thing is puzzling to me: I have a way to intercept >>>>>>>>>>>>>>>>>> messages exchanged between xrootd server and client. Looking in >>>>>>>>>>>>>>>>>> them I >>>>>>>>>>>>>>>>>> see that default behavior of xrdcp is to at the start request 20 >>>>>>>>>>>>>>>>>> chunks >>>>>>>>>>>>>>>>>> of 4MB each. Documentation claims that the default is one stream? >>>>>>>>>>>>>>>>>> Could you explain this to me or point me to some documentation I >>>>>>>>>>>>>>>>>> could >>>>>>>>>>>>>>>>>> read? >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> Thanks a lot, >>>>>>>>>>>>>>>>>> Ilija >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> -- >>>>>>>>>>>>>>>>>> Dr Ilija Vukotic [log in to unmask] >>>>>>>>>>>>>>>>>> University of Chicagohttp://www.vukotic.me >>>>>>>>>>>>>>>>>> 5620 S Ellis Ave Tel: +1-773-702-7475 >>>>>>>>>>>>>>>>>> Chicago IL 60637, USA >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>> >>>>>>>>> >>>>>>>> >>>>>>> >>>>>>> >>>>>>> -- >>>>>>> Karlsruhe Institute of Technology (KIT) >>>>>>> Steinbuch Centre for Computing (SCC) >>>>>>> >>>>>>> Andreas Petzold >>>>>>> >>>>>>> Hermann-von-Helmholtz-Platz 1, Building 449, Room 202 >>>>>>> D-76344 Eggenstein-Leopoldshafen >>>>>>> >>>>>>> Tel: +49 721 608 24916 >>>>>>> Fax: +49 721 608 24972 >>>>>>> Email: [log in to unmask] >>>>>>> www.scc.kit.edu <http://www.scc.kit.edu> >>>>>>> >>>>>>> KIT – University of the State of Baden-Wuerttemberg and >>>>>>> National Research Center of the Helmholtz Association >>>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> Karlsruhe Institute of Technology (KIT) >>>>>> Steinbuch Centre for Computing (SCC) >>>>>> >>>>>> Andreas Petzold >>>>>> >>>>>> Hermann-von-Helmholtz-Platz 1, Building 449, Room 202 >>>>>> D-76344 Eggenstein-Leopoldshafen >>>>>> >>>>>> Tel: +49 721 608 24916 >>>>>> Fax: +49 721 608 24972 >>>>>> Email: [log in to unmask] >>>>>> www.scc.kit.edu >>>>>> >>>>>> KIT – University of the State of Baden-Wuerttemberg and >>>>>> National Research Center of the Helmholtz Association >>>>>> >>>>> >>> > ######################################################################## Use REPLY-ALL to reply to list To unsubscribe from the XROOTD-DEV list, click the following link: https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1