URL: <http://savannah.cern.ch/bugs/?99674> Summary: odd transcient behaviour on google compute element(gce) storage cluster Project: XROOTD Submitted by: bdouglas Submitted on: 2013-01-07 12:46 Report Type: Bug Priority: 5 - Normal Severity: 3 - Normal Status: None Privacy: Public Assigned to: None Originator Email: Open/Closed: Open Discussion Lock: Any Fixed by commit(s): _______________________________________________________ Details: Hi, I have setup an xrootd storage cluster in the google cloud. (gce) and have seen this odd behaviour. I have successfully copied files into the storage but when I go to locate the files sometimes I see them and some times I do not. For example: root://headnode.c.atlasgce.internal:1094//> dirlist /atlas/local/benjamin/ drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-15 -rw-(048) 104857600 2012-12-22 23:19:36 /atlas/local/benjamin/testfile_100MB -rw-(048) 104857600 2012-12-22 23:19:23 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-05 -rw-(048) 104857600 2012-12-22 23:19:43 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-20 drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-19 -rw-(048) 104857600 2012-12-22 23:19:42 /atlas/local/benjamin/testfile_100MB -rw-(048) 104857600 2012-12-22 23:19:40 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-18 -rw-(048) 104857600 2012-12-22 23:19:39 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-17 -rw-(048) 104857600 2012-12-22 23:19:37 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-16 drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-14 -rw-(048) 104857600 2012-12-22 23:19:35 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-13 -rw-(048) 104857600 2012-12-22 23:19:33 /atlas/local/benjamin/testfile_100MB -rw-(048) 104857600 2012-12-22 23:19:32 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2013-01-07 11:35:09 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208 drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-12 drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-11 drwx(051) 4096 2013-01-07 11:34:25 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208 -rw-(048) 104857600 2012-12-22 23:19:31 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2013-01-07 11:33:34 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208 -rw-(048) 104857600 2012-12-22 23:19:29 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-10 drwx(051) 4096 2013-01-07 11:33:12 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208 -rw-(048) 104857600 2012-12-22 23:19:28 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-09 drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-04 -rw-(048) 104857600 2012-12-22 23:19:21 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2013-01-07 11:32:21 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208 drwx(051) 4096 2013-01-07 11:31:35 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208 drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-07 -rw-(048) 104857600 2012-12-22 23:19:25 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-03 -rw-(048) 104857600 2012-12-22 23:19:20 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2013-01-07 11:31:25 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208 drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-08 -rw-(048) 104857600 2012-12-22 23:19:27 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2013-01-07 11:30:39 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208 drwx(051) 4096 2013-01-07 11:29:53 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208 -rw-(048) 104857600 2012-12-22 23:19:24 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-06 drwx(051) 4096 2013-01-07 11:29:06 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208 -rw-(048) 104857600 2012-12-22 23:19:19 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-02 -rw-(048) 104857600 2012-12-22 23:19:17 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2013-01-07 11:27:23 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208 drwx(051) 4096 2012-12-22 23:11:53 /atlas/local/benjamin/dpb-test-01 -rw-(048) 104857600 2012-12-22 23:19:16 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2013-01-07 05:41:03 /atlas/local/benjamin/ddo.000001.Atlas.Ideal.DBRelease.v210501 drwx(051) 4096 2012-12-22 23:12:12 /atlas/local/benjamin/dpb-test-00-clone drwx(051) 4096 2012-12-22 23:12:01 /atlas/local/benjamin/dpb-test-00 -rw-(048) 104857600 2012-12-22 23:19:14 /atlas/local/benjamin/testfile_100MB -rw-(048) 10 2013-01-07 05:13:32 /atlas/local/benjamin/dpb-apf-00.testfile Yet a short time late (after the help command in xrd) root://headnode.c.atlasgce.internal:1094//> dirlistrec /atlas/local/benjamin/ drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-15 -rw-(048) 104857600 2012-12-22 23:19:36 /atlas/local/benjamin/testfile_100MB -rw-(048) 104857600 2012-12-22 23:19:23 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-05 -rw-(048) 104857600 2012-12-22 23:19:43 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-20 drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-19 -rw-(048) 104857600 2012-12-22 23:19:42 /atlas/local/benjamin/testfile_100MB -rw-(048) 104857600 2012-12-22 23:19:40 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-18 -rw-(048) 104857600 2012-12-22 23:19:39 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-17 -rw-(048) 104857600 2012-12-22 23:19:37 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-16 drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-14 -rw-(048) 104857600 2012-12-22 23:19:35 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-13 -rw-(048) 104857600 2012-12-22 23:19:33 /atlas/local/benjamin/testfile_100MB -rw-(048) 104857600 2012-12-22 23:19:32 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2013-01-07 11:35:09 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208 drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-12 drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-11 -rw-(048) 104857600 2012-12-22 23:19:31 /atlas/local/benjamin/testfile_100MB -rw-(048) 104857600 2012-12-22 23:19:29 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-10 -rw-(048) 104857600 2012-12-22 23:19:28 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-09 drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-04 -rw-(048) 104857600 2012-12-22 23:19:21 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-07 -rw-(048) 104857600 2012-12-22 23:19:25 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-03 -rw-(048) 104857600 2012-12-22 23:19:20 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-08 -rw-(048) 104857600 2012-12-22 23:19:27 /atlas/local/benjamin/testfile_100MB -rw-(048) 104857600 2012-12-22 23:19:24 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-06 -rw-(048) 104857600 2012-12-22 23:19:19 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:13:32 /atlas/local/benjamin/dpb-test-02 -rw-(048) 104857600 2012-12-22 23:19:17 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2012-12-22 23:11:53 /atlas/local/benjamin/dpb-test-01 -rw-(048) 104857600 2012-12-22 23:19:16 /atlas/local/benjamin/testfile_100MB drwx(051) 4096 2013-01-07 05:41:03 /atlas/local/benjamin/ddo.000001.Atlas.Ideal.DBRelease.v210501 drwx(051) 4096 2012-12-22 23:12:12 /atlas/local/benjamin/dpb-test-00-clone drwx(051) 4096 2012-12-22 23:12:01 /atlas/local/benjamin/dpb-test-00 -rw-(048) 104857600 2012-12-22 23:19:14 /atlas/local/benjamin/testfile_100MB -rw-(048) 10 2013-01-07 05:13:32 /atlas/local/benjamin/dpb-apf-00.testfile Error 3011: Unable to open directory /atlas/local/benjamin/dpb-test-00-clone; No such file or directory In server headnode.c.atlasgce.internal:1094 or in some of its child nodes. root://headnode.c.atlasgce.internal:1094//> locateall /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m120/ No matching files were found. root://headnode.c.atlasgce.internal:1094//> exit Goodbye. [benjamin@dpb-apf-00 d3pd_testjob]$ xrd headnode.c.atlasgce.internal locateall /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m120/ No matching files were found. Now the files are not found? A short time later - [benjamin@dpb-apf-00 d3pd_testjob]$ xrd headnode.c.atlasgce.internal (C) 2004-2010 by the Xrootd group. Xrootd version: v3.2.7 Welcome to the xrootd command line interface. Type 'help' for a list of available commands. root://headnode.c.atlasgce.internal:1094//> dirlist /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208/ Error 3011: Unable to open directory /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208/; No such file or directory In server headnode.c.atlasgce.internal:1094 or in some of its child nodes. -rw-(048) 801094670 2013-01-07 11:35:18 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208._lb0095._0001.1 -rw-(048) 3559362431 2013-01-07 11:35:04 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208._lb0094._0001.1 -rw-(048) 3674852869 2013-01-07 11:34:20 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208._lb0093._0001.1 -rw-(048) 1708802906 2013-01-07 11:33:29 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208._lb0092._0001.1 -rw-(048) 3720590161 2013-01-07 11:33:07 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208._lb0091._0001.1 -rw-(048) 3597669746 2013-01-07 11:32:16 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208._lb0090._0001.1 -rw-(048) 559179199 2013-01-07 11:31:30 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208._lb0089._0001.1 -rw-(048) 3755235005 2013-01-07 11:31:20 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208._lb0088._0001.1 -rw-(048) 3748590483 2013-01-07 11:30:34 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208._lb0087._0001.1 -rw-(048) 3814234452 2013-01-07 11:29:48 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208._lb0086._0001.1 -rw-(048) 3865215588 2013-01-07 11:28:08 /atlas/local/benjamin/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208/data12_8TeV.00208484.physics_Egamma.merge.AOD.f472_m1208._lb0085._0001.1 root://headnode.c.atlasgce.internal:1094//> The files are found. There were no changes to the system. Are there timeouts that I can set to make the system a bit more robust against these transient issues? Thanks, Doug Benjamin _______________________________________________________ Reply to this item at: <http://savannah.cern.ch/bugs/?99674> _______________________________________________ Message sent via/by LCG Savannah http://savannah.cern.ch/ ######################################################################## Use REPLY-ALL to reply to list To unsubscribe from the XROOTD-DEV list, click the following link: https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1