On 03/10/2017 04:43 PM, Andrew Hanushevsky wrote: > Hi Patrick, > > On Fri, 10 Mar 2017, Patrick McGuigan wrote: > >> We normally mount four partitions as /xrd_cache_[1-4]. This dataserver was >> incorrectly setup and /xrd_cache_1 and /xrd_cache_4 are mounting the same device! > OK, that just means the partition for cache_1 and cache_4 were used twice as > often than would be expected. > >> I think I need to turn off xrootd, unmount the disks, clean up /etc/fstab, >> remount the disks, modify the links under /xrd to replace references to >> /xrd_cache_4 with /xrd_cache_1 and then restart xrootd. > Indeed the symlink needs to change all 4's need to become 1's. > >> Is the naming structure guaranteed to be the same under different partitions >> to the name of the file?: >> >> for instance: >> /xrd/datadisk/some/path is a link to /xrd_cache_4/ATLASDATADISK/xx/HASH% >> >> >> If the file should be on /xrd_cache_1, would xx and HASH% be same regardless >> of partition? > Well, you are in luck here because the cache file paths were identical in > length. So, you should be able to simply change the symlink and not bother with > anything in the cache. >> Good enough. I am glad that this is the case. >> >> I am debating about allowing some access (read, maybe delete) to xrootd while >> I am renaming links since this would allow the correct two partitions to operate. > I don't immediately see how. Once you separate the caches, any reference to a > cache_4 file will fail because cache_4 is now empty, right? As I see it, all > files in cache_4 also exist in cache_1. So, you should be able to rename the > links to point to their Cache_1 image. Then fix up the mount point after that's > done. You can allow read access during this. If you also want to have write > access then remove cache_4 from the list of available spaces (i.e. specify > cache_1, 2, and 3 explicitly -- don't use an asterisk. > I was planning on using a read only (or read +delete, but no write) so that normal access will available for cache_2 and cache_3. The cache_1 space should also be in good shape from the perspective of the exported path. I need to make some backup listings of things so that I have good picture of where everything is right now. > Make sense? > So far. > Andy > >> >> >> Patrick >> >> On 03/10/2017 02:48 PM, Andrew Hanushevsky wrote: >>> Hi Patrick, >>> >>> Hmm, well audit space does check that the files are actually linked via the name >>> space. So, if it tells you all of tehm are reachable then they should be. On the >>> other hand it also said, if I am not mistaken, that >>> the space had no files in it (which I find hard to beleive). So, something else >>> must be going on. You can rerrun audit space but specify -d (debug) on the >>> command line to see what it actually is doing. >>> >>> Andy >>> >>> >>> On Fri, 10 Mar 2017, Yang, Wei wrote: >>> >>>> HI Patrick, >>>> >>>> I think the best way is to rebuild the symlinks in name space (/xrd). For each >>>> file in /xrd_cache_*, you can find what its symlinks should be by doing >>>> something like this: >>>> >>>> $ getfattr -d >>>> //atlas/xrdcache/sdb/ATLASDATADISK/22/CB0C6157C00C00000000ac172c8922E9010014D% >>>> getfattr: Removing leading '/' from absolute path names >>>> # file: >>>> atlas/xrdcache/sdb/ATLASDATADISK/22/CB0C6157C00C00000000ac172c8922E9010014D% >>>> user.XrdCks.adler32=0sYWRsZXIzMgAAAAAAAAAAAAAAAABXRH7/ADNKMQAAAATRL9rKAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA >>>> >>>> >>>> user.XrdFrm.Pfn="/atlas/xrootd/atlasdatadisk/rucio/mc15_5TeV/97/7d/log.08435138._004167.job.log.tgz.1" >>>> >>>> >>>> >>>> After you rebuild the symlinks in name space, you can then decide which files >>>> are not needed (or rely on the RUCIO team to delete for you). >>>> >>>> regards, >>>> -- >>>> Wei Yang | [log in to unmask] | 650-926-3338 >>>> >>>> ________________________________________ >>>> From: [log in to unmask] <[log in to unmask]> on behalf of >>>> Patrick McGuigan <[log in to unmask]> >>>> Sent: Friday, March 10, 2017 12:06 PM >>>> To: xrootd-l >>>> Subject: audit with frm_admin >>>> >>>> Hi, >>>> >>>> I have an odd situation with a dataserver that uses XA space names and how >>>> space >>>> usage is being reported. >>>> >>>> I export an area named /xrd >>>> >>>> We are not using mig or stage options on the export. >>>> >>>> >>>> I have a named space: >>>> oss.space ATLASDATADISK /xrd_cache_* >>>> >>>> >>>> >>>> At the system level, only files written to /xrd/datadisk should be in the >>>> ATLASDATADISK space. >>>> >>>> I have a script that walks the exported directory to discover the files being >>>> stored on the server and record date and size information: >>>> >>>> find -L /xrd -type f -fprintf /tmp/xrd_files '%p %T@ %s\n' >>>> >>>> >>>> If I run 'frm_admin -c /etc/xrootd/xrootd-clustered.cfg audit usage' I get >>>> amounts for claimed and actual bytes used by the various named spaces including >>>> ATLASDATADISK. >>>> >>>> The problem is that the amount claimed in the audit output is 20TB (33%) larger >>>> than what I expect by walking the path /xrd/datadisk. >>>> >>>> It looks like there are files in /xrd_cache_x/ATLASDATDISK/... that do not have >>>> links in /xrd/datadisk. >>>> >>>> What is the correct way to rectify this issue? >>>> >>>> I tried to run >>>> >>>> frm_admin -c /etc/xrootd/xrootd-clustered.cfg audit space ATLASDATADISK >>>> >>>> 0 problems found; 0 fixed. >>>> Space ATLASDATADISK has 0 files with 0 bytes in use (0 unreachable). >>>> >>>> >>>> >>>> Should I run an audit names? >>>> >>>> Should the audit invocation look like: >>>> ... audit name /xrd/datadisk >>>> ? >>>> >>>> >>>> Thanks for any insights, >>>> >>>> Patrick >>>> >>>> ######################################################################## >>>> Use REPLY-ALL to reply to list >>>> >>>> To unsubscribe from the XROOTD-L list, click the following link: >>>> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1 >>>> ######################################################################## >>>> Use REPLY-ALL to reply to list >>>> >>>> To unsubscribe from the XROOTD-L list, click the following link: >>>> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1 >>>> >>> >>> ######################################################################## >>> Use REPLY-ALL to reply to list >>> >>> To unsubscribe from the XROOTD-L list, click the following link: >>> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1 >> > > ######################################################################## > Use REPLY-ALL to reply to list > > To unsubscribe from the XROOTD-L list, click the following link: > https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1 ######################################################################## Use REPLY-ALL to reply to list To unsubscribe from the XROOTD-L list, click the following link: https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1