It seems readthedocs.io doesn't understand HTTP Range requests. Below I ask for the first 13 bytes, but I get 110K bytes:

$ curl -o /dev/null -H 'Range: bytes=0-12' https://cvmfs.readthedocs.io/en/stable/_images/xcache2.svg 
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100  110k    0  110k    0     0   566k      0 --:--:-- --:--:-- --:--:--  567k
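
In case it is useful, here is a minimal Python sketch of the same check. It is not part of XRootD; the names `classify_range_response` and `probe` are made up for illustration. It only assumes standard HTTP semantics: a server that honors a Range header answers 206 with a Content-Range header, while a server that ignores it answers 200 with the full body.

```python
import urllib.error
import urllib.request


def classify_range_response(status, headers):
    """Classify how a server reacted to a request that carried a Range header.

    `headers` is a plain mapping of header names to values; names are
    lower-cased here so the lookup is case-insensitive.
    """
    lowered = {name.lower(): value for name, value in headers.items()}
    if status == 206 and "content-range" in lowered:
        return "honored"        # partial content, as requested
    if status == 200:
        return "ignored"        # full body sent, Range header disregarded
    if status == 416:
        return "unsatisfiable"  # requested range lies outside the resource
    return "unexpected"


def probe(url, first=0, last=12):
    """Send one ranged GET and classify the answer (needs network access)."""
    request = urllib.request.Request(
        url, headers={"Range": f"bytes={first}-{last}"}
    )
    try:
        with urllib.request.urlopen(request) as response:
            return classify_range_response(
                response.status, dict(response.headers.items())
            )
    except urllib.error.HTTPError as error:
        # urlopen raises for 4xx/5xx answers such as 416; classify those too.
        return classify_range_response(error.code, dict(error.headers.items()))
```

Given the curl result above, I would expect `probe()` on the readthedocs URL to report "ignored", but I have not run it against that server.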

As a test, if you have the latest Xcache, including the latest XrdClHttp, can you use the following URL instead (note the CGI)?

https://cvmfs.readthedocs.io/en/stable/_images/xcache2.svg?xrddclhttp_avoidrange

You may or may not need XcacheH - I forgot.

regards,
--
Wei Yang  |  [log in to unmask]  |  650-926-3338(O)

-----Original Message-----
From: <[log in to unmask]> on behalf of "Yang, Wei" <[log in to unmask]>
Date: Tuesday, May 11, 2021 at 1:43 AM
To: Bertrand RIGAUD <[log in to unmask]>
Cc: xrootd-l <[log in to unmask]>
Subject: Re: http ingest using xcache fails

    From my past experience:
    
    Xcache fetches data block by block. This translates to HTTP range requests in XrdCl::Http. We have seen examples where web servers don't like the HTTP Range header and ignore it. For this reason, we built something into XcacheH to handle this situation. But web servers can behave in other weird ways. I think we have run into a similar issue before (maybe not so similar - we don't know), and I don't know whether we ever found out if the problem was solved or not.
    
    regards,
    --
    Wei Yang  |  [log in to unmask]  |  650-926-3338(O)
    
    -----Original Message-----
    From: Bertrand RIGAUD <[log in to unmask]>
    Date: Monday, May 10, 2021 at 3:07 AM
    To: Wei Yang <[log in to unmask]>
    Cc: xrootd-l <[log in to unmask]>
    Subject: Re: http ingest using xcache fails
    
        Hello,
        
        Well, I don't think XcacheH is related to this issue. The behaviour is the same whether or not XcacheH is activated.
        
        In the xcache data folder, I have the file, but its size is just under the full size (the last chunk is missing).
        How much is missing depends on the pfc.blocksize I choose.
        
        Let's say I have a 10MB file and pfc.blocksize set to 4MB: my downloaded file (in the xcache data folder and, by extension, on the client machine) will be 8MB. In the xcache logs, the first two 4MB chunks are OK, but the last chunk fails, and the log says that the rest (2MB) can't be downloaded because it cannot be read.
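
The block arithmetic in this example can be sketched in a few lines of Python. This is only an illustration, not how Xcache or XrdClHttp is implemented: the helper names `block_ranges` and `range_header` are hypothetical, and it assumes that "4MB" means 4 * 1024 * 1024 bytes, which is consistent with the block offsets in the logs of the original report.

```python
MIB = 1024 * 1024


def block_ranges(file_size, block_size):
    """Yield (index, first_byte, last_byte) for every block of a file.

    The last block is shorter than block_size whenever file_size is not
    an exact multiple of it.
    """
    index = 0
    offset = 0
    while offset < file_size:
        last_byte = min(offset + block_size, file_size) - 1
        yield index, offset, last_byte
        index += 1
        offset += block_size


def range_header(first_byte, last_byte):
    """Format the value of an HTTP Range header for an inclusive byte span."""
    return f"bytes={first_byte}-{last_byte}"


# The 10MB file with 4MB blocks from the example: two full blocks, one short one.
for index, first_byte, last_byte in block_ranges(10 * MIB, 4 * MIB):
    print(index, range_header(first_byte, last_byte), last_byte - first_byte + 1)
```

Applied to the 506173152-byte file from the original report, the same arithmetic gives a last block with index 120 at offset 503316480 and a length of 2856672 bytes. Those are the block index and offset that fail in the xcache logs, and the byte count that curl reports as remaining.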
        
        [2021-05-10 10:30:18.127994 +0200][Error  ][XrdClHttp         ] Could not read URL: http://u23@source_server:8000//path/to/the/file?, error: [ERROR] Internal error: no such device or address: Result Invalid Read in request after 3 attempts
        
        
        
        I performed another test:
        
        I downloaded the file from the source server using curl and got the complete file (the same number of bytes as reported in the "Content-Length" header).
        I copied this file to a simple VM at our site and exposed it through a basic http server (python -m SimpleHTTPServer 80).
        I downloaded the entire file through xcache! No error message in the xcache logs; the file is fully downloaded in the xcache data folder and on the client machine.
        
        So,
        
        there is no problem with the file itself
        there is no problem when downloading from the source server using curl or wget
        there is no problem between xcache and my basic http server
        but there is a problem between xcache and the source server
        
        What in the communication between xcache and the source server could prevent the last chunk of a file from being read?
        
        regards,
        
        Bertrand Rigaud
        
        Centre de Calcul de l'IN2P3 - CNRS
        21 avenue Pierre de Coubertin
        69627 Villeurbanne CEDEX
        Tél : 04.78.93.08.80
        
        ----- Original Message -----
        From: "Yang, Wei" <[log in to unmask]>
        To: "Bertrand RIGAUD" <[log in to unmask]>, "xrootd-l" <[log in to unmask]>
        Sent: Saturday, May 8, 2021 00:46:50
        Subject: Re: http ingest using xcache fails
        
        Hmm, I think XcacheH is still more or less experimental. That said, it should still work. Can you go inside the cache directory, find a file named http/path/to/my/file, and see whether that file is fully cached (size, checksum)?
        
        regards,
        --
        Wei Yang  |  [log in to unmask]  |  650-926-3338(O)
        
        -----Original Message-----
        From: <[log in to unmask]> on behalf of Bertrand RIGAUD <[log in to unmask]>
        Date: Thursday, May 6, 2021 at 4:47 AM
        To: <[log in to unmask]>
        Subject: http ingest using xcache fails
        
            Hi,
            
            Trying to deploy this architecture (https://cvmfs.readthedocs.io/en/stable/_images/xcache2.svg), I'm facing a problem when downloading a file through http.
            
            Everything works well until the last chunk is downloaded.
            
            As an example, here is a simple curl performed from the client machine (the behaviour is the same with the cvmfs client):
            
            ### Through xcache server ###
            
            $ curl -v -o file1  http://my_xcache_server:1094//http://path/to/my/file
              % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                             Dload  Upload   Total   Spent    Left  Speed
              0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0* About to connect() to my_xcache_server port 1094 (#0)
            *   Trying xxx.xxx.xxx.xxx...
            * Connected to my_xcache_server (xxx.xxx.xxx.xxx) port 1094 (#0)
            > GET //http://path/to/my/file HTTP/1.1
            > User-Agent: curl/7.29.0
            > Host: my_xcache_server:1094
            > Accept: */*
            > 
            < HTTP/1.1 200 OK
            < Connection: Keep-Alive
            < Content-Length: 506173152
            < 
            { [data not shown]
             99  482M   99  480M    0     0  57.9M      0  0:00:08  0:00:08 --:--:-- 29.9M* transfer closed with 2856672 bytes remaining to read
             99  482M   99  480M    0     0  57.7M      0  0:00:08  0:00:08 --:--:-- 11.5M
            * Closing connection 0
            curl: (18) transfer closed with 2856672 bytes remaining to read
            
            
            ### Direct download ###
            
            curl -v -o file2 http://path/to/my/file
              % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                             Dload  Upload   Total   Spent    Left  Speed
              0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0* About to connect() to source_server port 8000 (#0)
            *   Trying 145.146.100.30...
            * Connected to source_server (xxx.xxx.xxx.xxx) port 8000 (#0)
            > GET path/to/my/file HTTP/1.1
            > User-Agent: curl/7.29.0
            > Host: source_server:8000
            > Accept: */*
            > 
            < HTTP/1.1 200 OK
            < Connection: Keep-Alive
            < Content-Length: 506173152
            < 
            { [data not shown]
            100  482M  100  482M    0     0   101M      0  0:00:04  0:00:04 --:--:--  101M
            * Connection #0 to host source_server left intact
            
            ### Xcache server logs (last chunk) ###
            
            210506 08:53:05 155627 sysXrdHttp:  XrdHttpReq::Data! final=1
            210506 08:53:05 155627 unknown.2:23@clientmachine sysXrdHttp: PostProcessHTTPReq req: 2 reqstate: 481
            210506 08:53:05 155627 unknown.2:23@clientmachine sysXrdHttp: Got data vectors to send:1
            210506 08:53:05 155627 sysXrdHttp: Sending 1048576 bytes
            210506 08:53:05 155627 unknown.2:23@clientmachine sysXrdHttp:  Process. lp:0 reqstate: 481
            210506 08:53:05 155627 unknown.2:23@clientmachine sysXrdHttp: Process is exiting rc:0
            210506 08:53:05 155627 XrdPfc_IO: dump Read() 0x7fc1d00019f0 off: 503316480 size: 1048576 http:[log in to unmask]:8000/http/source.server:8000/path/to/file?
            210506 08:53:05 155627 XrdPfc_File: dump Read() idx 120 http/source.server:8000/path/to/file
            210506 08:53:05 155627 XrdPfc_File: dump inc_ref_count 0x7fc1a0403530 refcnt  1 http/source.server:8000/path/to/file
            210506 08:53:05 155627 XrdPfc_File: dump Read() 0x7fc1f0028000inc_ref_count for existing block 0x7fc1a0403530 idx = 120 http/source.server:8000/path/to/file
            XcacheH: stagein list snapshot: available workers: 10, list length: 0
            [2021-05-06 08:53:09.092614 +0200][Error  ][XrdClHttp         ] Could not read URL: http:[log in to unmask]:8000//path/to/file?, error: [ERROR] Internal error: no such device or address: Result Invalid Read in request after 3 attempts
            210506 08:53:09 155645 XrdPfc_File: dump Prefetch enter to check download status http/source.server:8000/path/to/file
            210506 08:53:09 155645 XrdPfc_File: debug Prefetch file is complete, stopping prefetch. http/source.server:8000/path/to/file
            210506 08:53:09 155626 XrdPfc_File: dump ProcessBlockResponse block=0x7fc1a0403530, idx=120, off=503316480, res=-6 http/source.server:8000/path/to/file
            210506 08:53:09 155626 XrdPfc_File: debug ProcessBlockResponse after failed prefetch on io 0x7fc1d00019f0 disabling prefetching on this io. http/source.server:8000/path/to/file
            210506 08:53:09 155626 XrdPfc_File: error ProcessBlockResponse block 0x7fc1a0403530, idx=120, off=503316480 error=-6 http/source.server:8000/path/to/file
            210506 08:53:09 155627 XrdPfc_File: dump Read() requested block finished 0x7fc1a0403530, is_failed()=True http/source.server:8000/path/to/file
            210506 08:53:09 155627 XrdPfc_File: error Read() io 0x7fc1d00019f0, block 120 finished with error 6 no such device or address http/source.server:8000/path/to/file
            210506 08:53:09 155627 XrdPfc_File: dump Read() dec_ref_count 0x7fc1a0403530 idx = 120 http/source.server:8000/path/to/file
            210506 08:53:09 155627 XrdPfc_File: dump free_block block 0x7fc1a0403530  idx =  120 http/source.server:8000/path/to/file
            210506 08:53:09 155627 XrdPfc_IO: warning Read() error in File::Read(), exit status=-6, error=no such device or address http:[log in to unmask]:8000/http/source.server:8000/path/to/file?
            210506 08:53:09 155627 ofs_read: unknown.2:23@clientmachine Unable to read /http:/source.server:8000/path/to/file; no such device or address
            210506 08:53:09 155627 sysXrdHttp:  XrdHttpReq::Error
            210506 08:53:09 155627 unknown.2:23@clientmachine sysXrdHttp: PostProcessHTTPReq req: 2 reqstate: 482
            210506 08:53:09 155627 unknown.2:23@clientmachine sysXrdHttp: PostProcessHTTPReq mapping Xrd error [3005] to status code [500]
            210506 08:53:09 155627 unknown.2:23@clientmachine sysXrdHttp: Stopping request because more data is expected but no data has been read.
            210506 08:53:09 155627 sysXrdHttp:  XrdHttpReq request ended.
            210506 08:53:09 155627 sysXrdHttp:  Cleanup
            210506 08:53:09 155627 sysXrdHttp:  Reset
            210506 08:53:09 155627 sysXrdHttp:  XrdHttpReq request ended.
            210506 08:53:09 155627 XrootdXeq: unknown.2:23@clientmachine disc 0:00:09 (send failure)
            210506 08:53:09 155627 XrdPfc_File: debug ioActive start for io 0x7fc1d00019f0 http/path/to/file
            210506 08:53:09 155627 XrdPfc_File: info ioActive for io 0x7fc1d00019f0, active_prefetches 0, allow_prefetching False, ioactive_false_reported False, ios_in_detach 0
            210506 08:53:09 155627 XrdPfc_File: info 	io_map.size() 1, block_map.size() 0, file http/path/to/file
            210506 08:53:09 155627 XrdPfc_File: info ioActive for io 0x7fc1d00019f0 returning False, file http/path/to/file
            210506 08:53:09 155627 XrdPfc_IO: info DetachFinalize() 0x7fc1d00019f0
            210506 08:53:09 155627 XrdPfc_Cache: debug ReleaseFile http/path/to/file, io 0x7fc1d00019f0
            210506 08:53:09 155627 XrdPfc_File: debug RemoveIO() io = 0x7fc1d00019f0 http/path/to/file
            210506 08:53:09 155627 XrdPfc_Cache: debug dec_ref_cnt http/path/to/file, cnt at entry = 1
            210506 08:53:09 155627 XrdPfc_File: debug FinalizeSyncBeforeExit requesting sync to write detach stats http/path/to/file
            210506 08:53:09 155627 XrdPfc_Cache: debug dec_ref_cnt http/path/to/file, scheduling final sync
            210506 08:53:09 155627 XrdPfc_IO: debug ~IOEntireFile() 0x7fc1d00019f0 http:[log in to unmask]:8000/http/path/to/file?
            210506 08:53:09 155627 Posix_PrepIODisable: Disabling defered open http:[log in to unmask]:8000//path/to/file?
            210506 08:53:09 160872 XrdPfc_File: dump Sync() http/path/to/file
            210506 08:53:09 160872 XrdPfc_File: dump Sync 0 blocks written during sync http/path/to/file
            210506 08:53:09 160872 XrdPfc_Cache: debug dec_ref_cnt http/path/to/file, cnt at entry = 1
            210506 08:53:09 160872 XrdPfc_File: debug FinalizeSyncBeforeExit sync not required http/path/to/file
            210506 08:53:09 160872 XrdPfc_Cache: debug dec_ref_cnt http/path/to/file, cnt after sync_check and dec_ref_cnt = 0
            210506 08:53:09 160872 XrdPfc_File: debug ~File() close info  http/path/to/file
            210506 08:53:09 160872 XrdPfc_File: debug ~File() close output   http/path/to/file
            210506 08:53:09 160872 XrdPfc_File: debug ~File() ended, prefetch score = 3.95238 http/path/to/file
            210506 08:53:11 155631 Posix_DDestroy: DLY destory of 1 objects; 0 already lost.
            210506 08:53:11 155631 Posix_DDestroy: DLY destory end; 0 objects deferred and 0 lost.
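
To find which blocks failed in a long log like the one above, a small filter can help. The following Python sketch is not an XRootD tool; the function name `failed_blocks` is hypothetical, and the regular expression is written only against the "XrdPfc_File: error ProcessBlockResponse" line format shown in this log (xrootd 5.1.1), so other versions may need a different pattern.

```python
import re

# Matches lines such as:
#   ... XrdPfc_File: error ProcessBlockResponse block 0x7fc1a0403530, idx=120, off=503316480 error=-6 ...
FAILED_BLOCK = re.compile(
    r"XrdPfc_File: error ProcessBlockResponse block \S+ "
    r"idx=(?P<idx>\d+), off=(?P<off>\d+) error=(?P<err>-?\d+)"
)


def failed_blocks(log_lines):
    """Return (block index, byte offset, error code) for each failed block."""
    failures = []
    for line in log_lines:
        match = FAILED_BLOCK.search(line)
        if match:
            failures.append(
                (int(match["idx"]), int(match["off"]), int(match["err"]))
            )
    return failures


sample = [
    "210506 08:53:09 155626 XrdPfc_File: dump ProcessBlockResponse block=0x7fc1a0403530, idx=120, off=503316480, res=-6 http/source.server:8000/path/to/file",
    "210506 08:53:09 155626 XrdPfc_File: error ProcessBlockResponse block 0x7fc1a0403530, idx=120, off=503316480 error=-6 http/source.server:8000/path/to/file",
]
print(failed_blocks(sample))
```

Only the "error" line is matched; the "dump" line for the same block uses a different layout and is skipped on purpose.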
            
            ### Xcache config ###
            
            all.role     proxy server
            
            all.export /http:/
            all.export /https:/
            
            ofs.osslib      libXrdPss.so
            
            xrootd.seclib /usr/lib64/libXrdSec.so
            
            pss.origin =http,https
            
            # US data servers
            pss.permit      source.server
            pss.permit      source.server
            pss.permit      source.server
            pss.permit      source.server
            pss.permit      source.server
            pss.permit      source.server
            pss.permit      source.server
            # EU data servers
            pss.permit      source.server
            pss.permit      source.server
            pss.permit      source.server
            
            pss.cachelib    libXrdPfc.so
            pss.config streams 8
            
            # XcacheH
            pss.namelib -lfncachesrc+ /usr/lib64/XrdName2NameXcacheH.so cacheLife=1d cacheBlockSize=4m
            pss.ccmlib /usr/lib64/XrdName2NameXcacheH.so
            
            oss.localroot   /xcache/ns
            
            # Metadata directories (cinfo files)
            oss.space meta /xcache/meta
            
            # Data directories
            oss.space data /xcache/data
            
            # Xcache spaces assignement
            pfc.spaces data meta
            
            if exec xrootd
              xrd.protocol http:1094 libXrdHttp.so
            fi
            
            pfc.diskusage 0.90 0.95
            
            pfc.ram 6g
            pfc.blocksize 4M
            pfc.prefetch 32
            
            pfc.trace dump
            http.trace   all
            pss.trace  all
            pss.debug
            
            ### xrootd version: 5.1.1 ###
            
            As a result, I get an almost complete file on the client side, and the data cached on the xcache server is also almost complete. Only the last chunk is missing.
            
            Are there directives missing in the config file?
            
            Thank you,
            
            Bertrand Rigaud
            
            Centre de Calcul de l'IN2P3 - CNRS
            21 avenue Pierre de Coubertin
            69627 Villeurbanne CEDEX
            Tél : 04.78.93.08.80
            
            ########################################################################
            Use REPLY-ALL to reply to list
            
            To unsubscribe from the XROOTD-L list, click the following link:
            https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1
    
    
    

