Hi Daniel -
Okay, I have qserv working in single-node mode, but have not had success running multiple workers. Recall you helped me setup a pre-qms, pre-xrootd_qsPatch4 instance of qserv that used a manager and two xrootd workers. I need this working with the head of the master in order to simulate fault recovery.
Nothing stands out to me in the log files immediately after starting mysql, two xrootd workers, xrootd manager, the corresponding cmsd processes, qms, and qserv. Upon running a simple query (after waiting several minutes for a response) I receive the following error:
mysql> select count(*) from Object;
ERROR 4120 (Proxy): Error during execution: 'open failed for chunk(s): 3360 3718 3478 3841 3480 3361 3721 3720 3601 3838 3840 3241 3598 3600 3481 3240'
That is, all chunkQueries fail. Within the log file for the xrootd manager, I see entries like:
130904 19:13:50 22220 qsmaster.22164:[log in to unmask] XrootdProtocol: 0600 req=3010 dlen=12
130904 19:13:50 22220 qsmaster.22164:[log in to unmask] XrootdProtocol: 0600 open ut /q/LSST/3360
130904 19:13:50 22220 qsmaster.22164:[log in to unmask] ofs_open: 2-600 fn=/q/LSST/3360
130904 19:13:50 22099 Receive lsst-dev03 48 bytes on 16383
130904 19:13:50 22099 Decode lsst-dev03 gave qsmaster.22164:[log in to unmask] err -2 'No servers are available to write the file.' /q/LSST/3360
130904 19:13:50 22220 qsmaster.22164:[log in to unmask] XrootdResponse: 0600 sending err 3011: No servers are available to write the file.
Are you aware of changes in xrootd patch4 that might impact a multi-node setup like mine? Any thoughts?
(Btw, I did have to create two new databases: qservw_worker1 and qservw_worker2, each containing the Dbs table with a single entry 'LSST' so that the xrootd workers would successfully initialize.)
Thanks,
Bill
On Sep 4, 2013, at 5:02 PM, "Daniel L. Wang" <[log in to unmask]> wrote:
> On 09/04/2013 04:56 PM, Bill Chickering wrote:
>> Hi Daniel -
>> Looks like my most recent issue in setting up qserv with your data was indeed the emptyChunks.txt file. Having copied your version emptyChunks.txt, qserv appears to be working properly. As a sanity check, I receive the following count of Object records:
>> mysql> select count(*) from Object;
>> +----------------+
>> | SUM(QS1_COUNT) |
>> +----------------+
>> | 3738244 |
>> +----------------+
>> Does this agree with your qserv instance?
> Yes, that looks right.
>
> -Daniel
>
########################################################################
Use REPLY-ALL to reply to list
To unsubscribe from the QSERV-L list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=QSERV-L&A=1
|