LISTSERV mailing list manager LISTSERV 16.5

Help for QSERV-L Archives


QSERV-L Archives

QSERV-L Archives


QSERV-L@LISTSERV.SLAC.STANFORD.EDU


View:

Message:

[

First

|

Previous

|

Next

|

Last

]

By Topic:

[

First

|

Previous

|

Next

|

Last

]

By Author:

[

First

|

Previous

|

Next

|

Last

]

Font:

Proportional Font

LISTSERV Archives

LISTSERV Archives

QSERV-L Home

QSERV-L Home

QSERV-L  December 2014

QSERV-L December 2014

Subject:

Re: qservMeta index and multi-column primary keys

From:

Serge Monkewitz <[log in to unmask]>

Reply-To:

General discussion for qserv (LSST prototype baseline catalog)

Date:

Fri, 19 Dec 2014 12:34:15 -0800

Content-Type:

text/plain

Parts/Attachments:

Parts/Attachments

text/plain (52 lines)

Hi Andy,

On Dec 19, 2014, at 12:09 PM, Salnikov, Andrei A. <[log in to unmask]> wrote:

> we just talked with Fabrice about one of the problems that new 
> data loader created for Fabrice and we need to understand how
> to fix it. The essence of it is that data loader tries to create
> index table in qservMeta for every partitioned table. This breaks 
> if when the table's primary key has more than one colum because 
> data loader only supports one-column PK. 

By index, do you mean the PK -> chunkId secondary index? If so, I think we only plan to provide this for director tables. For a table like Source, or the AvgForcedPhotYearly table Fabrice was trying to load, I don’t think such an index is expected. (Daniel, please correct me if I’m wrong).

> To understand how to fix it I'd like to get an answer to few 
> questions:
> - Do we need an index for every partitioned table? If not then 
>  we should add a parameter to config file which disables index
>  generation for specific tables. Or do we need index only for 
>  director table?

I think the answer is only for directors.

> - Do we need an index for tables which have multi-column PK, will
>  qserv even support this? If yes then index table needs to have 
>  the same PK columns as the original table. If not then I can 
>  just skip generating index for those problematic tables.

For now, I believe we should stick with single column (integer) PKs for directors. Supporting multi-column PKs is of course doable, but it would complicate query analysis. We'd have to look for multiple equality predicate parse tree nodes ANDed together (or worse, split over multiple ON clauses) to identify equijoins.

> Related questions:
> - should duplicator support multi-column PK in the future?
>  I guess 'id' option in config file needs to be a list in this 
>  case.

My 2 cents: please no. The duplicator is intended as a stop-gap way to generate lots of data for testing, and is already complicated enough. If I cannot assume that the director and table PKs are 64 bit integers, then generating unique IDs for duplicated records becomes much harder, and in some cases  impossible.

I also don’t think that we get a lot of value out of the effort that would be needed to do this, but others may disagree.

> - what is the official name for the qservMeta index, I think 
>  "secondary index" is mentioned, but this does not make too
>  much sens to me.

That’s a question for Daniel. I think of (chunkId, subChunkId) as the primary way of looking things up in the Qserv system, so it makes sense to me that an index on director PK be “secondary".

Cheers,
Serge
########################################################################
Use REPLY-ALL to reply to list

To unsubscribe from the QSERV-L list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=QSERV-L&A=1

Top of Message | Previous Page | Permalink

Advanced Options


Options

Log In

Log In

Get Password

Get Password


Search Archives

Search Archives


Subscribe or Unsubscribe

Subscribe or Unsubscribe


Archives

March 2018
February 2018
January 2018
December 2017
August 2017
December 2016
November 2016
October 2016
September 2016
August 2016
July 2016
June 2016
May 2016
April 2016
March 2016
February 2016
January 2016
December 2015
November 2015
October 2015
September 2015
August 2015
July 2015
June 2015
May 2015
April 2015
March 2015
February 2015
January 2015
December 2014
November 2014
October 2014
September 2014
August 2014
July 2014
June 2014
May 2014
April 2014
March 2014
February 2014
January 2014
December 2013
November 2013
October 2013
September 2013
August 2013
July 2013
June 2013
May 2013
April 2013
March 2013
February 2013
January 2013
December 2012

ATOM RSS1 RSS2



LISTSERV.SLAC.STANFORD.EDU

Secured by F-Secure Anti-Virus CataList Email List Search Powered by the LISTSERV Email List Manager

Privacy Notice, Security Notice and Terms of Use