LISTSERV mailing list manager LISTSERV 16.5

Help for QSERV-L Archives


QSERV-L Archives

QSERV-L Archives


QSERV-L@LISTSERV.SLAC.STANFORD.EDU


View:

Message:

[

First

|

Previous

|

Next

|

Last

]

By Topic:

[

First

|

Previous

|

Next

|

Last

]

By Author:

[

First

|

Previous

|

Next

|

Last

]

Font:

Proportional Font

LISTSERV Archives

LISTSERV Archives

QSERV-L Home

QSERV-L Home

QSERV-L  February 2015

QSERV-L February 2015

Subject:

Re: data set for large scale test

From:

Jacek Becla <[log in to unmask]>

Reply-To:

General discussion for qserv (LSST prototype baseline catalog)

Date:

Thu, 26 Feb 2015 13:16:38 -0800

Content-Type:

text/plain

Parts/Attachments:

Parts/Attachments

text/plain (70 lines)

I added an epic

https://jira.lsstcorp.org/browse/DM-2187

to capture loading tables with no ra/decl

Jacek



On 02/26/2015 12:39 PM, Daniel L. Wang wrote:
> I would like to note that the current system requires the extra raObject
> and declObject columns in ForcedSource, so that table's size will be
> proportionally larger than it would be in production. Do we actually
> have ForcedSource data? I think it would be useful to add some
> artificial data to one of our test cases (generate 10 randomly generated
> ForcedSource rows per Object row in case01), so we can actually run that
> test.
>
> The code to load child table rows that lack director positioning (other
> than the director's primary key) needs to be on our schedule eventually.
> I don't know when. The general case is very expensive (lookup position
> and chunk for each position!?), and we are only going to get away with
> it because our bulk-loads for ForcedSource will be spatially-restricted.
>
> -Daniel
>
> On 02/26/2015 12:32 PM, Jacek Becla wrote:
>> So, we said we'd do 10% of DR1. We need to think carefully
>> how we want to look at DR1, because the full data set with
>> indexes, object_extra etc is 1.7 petabyte.
>>
>> I think a fair and realistic test would be to look at 10%
>> of the core data (Object, Source, ForcedSource, Exposures),
>> exercise some scans and joins, but just forget Object_extra
>> (which we can argue will be less frequently used, and testing
>> with it won't really stress qserv software in any serious
>> way anyway. After all, we always ran with Object and Source
>> only in the past too)
>>
>> That basically is ~27 TB + indexes. (The data sizes for DR1:
>>   38 TB Object
>>  186 TB Source
>>   45 TB ForcedSource)
>>
>>
>> We have ~8 TB on each machine at IN2P3, if I recall,
>> so as we said earlier, ~10 machines would be a minimum
>> to run the test, 25 would be comfortable, 50 would be
>> even better.
>>
>> Are we in position to generate Object, Source
>> and ForcedSource tables? Which data set would we be
>> using to start with?
>>
>> Jacek
>>
>> ########################################################################
>> Use REPLY-ALL to reply to list
>>
>> To unsubscribe from the QSERV-L list, click the following link:
>> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=QSERV-L&A=1
>

########################################################################
Use REPLY-ALL to reply to list

To unsubscribe from the QSERV-L list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=QSERV-L&A=1

Top of Message | Previous Page | Permalink

Advanced Options


Options

Log In

Log In

Get Password

Get Password


Search Archives

Search Archives


Subscribe or Unsubscribe

Subscribe or Unsubscribe


Archives

March 2018
February 2018
January 2018
December 2017
August 2017
December 2016
November 2016
October 2016
September 2016
August 2016
July 2016
June 2016
May 2016
April 2016
March 2016
February 2016
January 2016
December 2015
November 2015
October 2015
September 2015
August 2015
July 2015
June 2015
May 2015
April 2015
March 2015
February 2015
January 2015
December 2014
November 2014
October 2014
September 2014
August 2014
July 2014
June 2014
May 2014
April 2014
March 2014
February 2014
January 2014
December 2013
November 2013
October 2013
September 2013
August 2013
July 2013
June 2013
May 2013
April 2013
March 2013
February 2013
January 2013
December 2012

ATOM RSS1 RSS2



LISTSERV.SLAC.STANFORD.EDU

Secured by F-Secure Anti-Virus CataList Email List Search Powered by the LISTSERV Email List Manager

Privacy Notice, Security Notice and Terms of Use