Attendees: Daniel, Serge, Jacek * recent update to redhat 6.3 broke boost (1.41) on slac machines. - related Red Hat bugzilla case: #908774 - ended up downgrading offending packages to get back to working version - redhat is downplaying the importance of this fix, but other labs are starting to join us and complain too * weekly status report --> need to update more regularly, many qserv tasks not updated * qserv review - tentatively planning for late July * need to plan & document our plans re schema evolution and provenance - started trac page about schema evolution https://dev.lsstcorp.org/trac/wiki/db/SchemaEvolution - provenance harder! - dedicated phone call to discuss these issues Tue Mar 5 at 10:00am pacific * upgrading boost to boot 1.53? - Serge would find some new features handy - no, too aggressive, better to rely on stable, mainstream packages that come with OS * extra feature for partitioner: auto-detecting csv separator type - would be nice to auto-detect in loader if input rows are tab or comma separated, - requested/suggested by in2p3 - ok to assume all files from given group have same separator? Yes --> defer adding new features like this until ongoing work on partitioner finished * another extra feature for partitioner: read field names from the first line of file - would be nice to have - implement later, low priority * to keep things simpler, we should require input data to perfectly match schema of table we load into - if realignment needed, pre-process input data * should partitioner be able to reorder columns? - don't over-complicate for now * should partitioner be able to drop columns from input data - needed to drop _chunkId column from our input data from pt1.2 - don't over-complicate, clean up in separate clean up step, not in partitioner * we want partitioner/duplicator to support sampling (eg produce 10% of what could be produced) - synchronizing ids between tables would be useful - implement in duplicator, not partitioner - already have ids in memory in duplicator, so it is easy refmatch - patching qserv to deal with refmatch will take ~1 week - do shortly after partitioner ready how should refmatch be treated in metadata - just add flag "isMatchTable" - should we also store info which tables are related to a given match table? - particularly useful if we want to manage multiple clusters of match tables, eg 2mass2object, sdss2object, etc... - would be nice, for later DC_W13_Stripe82_40deg does not have index on RunDeepForcedSource.objectId - DC_W13_Stripe82 does - need to add index building to scripts used for production so that things are consistent and not forgotten - a mess... ingest for either Source or ForcedSource was rewritten, not sure which code is exercised, need to talk to KT myisam - can't sort by random column, only by index Daniel redoing logging system for worker as part of export path work Jacek working on integrating metadata with qserv. Helped Dave Monet, no need to use qserv for that. Still want to try performance for differently sorted data - managed to kill mysqld on lsst10 while sorting a copy of stripe82_40 deg last night... Thanks, Jacek ######################################################################## Use REPLY-ALL to reply to list To unsubscribe from the QSERV-L list, click the following link: https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=QSERV-L&A=1