Details
-
Type:
Story
-
Status: Done
-
Resolution: Done
-
Fix Version/s: None
-
Component/s: None
-
Labels:None
-
Story Points:4
-
Team:Data Release Production
Description
In the spirit of disentangling computations from visualizations in the interactive QA framework of qa_explorer, write a task to compute columns of all quantities desired for pipe_analysis-style plots. This should use the new object table parquet files (deepCoadd_obj dataset) generated by writeObjectTable.py to do computations on whole tracts of data at a time.
The following works for me now:
writeObjectTable.py /datasets/hsc/repo/rerun/RC/w_2018_14/DM-13890/ --output /project/tmorton/DM-13877 --id tract=9615 filter=HSC-G^HSC-R --no-versions -j 18
writeQATable.py /project/tmorton/DM-13877 --output /project/tmorton/DM-13877 --id tract=9615 --no-versions -j 20
consolidateQATable.py /project/tmorton/DM-13877 --output /project/tmorton/DM-13877 --id tract=9615 --no-versions
These tasks all use the new ParquetStorage to write and read (sometimes partial) dataframes, mediated by the ParquetTable and/or MultilevelParquetTable objects. WriteQATableTask also uses a YAML specification of what functors to calculate for the QA table. In principle this can be made configurable, but this is just a demo for now.
Up next will be to connect this new tract-level dataset to the QADataset object to enable visualizations on top of these calculations.