Details
-
Type:
Story
-
Status: Done
-
Resolution: Done
-
Fix Version/s: None
-
Component/s: meas_algorithms
-
Story Points:10
-
Epic Link:
-
Sprint:AP F21-4 (September)
-
Team:Alert Production
-
Urgent?:No
Description
Currently, the only way to get a gen3 refcat is to ingest it into a gen2 repo an then convert it to gen3. IngestIndexedReferenceTask is a CmdLineTask, and we need a PipelineTask refcat ingester. Much of the internals are the same, so we can probably do it with a shared base class, but gen3 has additional information loaded into the registry about the shards, etc. that will take some care. The multiprocessing manager code, IngestIndexManager, may not be compatible with the gen3 multiprocessing system: it's designed to do all of its own multiprocessing internally, without an external controller.
These docs about how to ingest refcat data will probably be useful in putting together the gen3 code: https://pipelines.lsst.io/modules/lsst.meas.algorithms/creating-a-reference-catalog.html
I also wrote a custom Gaia ingester (IngestGaiaManager and IngestGaiaReferenceTask) to handle the gaia fluxes, which might be able to work "for free" with the new system.
Attachments
Issue Links
- blocks
-
DM-20572 Remove "Generation 2" Middleware
- Done
-
DM-32389 Create refcat management script for ap_verify datasets
- Done
- is blocked by
-
DM-31320 Prep work for creating Gen 3 refcat ingester
- Done
- is triggering
-
DM-31698 Move ConvertReferenceCatalog classes out of ingestIndex file
- Done
-
DM-31817 Remove default value on DatasetConfig.ref_dataset_name
- Done
- relates to
-
DM-31704 Design and document reference catalog dataset policy
- To Do
-
DM-30624 Document gen2 to gen3 refcat ingestion
- Done
-
DM-33639 Fix refcat ingest docs and convert message to ingest into a RUN collection
- Done
-
DM-31367 Add butler register-dataset-type subcommand
- Done
Is the sprint label for this accurate (July). I'm really looking forward to being able to stop relying on gen2 for gen3.