Uploaded image for project: 'Data Management'
  1. Data Management
  2. DM-22178

queryDatasets produces lots of duplicate outputs

    Details

    • Story Points:
      2
    • Team:
      Data Release Production

      Description

      Running a query like

      list(butler.registry.queryDatasets("calexp", collections=["shared/ci_hsc_output"],
                                         skymap="discrete/ci_hsc", tract=0, patch=70))
      

      produces many duplicate records for unclear reasons (they're not from having multiple collections, so deduplicate doesn't help).  Try to fix this, or at least document when duplicate results must be expected.

        Attachments

          Issue Links

            Activity

            jbosch Jim Bosch created issue -
            tjenness Tim Jenness made changes -
            Field Original Value New Value
            Description Running a query like

            {{list(butler.registry.queryDatasets("calexp", collections=["shared/ci_hsc_output"],}}
            {{ skymap="discrete/ci_hsc", tract=0, patch=70))}}

            produces many duplicate records for unclear reasons (they're not from having multiple collections, so {{deduplicate}} doesn't help).  Try to fix this, or at least document when duplicate results must be expected.
            Running a query like

            {code}
            list(butler.registry.queryDatasets("calexp", collections=["shared/ci_hsc_output"],
                                                                 skymap="discrete/ci_hsc", tract=0, patch=70))
            {code}

            produces many duplicate records for unclear reasons (they're not from having multiple collections, so {{deduplicate}} doesn't help).  Try to fix this, or at least document when duplicate results must be expected.
            tjenness Tim Jenness made changes -
            Description Running a query like

            {code}
            list(butler.registry.queryDatasets("calexp", collections=["shared/ci_hsc_output"],
                                                                 skymap="discrete/ci_hsc", tract=0, patch=70))
            {code}

            produces many duplicate records for unclear reasons (they're not from having multiple collections, so {{deduplicate}} doesn't help).  Try to fix this, or at least document when duplicate results must be expected.
            Running a query like

            {code}
            list(butler.registry.queryDatasets("calexp", collections=["shared/ci_hsc_output"],
                                                 skymap="discrete/ci_hsc", tract=0, patch=70))
            {code}

            produces many duplicate records for unclear reasons (they're not from having multiple collections, so {{deduplicate}} doesn't help).  Try to fix this, or at least document when duplicate results must be expected.
            tjenness Tim Jenness made changes -
            Description Running a query like

            {code}
            list(butler.registry.queryDatasets("calexp", collections=["shared/ci_hsc_output"],
                                                 skymap="discrete/ci_hsc", tract=0, patch=70))
            {code}

            produces many duplicate records for unclear reasons (they're not from having multiple collections, so {{deduplicate}} doesn't help).  Try to fix this, or at least document when duplicate results must be expected.
            Running a query like

            {code}
            list(butler.registry.queryDatasets("calexp", collections=["shared/ci_hsc_output"],
                                               skymap="discrete/ci_hsc", tract=0, patch=70))
            {code}

            produces many duplicate records for unclear reasons (they're not from having multiple collections, so {{deduplicate}} doesn't help).  Try to fix this, or at least document when duplicate results must be expected.
            tjenness Tim Jenness made changes -
            Link This issue is duplicated by DM-21448 [ DM-21448 ]
            tjenness Tim Jenness made changes -
            Link This issue is duplicated by DM-21448 [ DM-21448 ]
            tjenness Tim Jenness made changes -
            Link This issue duplicates DM-21448 [ DM-21448 ]
            tjenness Tim Jenness made changes -
            Link This issue relates to DM-21448 [ DM-21448 ]
            tjenness Tim Jenness made changes -
            Link This issue duplicates DM-21448 [ DM-21448 ]
            tjenness Tim Jenness made changes -
            Link This issue relates to DM-22286 [ DM-22286 ]
            jbosch Jim Bosch made changes -
            Resolution Done [ 10000 ]
            Status To Do [ 10001 ] Won't Fix [ 10405 ]

              People

              • Assignee:
                jbosch Jim Bosch
                Reporter:
                jbosch Jim Bosch
                Watchers:
                Arun Kannawadi, Jim Bosch, Tim Jenness
              • Votes:
                0 Vote for this issue
                Watchers:
                3 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved:

                  Summary Panel