Uploaded image for project: 'Data Management'
  1. Data Management
  2. DM-13207

Provide a capability for reproducible sub-selections on tabular data results

    XMLWordPrintable

    Details

    • Type: Improvement
    • Status: To Do
    • Resolution: Unresolved
    • Fix Version/s: None
    • Component/s: Science Platform
    • Labels:
      None
    • Team:
      Architecture

      Description

      This is a reminder-to-self ticket.  The task is, at some point, to do some brainstorming about whether we can provide general means for performing reproducible sub-selections of tabular data results in the context of the Science Platform.

      This is useful for purposes like the division of data into training, validation, and testing samples for machine learning, and the decimation of large results for "quick look" display.

      It is often possible to implement such selections based on a hash.  I would like to think about what it would take to do this in a way that was generically applicable and that could be connected to a simple UI.

      E.g., I'd like to have an API, and a UI above it, that would enable actions like "divide the result into thirds" - reproducibly, so that the same division would be available in the Portal and programmatically in the Notebook or remotely.

      (This is not something I want to spend any time on right now, however.)

        Attachments

          Issue Links

            Activity

            No builds found.
            gpdf Gregory Dubois-Felsmann created issue -
            tjenness Tim Jenness made changes -
            Field Original Value New Value
            Remote Link This issue links to "Page (Confluence)" [ 31022 ]
            Hide
            tjenness Tim Jenness added a comment -

            Gregory Dubois-Felsmann what would you like to do with this ticket?

            Show
            tjenness Tim Jenness added a comment - Gregory Dubois-Felsmann what would you like to do with this ticket?

              People

              Assignee:
              gpdf Gregory Dubois-Felsmann
              Reporter:
              gpdf Gregory Dubois-Felsmann
              Watchers:
              Fritz Mueller, Gregory Dubois-Felsmann, Kian-Tat Lim, Tim Jenness, Xiuqin Wu [X] (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

                Dates

                Created:
                Updated:

                  Jenkins

                  No builds found.