DM-2264 (Data Management)

Implement task switching between worker job machines

    Details

    • Type: Story
    • Status: Won't Fix
    • Resolution: Done
    • Fix Version/s: None
    • Component/s: None
    • Labels: None

      Description

      AP requires that jobs be handed off to a different cluster of worker job machines while the previous set of images is still being processed.

            Activity

            spietrowicz Steve Pietrowicz added a comment -

            Waiting on a new master node for this. I believe I can split the existing worker nodes further to accommodate the new slots.
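A minimal sketch of one way to picture splitting worker nodes between two sets of tasks, assuming each node is already divided into a fixed number of slots. The node names, slot counts, and the `split_slots` helper are hypothetical and are not taken from the actual cluster configuration.

```python
from typing import Dict, List, Tuple

Slot = Tuple[str, int]  # (node name, slot number on that node)


def split_slots(nodes: Dict[str, int], groups: int = 2) -> List[List[Slot]]:
    """Deal every slot on every node out to `groups` groups in turn.

    `nodes` maps a (hypothetical) node name to its number of slots.
    """
    result: List[List[Slot]] = [[] for _ in range(groups)]
    index = 0
    for node in sorted(nodes):
        for slot in range(1, nodes[node] + 1):
            result[index % groups].append((node, slot))
            index += 1
    return result


# Hypothetical layout: two worker nodes with four slots each.
slot_groups = split_slots({"worker1": 4, "worker2": 4})
for number, group in enumerate(slot_groups):
    print(number, group)
```

Dealing slots out in turn means each group gets a share of every node, so neither set of tasks depends on a single machine.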

            spietrowicz Steve Pietrowicz added a comment -

            I have an additional VM designated as a master node, but it may be possible to run both sets of tasks through one condor master. I'm going to check with the HTCondor group to see what their experience is in this situation.

            spietrowicz Steve Pietrowicz added a comment -

            A member of the HTCondor team recommended that we keep all machines in one pool rather than trying to coordinate between two pools. This eliminates the need to decide which pool to submit to if the submitting machine fails.
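A rough sketch of how hand-off could look with a single pool, assuming the worker slots are tagged with a group label: every job goes to the same pool, and only the group constraint changes from one image set to the next. The `WorkerGroup` attribute, the group names, and the image-set names are hypothetical. The string is written in the style of an HTCondor `requirements` expression, but nothing here is submitted to HTCondor.

```python
from itertools import cycle
from typing import List, Tuple


def requirements_for(group: str) -> str:
    """Build a requirements-style expression for one worker group.

    WorkerGroup is a hypothetical machine attribute used for illustration.
    """
    return f'(WorkerGroup == "{group}")'


def hand_off(image_sets: List[str], groups: List[str]) -> List[Tuple[str, str]]:
    """Pair each image set with the next worker group in rotation."""
    rotation = cycle(groups)
    return [(name, requirements_for(next(rotation))) for name in image_sets]


# Hypothetical image sets alternating between two worker groups.
plan = hand_off(["visit-1", "visit-2", "visit-3"], ["A", "B"])
for name, expression in plan:
    print(name, expression)
```

Because consecutive image sets land on different groups, a new set can start while the previous one is still being processed, without any choice between pools.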


              People

              • Assignee: Steve Pietrowicz (spietrowicz)
              • Reporter: Steve Pietrowicz (spietrowicz)
              • Watchers: Steve Pietrowicz
