Details
- Type: Story
- Status: Done
- Resolution: Done
- Fix Version/s: None
- Component/s: None
- Labels: None
- Story Points: 20
- Epic Link:
- Team: DM Science
Description
Based on the w_2019_38 stack (before the backward-incompatible change in the Gen3 registry and the missing RC2 repo generation code), integrate the steps and test execution. Use tract=9615, the HSC-RC2 tract used in the June/July milestone run. Troubleshoot any issues that come up.
20191127T192022+0000 20191127T192345+0000
Tried again, this time running two graphs simultaneously. The master was an m5.2xlarge on-demand instance, and its CPU utilization was ~50-70% while it managed the sfm jobs. There was a surge of database transactions shortly after the sfm jobs started, but it soon evened out to ~50-70 average active sessions during sfm processing. For the workers, I got 150 m5.xlarge Spot instances for the single frame processing part and 150 r4.2xlarge Spot instances later. According to RDS Performance Insights, xact_commit.avg (transaction commits per second) was typically ~10 and db load (average active sessions) ~1.
Workflow wall time : 8 hrs, 44 mins
Cumulative job wall time : 62 days, 16 hrs
Cumulative job wall time as seen from submit side : 62 days, 23 hrs
Cumulative job badput wall time : 0.0 secs
Cumulative job badput wall time as seen from submit side : 0.0 secs
Workflow wall time : 9 hrs, 59 mins
Cumulative job wall time : 62 days, 23 hrs
Cumulative job wall time as seen from submit side : 63 days, 6 hrs
Cumulative job badput wall time : 0.0 secs
Cumulative job badput wall time as seen from submit side : 0.0 secs
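As a rough reading of the two summaries above, dividing the cumulative job wall time by the workflow wall time gives the average number of jobs each workflow kept running at once. The sketch below only redoes that arithmetic with the figures quoted above. The labels "first" and "second" follow the order of the summaries and are not tied to a particular run timestamp, and `mean_concurrency` is a helper name made up for this illustration.

```python
from datetime import timedelta

# Figures copied from the two pegasus-statistics summaries above.
# "first"/"second" is just the order in which they appear.
SUMMARIES = {
    "first": {
        "workflow": timedelta(hours=8, minutes=44),
        "cumulative": timedelta(days=62, hours=16),
    },
    "second": {
        "workflow": timedelta(hours=9, minutes=59),
        "cumulative": timedelta(days=62, hours=23),
    },
}


def mean_concurrency(workflow, cumulative):
    """Average number of jobs running at once: total job time / elapsed time."""
    return cumulative / workflow  # timedelta / timedelta yields a float


for label, s in SUMMARIES.items():
    jobs = mean_concurrency(s["workflow"], s["cumulative"])
    print(f"{label} summary: about {jobs:.0f} jobs running on average")
```

Since both graphs shared the same pool of workers, the two averages add up when estimating the load on the pool as a whole.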
# All (All)
Transformation       Count  Succeeded  Failed     Min       Max     Mean        Total
dagman::post         29788      29788       0     0.0       9.0    1.052      31343.0
pegasus::dirmanager      1          1       0     3.0       3.0      3.0          3.0
pegasus::transfer     2712       2712       0   2.176     6.768     4.21    11417.243
pipetask             27075      27075       0  17.861  6243.819  199.636  5405141.464
# All (All)
Transformation       Count  Succeeded  Failed     Min       Max     Mean        Total
dagman::post         29788      29788       0     0.0       7.0    1.012      30148.0
pegasus::dirmanager      1          1       0     2.0       2.0      2.0          2.0
pegasus::transfer     2712       2712       0   2.167     9.002    4.255    11540.378
pipetask             27075      27075       0  19.297  6300.657  200.601  5431273.315
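A quick sanity check on the two tables above: Count times Mean should reproduce Total up to the rounding of the printed mean, and the Total column (taken to be in seconds) should roughly add up to the cumulative job wall time reported earlier. The sketch below does both. Leaving `dagman::post` out of the sum is an assumption made here because it makes the numbers line up with the reported "62 days, 16 hrs" and "62 days, 23 hrs"; it is not something the ticket states. The helper names are made up for this illustration.

```python
# (count, mean_s, total_s) per transformation, copied from the tables above.
TABLES = {
    "first": {
        "dagman::post": (29788, 1.052, 31343.0),
        "pegasus::dirmanager": (1, 3.0, 3.0),
        "pegasus::transfer": (2712, 4.21, 11417.243),
        "pipetask": (27075, 199.636, 5405141.464),
    },
    "second": {
        "dagman::post": (29788, 1.012, 30148.0),
        "pegasus::dirmanager": (1, 2.0, 2.0),
        "pegasus::transfer": (2712, 4.255, 11540.378),
        "pipetask": (27075, 200.601, 5431273.315),
    },
}


def row_is_consistent(count, mean, total):
    """True if count * mean matches total within the rounding of a 3-decimal mean."""
    return abs(count * mean - total) <= count * 0.0005 + 1e-6


def job_seconds(table):
    """Sum of the Total column, leaving out dagman::post (assumption, see lead-in)."""
    return sum(total for name, (_, _, total) in table.items() if name != "dagman::post")


def days_hours(seconds):
    """Split a duration in seconds into whole days and whole remaining hours."""
    days, rest = divmod(int(seconds), 86400)
    return days, rest // 3600


for label, table in TABLES.items():
    ok = all(row_is_consistent(*row) for row in table.values())
    days, hours = days_hours(job_seconds(table))
    print(f"{label} table: rows consistent = {ok}, job time = {days} days, {hours} hrs")
```

In both tables `pipetask` accounts for nearly all of the job time, so the transfer and directory-setup overhead is small by comparison.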