Details
-
Type:
Story
-
Status: Done
-
Resolution: Done
-
Fix Version/s: None
-
Component/s: None
-
Labels:None
-
Story Points:1
-
Epic Link:
-
Team:Data Facility
Description
Do a storage analysis, similar to DM-10904, for an output repo from a HSC RC reprocessing
Attachments
Issue Links
- relates to
-
DM-10904 analyze the storage usage of the output butler repos from S17B reprocessing
- Done
As suggested, I ran my stat code previously used for
DM-10904on /project/hsc_rc/w_2017_27/DM-11273. Overall, there were 356249 files present, taking up a total of 6.77 Terabytes.The average file size was 18.38 Megabytes, with the largest 2000 files having sizes from 170 Megabytes to 310 Megabytes and the smallest 2000 files having sizes from 128 kilobytes to 0 bytes.
The largest files were warp fits files, found in the deepCoadd subdirectory, as well as calexp, meas, and ref fits files found within the deepCoadd-results subdirectory. However, the smallest files were almost exclusively BKGD fits files, found within one of the numbered directories, along with the following file: deepCoadd/skyMap.pickle.
In another comment, I will introduce and discuss the histograms describing the file size distribution and the overall sizes of the butler data types.