Details
- Type: Story
- Status: Done
- Resolution: Done
- Fix Version/s: None
- Component/s: None
- Story Points: 2
- Epic Link:
- Team: Data Release Production
- Urgent?: No
Description
As brought up on Slack, when running step5 for tract 3828 on IDF (monitoring output at https://panda-doma.cern.ch/jobs/?jeditaskid=7004&jobstatus=failed&display_limit=100 ), a large number of drpAssociation jobs (10 of 49) failed due to unexpected floating point values in the diaSourceId index. The most relevant lines of an example stderr are the following:
  File "/opt/lsst/software/stack/stack/miniconda3-py38_4.9.2-0.7.0/Linux64/pipe_tasks/21.0.0-147-g0e635eb1+1acddb5be5/python/lsst/pipe/tasks/simpleAssociation.py", line 191, in run
    diaSources.set_index("diaSourceId", inplace=True, verify_integrity=True)
  File "/opt/lsst/software/stack/conda/miniconda3-py38_4.9.2/envs/lsst-scipipe-0.7.0/lib/python3.8/site-packages/pandas/core/frame.py", line 4779, in set_index
    raise ValueError(f"Index has duplicate keys: {duplicates}")
ValueError: Index has duplicate keys: Float64Index([1.0374117693849605e+17, 1.0374117693849606e+17,
              1.1895663545548805e+17, 1.1895663545548806e+17,
              2.4602324290764803e+17, 2.4602324290764806e+17,
              1.0374117747536699e+17, 1.03741177475367e+17,
              1.0374117747536702e+17, 2.4602324344451894e+17,
              ...
              1.0163888976822278e+17, 1.2930653275501376e+17,
              1.4244108015396456e+17, 1.4244108015396458e+17,
              1.0163889137883547e+17, 1.0163889137883549e+17,
              1.424410806908355e+17, 1.4244108069083552e+17,
              1.4244108069083554e+17, 1.4244108069083555e+17],
             dtype='float64', name='diaSourceId', length=593)
The Slack discussion indicated a connection to the problem in RFC-808 (Set sentinel values for non-floating point columns for missing bands in Object tables), but in a different context.
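The Float64Index in the traceback suggests that diaSourceId, normally an integer ID, had been converted to float64 before set_index was called. The sketch below is not taken from pipe_tasks; it only illustrates, in plain Python, why such a conversion can create duplicate keys. A float64 has a 53-bit significand, so at magnitudes near 1e17 neighbouring representable values are 16 apart and distinct integer IDs can collapse onto the same float. The ID values are hypothetical, chosen only to match the magnitude seen in the traceback.

```python
# Hypothetical integer IDs at roughly the magnitude shown in the traceback.
# They are NOT real diaSourceId values from the failed jobs.
base = (103741176938496050 // 16) * 16  # align to the float64 grid at this magnitude
ids = [base, base + 1, base + 2]

# All IDs exceed 2**53, beyond which float64 cannot represent every integer.
assert all(i > 2**53 for i in ids)

# Converting to float (as happens when an integer column is upcast to float64)
# rounds each ID to the nearest representable value.
as_float = [float(i) for i in ids]


def find_duplicates(values):
    """Return the set of values that occur more than once."""
    seen, dups = set(), set()
    for v in values:
        if v in seen:
            dups.add(v)
        seen.add(v)
    return dups


int_dups = find_duplicates(ids)         # unique as integers
float_dups = find_duplicates(as_float)  # collide after the float conversion

print("unique as int:  ", len(set(ids)))
print("unique as float:", len(set(as_float)))
```

If this is what happened here, an integrity check such as verify_integrity=True would report duplicates even though the underlying integer IDs were unique, which is consistent with the near-identical pairs listed in the error message.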
I also encountered this problem with DC2 w40, again for tract 3828. I am running tract 3829 now. Here's an example:
...
Path to the jobs is /scratch/brendal4/bps-gen3-dc2_OLD/submit/2.2i/runs/test-med-1/w_2021_40/DM-32024/20211014T205139Z/jobs/drpAssociation/3828