Details
-
Type:
Story
-
Status: To Do
-
Resolution: Unresolved
-
Fix Version/s: None
-
Component/s: None
-
Labels:None
-
Story Points:1.4
-
Epic Link:
-
Team:SQuaRE
-
Urgent?:No
Description
TLS certificate expiration on the SQuaSH InfluxDB instance caused data from 2020-11-08 runs to be missing in Chronograf.
For example: https://ci.lsst.codes/blue/organizations/jenkins/sqre%2Fvalidate_drp/detail/validate_drp/2165/pipeline
From the API worker logs, we see the problem started on
[2020-11-08 10:20:43,188: ERROR/ForkPoolWorker-2] Task app.tasks.influxdb.job_to_influxdb[91e24fe9-ccfa-4fe3-9bc1-11526725b587] raised unexpected: SSLError(MaxRetryError("HTTPSConnectionPool(host='influxdb-demo.lsst.codes', port=443): Max retries exceeded with url: /write?db=squash-prod (Caused by SSLError(SSLError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:852)'),))",),)
|
subsequent jobs have been sent to InfluxDB successfully.
This ticket is to verify which data is missing in InfluxDB and restore it manually from the SQuaSH DB.