  1. Data Management
  2. DM-21396

Write a technote on the EFD operations

    Details

    • Type: Story
    • Status: Done
    • Resolution: Done
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      DMTN-82 makes the initial proposal on how the EFD data would be available for the Science Platform. In SQR-029, we prototyped our EFD solution based on Kafka and InfluxDB. In SQR-031 we deployed the EFD at the lab in Tucson and at the Summit supporting AuxTel tests.

      In this technote, we'll detail the EFD operations, which include the data flow from the Summit to the LDF, data replication, fault tolerance, data retention policies, and the formats used to store the raw EFD data for the lifetime of the experiment.

        Attachments

          Issue Links

            Activity

            afausti Angelo Fausti created issue -
            afausti Angelo Fausti made changes -
            Field Original Value New Value
            Epic Link DM-20445 [ 342674 ]
            afausti Angelo Fausti made changes -
            Description DMTN-82 makes the initial propose on how the EFD data would be available for the Science Platform. In the meantime, in SQR-029, we prototyped our EFD solution based on Kafka and InfluxDB and deployed the EFD at the lab in Tucson and at the Summit testing AuxTel components, see SQR-031.

            In this technote, we'll detail the EFD data workflow based on initial requirements, backup needs and on what learned so far using Kafka for multi-datacenter replication, and in particular, Kafka Connect to write data to the Oracle database at NCSA and to other formats that are useful for the Science Platform like parquet.
            DMTN-82 makes the initial proposal on how the EFD data would be available for the Science Platform. In the meantime, in SQR-029, we prototyped our EFD solution based on Kafka and InfluxDB and deployed the EFD at the lab in Tucson and at the Summit testing AuxTel components, see SQR-031.

            In this technote, we'll detail the EFD data workflow based on initial requirements, backup needs and on what learned so far using Kafka for multi-datacenter replication, and in particular, Kafka Connect to write data to the Oracle database at NCSA and to other formats that are useful for the Science Platform like parquet.
            afausti Angelo Fausti made changes -
            Description DMTN-82 makes the initial proposal on how the EFD data would be available for the Science Platform. In the meantime, in SQR-029, we prototyped our EFD solution based on Kafka and InfluxDB and deployed the EFD at the lab in Tucson and at the Summit testing AuxTel components, see SQR-031.

            In this technote, we'll detail the EFD data workflow based on initial requirements, backup needs and on what learned so far using Kafka for multi-datacenter replication, and in particular, Kafka Connect to write data to the Oracle database at NCSA and to other formats that are useful for the Science Platform like parquet.
            DMTN-82 makes the initial proposal on how the EFD data would be available for the Science Platform. In the meantime, in SQR-029, we prototyped our EFD solution based on Kafka and InfluxDB and deployed the EFD at the lab in Tucson and at the Summit testing AuxTel components (SQR-031)

            In this technote, we'll detail the EFD data workflow based on initial requirements, backup needs and on what learned so far using Kafka for multi-datacenter replication, and in particular, Kafka Connect to write data to the Oracle database at NCSA and to other formats that are useful for the Science Platform like parquet.
            frossie Frossie Economou made changes -
            Watchers Angelo Fausti, Frossie Economou [ Angelo Fausti, Frossie Economou ] Angelo Fausti, Frossie Economou, Kian-Tat Lim, Michelle Butler [ Angelo Fausti, Frossie Economou, Kian-Tat Lim, Michelle Butler ]
            afausti Angelo Fausti made changes -
            Description DMTN-82 makes the initial proposal on how the EFD data would be available for the Science Platform. In the meantime, in SQR-029, we prototyped our EFD solution based on Kafka and InfluxDB and deployed the EFD at the lab in Tucson and at the Summit testing AuxTel components (SQR-031)

            In this technote, we'll detail the EFD data workflow based on initial requirements, backup needs and on what learned so far using Kafka for multi-datacenter replication, and in particular, Kafka Connect to write data to the Oracle database at NCSA and to other formats that are useful for the Science Platform like parquet.
            DMTN-82 makes the initial proposal on how the EFD data would be available for the Science Platform. In the meantime, in SQR-029, we prototyped our EFD solution based on Kafka and InfluxDB and deployed the EFD at the lab in Tucson and at the Summit testing AuxTel components (SQR-031)

            In this technote, we'll detail the EFD data workflow based on initial requirements, backup needs and on what learned so far using Kafka for multi-datacenter replication, writing data to the Oracle database at NCSA and to other formats that are useful for the Science Platform like parquet.
            afausti Angelo Fausti made changes -
            Status To Do [ 10001 ] In Progress [ 3 ]
            afausti Angelo Fausti added a comment -

            The technote on EFD Operations SQR-034 is shaping up

            https://sqr-034.lsst.io/v/DM-21396/index.html

            afausti Angelo Fausti added a comment - - edited

            Still missing a description of the aggregator (I am looking at the Kafka Stream API) and long-term storage options at the LDF.

            afausti Angelo Fausti added a comment -

            This library for Kafka stream processing in Python looks like a good option for the Aggregator https://faust.readthedocs.io/en/latest/

            plutchak Joel Plutchak (Inactive) made changes -
            Link This issue blocks DM-18778 [ DM-18778 ]
            afausti Angelo Fausti added a comment -
            • Add discussion on fault tolerance with Kafka
            • Discuss downsampling of the time series in InfluxDB, using multiple retention policies, to increase query performance over extended periods.
            afausti Angelo Fausti made changes -
            Summary Write technote proposing the EFD end-to-end data workflow Write a technote on the EFD operations
            afausti Angelo Fausti made changes -
            Description DMTN-82 makes the initial proposal on how the EFD data would be available for the Science Platform. In the meantime, in SQR-029, we prototyped our EFD solution based on Kafka and InfluxDB and deployed the EFD at the lab in Tucson and at the Summit testing AuxTel components (SQR-031)

            In this technote, we'll detail the EFD data workflow based on initial requirements, backup needs and on what learned so far using Kafka for multi-datacenter replication, writing data to the Oracle database at NCSA and to other formats that are useful for the Science Platform like parquet.
            DMTN-82 makes the initial proposal on how the EFD data would be available for the Science Platform. In SQR-029, we prototyped our EFD solution based on Kafka and InfluxDB. In SQR-031 we deployed the EFD at the lab in Tucson and at the Summit supporting AuxTel tests.

            In this technote, we'll detail the EFD operations, which includes the data flow from the Summit to LDF, data replication, fault tolerance, data retention policies and formats to store the raw EFD data for the lifetime of the experiment.
            afausti Angelo Fausti added a comment -

            Complete version of the document with the following structure:

            1 Introduction
            2 The SAL Kafka producer
            3 The Kafka Connect manager
            4 Data replication and fault tolerance
            5 Downsampling and data retention
            6 The Aggregator
            7 Options for long-term storage at the LDF
            8 Monitoring
            9 References

            afausti Angelo Fausti added a comment - https://github.com/lsst-sqre/sqr-034/pull/2
            afausti Angelo Fausti made changes -
            Story Points 2.8 5.6
            afausti Angelo Fausti made changes -
            Resolution Done [ 10000 ]
            Status In Progress [ 3 ] Done [ 10002 ]
            afausti Angelo Fausti made changes -
            Epic Link DM-20445 [ 342674 ] DM-21713 [ 424613 ]

              People

              Assignee:
              afausti Angelo Fausti
              Reporter:
              afausti Angelo Fausti
              Watchers:
              Angelo Fausti, Frossie Economou, Gregory Dubois-Felsmann, Kian-Tat Lim, Michelle Butler [X] (Inactive), Michelle Gower
              Votes:
              0

                Dates

                Created:
                Updated:
                Resolved:

                  Jenkins

                  No builds found.