I have taken the Glossary terms from DMTN-085 and put them into the spreadsheet format used for DM-14877 (see uploaded QA_Definitions_20190613.numbers and .csv file).
Once the DMTN-085 authors (Eric Bellm or a co-author) review and correct the following, the attached file can be merged with the master Glossary file (probably by Tim Jenness) or MLG can provide a final merged spreadsheet file. No pre-existing Glossary terms needed to be modified based on these new terms, so merging should be straightforward.
Not included because it did not have a definition:
CI Continuous Integration
HSC Hyper Suprime-Cam
KPM Key Performance Metric
LDF LSST Data Facility
QAWG QA Strategy Working Group
Incorporated with some modification:
drill down "Move from a higher level aggregation of data to its inputs. For example, given data describing a tract, to drill down to constituent patches and then to objects. Also refers to the act of identifying an issue in a high-level summary of the data (e.g. an aberrant metric value) and interactively investigating its inputs to find the source of the problem."
General Parallel File System "The bulk data storage provided through a POSIX filesystem interface at the LSST Data Facility. Refers specifically to IBM’s General Parallel File System; aslo known as IBM Spectrum Scale." GPFS
metric value "The result of computing a particular metric on some given data. Note that metric values are typically computed rather than measured. See also: metric."
monitoring "In DM QA, this refers to the process of collecting, storing, aggregating and visualizing metrics."
Quality Assurance "All activities, deliverables, services, documents, procedures or artifacts which are designed to ensure the quality of DM deliverables. This may include QC systems, in so far as they are covered in the charge described in LDM-622. Note that contrasts with the LDM-522 definition of “QA” as “Quality Analysis”, a manual process which occurs only during commissioning and operations. See also: Quality Control." QA
Quality Control "Services and processes which are aimed at measuring and monitoring a system to verify and characterize its performance (as in LDM-522). Quality Control systems run autonomously, only notifying people when an anomaly has been detected. See also Quality Assurance." QC
releaseable product "A software package or other component of the DM system which is expected to be included in the next tagged release of the system. This implies inclusion in a standard top-level package. See also release-tag."
Incorporated with little to no modification:
aggregate metric "An aggregation of multiple point metrics. For example, the overall photometric repeatability for a particular tract given multiple observations of each star."
aggregation "A single result—e.g., a metric value—computed from a collection of input values. For example, we can sum or average a metric computed over patches to produce an aggregate metric at tract level."
Apache Parquet "A columnar storage data persistence format maintained by the Apache project; http://parquet.apache.org."
dashboard "A visual display of the most important information needed to achieve one or more objectives, consolidated and arranged on a single screen so that the information can be monitored at a glance (Few, 2013)."
metric "We follow the SQR-019 definition of a metric as a measurable quantities which may be tracked. A metric has a name, description, unit, references, and tags (which are used for grouping). A metric is a scalar by definition. We consider multiple types of metric in this document; see aggregate metric, model metric, point metric."
model metric "A metric describing a model related to the data. For example, the coeficients of a 2D polynomial fit to the background of a single CCD exposure."
point metric "A metric that is associated with a single entry in a catalog. Examples include the shape of a source, the standard deviation of the flux of an object detected on a coadd, the flux of an source detected on a difference image."
tidy data "Tidy datasets have a specific structure: each variable is a column, each observation is a row, and each type of observational unit is a table (Wickham, H., 2014, Journal of Statistical Software, Articles, 59, 1)."
Already included in glossary:
DM Data Management.
provenance A description of the inputs and processes which have been used to generate a particular result or data product.
SDQA Science Data Quality Assurance.
SQuaSH Science Quality Analysis Harness; SQR-009; https://squash.lsst.codes.
I have taken the Glossary terms from DMTN-085 and put them into the spreadsheet format used for
DM-14877(see uploaded QA_Definitions_20190613.numbers and .csv file).Once the DMTN-085 authors (Eric Bellm or a co-author) review and correct the following, the attached file can be merged with the master Glossary file (probably by Tim Jenness) or MLG can provide a final merged spreadsheet file. No pre-existing Glossary terms needed to be modified based on these new terms, so merging should be straightforward.
Not included because it did not have a definition:
CI Continuous Integration
HSC Hyper Suprime-Cam
KPM Key Performance Metric
LDF LSST Data Facility
QAWG QA Strategy Working Group
Incorporated with some modification:
drill down "Move from a higher level aggregation of data to its inputs. For example, given data describing a tract, to drill down to constituent patches and then to objects. Also refers to the act of identifying an issue in a high-level summary of the data (e.g. an aberrant metric value) and interactively investigating its inputs to find the source of the problem."
General Parallel File System "The bulk data storage provided through a POSIX filesystem interface at the LSST Data Facility. Refers specifically to IBM’s General Parallel File System; aslo known as IBM Spectrum Scale." GPFS
metric value "The result of computing a particular metric on some given data. Note that metric values are typically computed rather than measured. See also: metric."
monitoring "In DM QA, this refers to the process of collecting, storing, aggregating and visualizing metrics."
Quality Assurance "All activities, deliverables, services, documents, procedures or artifacts which are designed to ensure the quality of DM deliverables. This may include QC systems, in so far as they are covered in the charge described in LDM-622. Note that contrasts with the LDM-522 definition of “QA” as “Quality Analysis”, a manual process which occurs only during commissioning and operations. See also: Quality Control." QA
Quality Control "Services and processes which are aimed at measuring and monitoring a system to verify and characterize its performance (as in LDM-522). Quality Control systems run autonomously, only notifying people when an anomaly has been detected. See also Quality Assurance." QC
releaseable product "A software package or other component of the DM system which is expected to be included in the next tagged release of the system. This implies inclusion in a standard top-level package. See also release-tag."
Incorporated with little to no modification:
aggregate metric "An aggregation of multiple point metrics. For example, the overall photometric repeatability for a particular tract given multiple observations of each star."
aggregation "A single result—e.g., a metric value—computed from a collection of input values. For example, we can sum or average a metric computed over patches to produce an aggregate metric at tract level."
Apache Parquet "A columnar storage data persistence format maintained by the Apache project; http://parquet.apache.org."
dashboard "A visual display of the most important information needed to achieve one or more objectives, consolidated and arranged on a single screen so that the information can be monitored at a glance (Few, 2013)."
metric "We follow the SQR-019 definition of a metric as a measurable quantities which may be tracked. A metric has a name, description, unit, references, and tags (which are used for grouping). A metric is a scalar by definition. We consider multiple types of metric in this document; see aggregate metric, model metric, point metric."
model metric "A metric describing a model related to the data. For example, the coeficients of a 2D polynomial fit to the background of a single CCD exposure."
point metric "A metric that is associated with a single entry in a catalog. Examples include the shape of a source, the standard deviation of the flux of an object detected on a coadd, the flux of an source detected on a difference image."
tidy data "Tidy datasets have a specific structure: each variable is a column, each observation is a row, and each type of observational unit is a table (Wickham, H., 2014, Journal of Statistical Software, Articles, 59, 1)."
Already included in glossary:
DM Data Management.
provenance A description of the inputs and processes which have been used to generate a particular result or data product.
SDQA Science Data Quality Assurance.
SQuaSH Science Quality Analysis Harness; SQR-009; https://squash.lsst.codes.