Intern:MOSAiC data

This page discusses the steps for MOSAiC project data curation and the consistent use of metadata. The data should follow the MOSAiC Data Policy. The data processing levels were defined there as:
 * Raw data: Data directly produced by sensors, devices, or manual observation, prior to additional processing, calibration and quality assessment/control (never modified)
 * Primary data: Processed data that modify a copy of the raw data, e.g., outliers removed, calibrated, quality controlled
 * Value-added data/derived data product: Products based on raw or primary data that may involve derivation of additional parameters or delayed-mode quality control using external data or post-use sensor calibration; model data or a combination with any external data, e.g., by data assimilation, visualization, classification, or clustering

This SOP describes the publication of the latter two levels, submitted by authors / PIs through the ticket system. Raw data publishing follows a different procedure.

=Processed data=

Submitted by the authors via the submission form.

Submission and metadata check in the ticket

 * The authors / PIs were instructed to use the "MOSAiC" label during data submission. If the label is missing, add it at any later stage, ideally before the metadata transfer from the ticket.
 * The abstract is provided in the Description field of the ticket.
 * Author first names are provided.
 * The PI's e-mail address / ORCID is provided.
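The checklist above can be sketched as a small validation routine. This is only an illustration: the `Ticket` class and its field names are hypothetical and do not correspond to the real ticket system, and the sketch assumes that either an e-mail address or an ORCID is sufficient for the PI.

```python
from dataclasses import dataclass, field


@dataclass
class Ticket:
    """Hypothetical stand-in for a submission ticket (not a real ticket-system API)."""
    labels: list[str] = field(default_factory=list)
    description: str = ""
    author_first_names: list[str] = field(default_factory=list)
    pi_email: str = ""
    pi_orcid: str = ""


def check_ticket(ticket: Ticket) -> list[str]:
    """Return the open checklist items; an empty list means the ticket passes."""
    problems = []
    if "MOSAiC" not in ticket.labels:
        problems.append('add the "MOSAiC" label')
    if not ticket.description.strip():
        problems.append("abstract missing in the Description field")
    names = ticket.author_first_names
    if not names or not all(name.strip() for name in names):
        problems.append("author first names missing")
    # Assumption: either contact identifier is enough.
    if not (ticket.pi_email.strip() or ticket.pi_orcid.strip()):
        problems.append("PI e-mail address or ORCID missing")
    return problems
```

Calling `check_ticket(Ticket())` on an empty ticket returns all four items, which could be pasted into the ticket as a reminder for the author.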

Data import

 * The data are linked to Science operations based on the Events.
 * Check whether the raw data associated with the processed data have already been published in PANGAEA. If not, and if it is reasonable to do so, ask the author to initiate raw data publication.

Required metadata

 * Event (device operation ID)
   * provides the link to the expedition leg (Campaign). Each MOSAiC-related campaign has the alternative label "MOSAiC20192020".
   * provides the link to sensor.awi.de
 * Abstract: required
 * Project: MOSAiC (this automatically provides the funding number "AWI_PS122_00")
 * Processing level:

Further metadata

 * Keywords: free text
   * Some teams agreed on using special keywords. These are:
     * MOSAiCPO: technical keyword for the OCE team https://www.pangaea.de/?q=%40MOSAICPO (or type "@MOSAICPO" in the PANGAEA search)
     * MOSAiC_PO: keyword for the OCE team https://www.pangaea.de/?q=keyword:%22MOSAIC_PO%22 (or type 'keyword:"MOSAIC_PO"' in the PANGAEA search)
 * References:
   * Documentation as further details (PDF, etc.)
   * Link to the raw version of the data, if archived. If not archived, ask the authors to publish the raw data.
 * Project: further projects and awards
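The two search links for the OCE team keywords are the percent-encoded forms of the search terms. A minimal sketch of how such links can be built with the Python standard library follows; the helper name `search_url` is made up for this example, and only the base URL and the two search terms come from this page.

```python
from urllib.parse import quote

PANGAEA_SEARCH = "https://www.pangaea.de/?q="


def search_url(query: str) -> str:
    """Percent-encode a search term and append it to the PANGAEA search URL."""
    # ":" is kept unescaped so the result matches the keyword link given on this page.
    return PANGAEA_SEARCH + quote(query, safe=":")


technical_url = search_url("@MOSAICPO")
keyword_url = search_url('keyword:"MOSAIC_PO"')
print(technical_url)
print(keyword_url)
```

The same helper could be reused if other teams agree on further keywords.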

Collections
Replicate the events from the child datasets to the collection, so that the collection dataset can be associated with the correct expedition leg(s).
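Replicating events amounts to taking the union of the events of all child datasets. The following sketch illustrates the idea on plain Python data; the dataset IDs and event labels are placeholders, and the actual replication is done in the editorial system, not with this code.

```python
def replicate_events(child_events: dict[str, list[str]]) -> list[str]:
    """Collect the unique event labels of all child datasets in first-seen order."""
    merged: dict[str, None] = {}
    for events in child_events.values():
        for event in events:
            merged.setdefault(event, None)  # a dict keeps insertion order and drops duplicates
    return list(merged)


# Placeholder child datasets and event labels, for illustration only.
children = {
    "child-1": ["EVENT_A", "EVENT_B"],
    "child-2": ["EVENT_B", "EVENT_C"],
}
collection_events = replicate_events(children)
```

Because every event links to its campaign, the merged list is what ties the collection to all expedition legs covered by its children.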

Datasets in review
Access is generally granted only to the authors.

Published datasets
MOSAiC data that are not released open access immediately have the default moratorium date of 2023-01-01 (the latest date on which all MOSAiC data should be published open access according to the data policy). For password-protected data in status "published", access rights are added for the group "MOSAiC".

The author is informed about the access restrictions in the ticket, for example: The access moratorium on the data is 2023-01-01 and until then only authors and PANGAEA users who signed the MOSAiC data policy can download the data.
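The moratorium rule and the ticket message can be expressed as a short sketch. The function names are hypothetical, and the code assumes that access opens on the moratorium date itself, which this page does not state explicitly.

```python
from datetime import date

DEFAULT_MORATORIUM = date(2023, 1, 1)


def is_restricted(today: date, moratorium: date = DEFAULT_MORATORIUM) -> bool:
    """Return True while the access moratorium is still in effect."""
    # Assumption: the data become open on the moratorium date itself.
    return today < moratorium


def ticket_notice(moratorium: date = DEFAULT_MORATORIUM) -> str:
    """Build the access notice for the ticket, following the example wording above."""
    return (
        f"The access moratorium on the data is {moratorium.isoformat()} and until then "
        "only authors and PANGAEA users who signed the MOSAiC data policy "
        "can download the data."
    )
```

Passing a different `moratorium` date covers datasets for which the authors agreed on an earlier release.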

Any PANGAEA user who signed the Data Policy can be added to the group "MOSAiC" on request. Whether a user is on the list of signatories can be cross-checked by Dana and Amelie.

The following text can be added as a dataset comment: