Data set

Data in Pangaea are grouped in predefined data sets. A data set equals the file uploaded to the system and provided by PangaVista (or other clients or web services) for download. The granularity of a data set depends on the type of data, the number of data points and is primarily in the decision of the data provider. A data set may contain one to many data series, one to many data sets may be grouped to a parent set. Access rights can be defined for a complete set only. Each data set consists of the data accompanied by metadata according to ISO standard fields for describing geodata.

Opening a data set will show four tabs named Config, Basics, Details and Web with metadata fields as described below:

Config tab
 * Parameter window shows parameters with unit and short name, used in the data set. The buttons Add, Clear and PreSelect are used to compose new data sets.
 * Geocodes window lists all geocodes available; those used in the data set are highlighted and can be added to the Configuration.
 * Related metainformation window contains fields from the event table which can be added to the Configuration.
 * Configuration window lists geocodes, related metainformation and all parameters used in the order of the available data set. The Load and Save button can be used to save a configuration and load the same configuration to other similar data sets.
 * Format will show the number of digits before and after the decimal point of a numeric parameter if selected in the configuration window by a mouse click. Different formats can be selected from the pop-up menue or changed by hand. If the geocode Date/Time is selected, different types of ISO formats can be selected, depending on the required precision.
 * Split by event should not be used anymore.
 * Split by versions must be checked if a parameter occurs more than once.
 * Aggregate function may be used to internaly calculate statistic values - do not use.

Basics tab, the red fields are important to the citation of a data set
 * Author(s) of the data set, may be added by a multiple choice list related to the staff table
 * Title of the data set as free text; equivalent to the title of a publication
 * Source should be used for data not related to a reference, relational to the institution table which opens using the Choices button
 * Status with
 * status of the data set as pop-up menue with choices: questionable, not validated, validated, published
 * Access rights button to set individual access to data sets not having the status published
 * citable to make data set a parent set with the citation added to a library catalog - to be used by the data librarian or editors only !
 * Registry gives information about the registration process:
 * not to be registered if status is not published
 * registration is in the lead time for four weeks after setting the data set to status published
 * registered final status, automatically set four weeks after changed to published
 * login required may be checked for sets with status published but still under moratorium
 * Reference(s) opens a multiple choice list related to the reference table to select one to many papers relevant to the data set
 * PI(s) window lists all investigators related to the data series
 * Project(s) allows via a mutiple choice list to add one to many projects as provided by the Project table
 * Data series window lists all data series contained in this data set
 * Geocode(s)
 * Event(s)