DOI

Digital Object Identifier (DOI)
DOIs provide persistent links to scholarly content, helping users reach the authoritative, published version of the content they are searching for, even when the content changes location or ownership. With about 20 million DOIs registered for publications, the system is well established and consistently used by scientific publishers and organisations.

Within the STD-DOI project, the first agency providing DOIs for data is the Technical Information Library (TIB) in Hannover. PANGAEA, one of four data providers, is among the first data systems on the Internet to use DOIs for the persistent identification of scientific data. A data DOI consists of the prefix 10.1594, which is assigned to the publication of primary data through the TIB, and a suffix, separated by a slash (/). The suffix is composed of the name of the data system or center and a system-specific part. In the case of PANGAEA, this part is the internal ID automatically assigned to a data set by the relational database system. Thus a valid PANGAEA DOI may look like 10.1594/PANGAEA.123456.
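The structure of a data DOI described above can be illustrated with a minimal Python sketch. The names DataDoi and parse_data_doi are hypothetical, and the sketch assumes that the center and the system-specific part are separated by the first dot of the suffix, as in the example 10.1594/PANGAEA.123456; other data centers may compose their suffix differently.

```python
from typing import NamedTuple

# Prefix assigned to primary data through the TIB (taken from the text above).
DATA_PREFIX = "10.1594"


class DataDoi(NamedTuple):
    prefix: str    # e.g. "10.1594"
    center: str    # data system or center, e.g. "PANGAEA"
    local_id: str  # system-specific part, e.g. "123456"


def parse_data_doi(doi: str) -> DataDoi:
    """Split a data DOI into prefix, center and system-specific part.

    Assumption: center and system-specific part are separated by the
    first dot of the suffix, as in 10.1594/PANGAEA.123456.
    """
    prefix, slash, suffix = doi.strip().partition("/")
    if not slash or not suffix:
        raise ValueError(f"no suffix found in {doi!r}")
    if prefix != DATA_PREFIX:
        raise ValueError(f"unexpected prefix {prefix!r}")
    center, dot, local_id = suffix.partition(".")
    if not center or not dot or not local_id:
        raise ValueError(f"suffix {suffix!r} is not of the form CENTER.ID")
    return DataDoi(prefix, center, local_id)
```

Calling parse_data_doi("10.1594/PANGAEA.123456") yields the three parts as named fields; a string without a slash or with a different prefix raises a ValueError.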

Citation and DOI are defined in three steps of a data set's publication process:
 * 1) When a data set is imported and its status is set to validated, it receives an internal ID, which can be resolved as a preliminary DOI through doi.pangaea.de only.
 * 2) When the data set's status is set to published, the internal ID becomes a globally resolvable technical DOI after 4 weeks and the data set becomes citable.
 * 3) On request, the DOI can be defined as an official publication DOI, with the citation included in the library catalog of the TIB (http://tws.gbv.de). If several data sets should be made citable, a new parent DOI may be defined by the system's editor.
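The three steps can be read as a small state model. The following Python sketch is only an illustration under stated assumptions: the status strings "validated" and "published", the names DoiStage and doi_stage, and the rule that the identifier stays preliminary during the 4 weeks after publication are not taken from the PANGAEA software.

```python
from datetime import date, timedelta
from enum import Enum
from typing import Optional

# Delay between publication and the technical DOI (4 weeks, from step 2).
TECHNICAL_DOI_DELAY = timedelta(weeks=4)


class DoiStage(Enum):
    NONE = "no identifier yet"
    PRELIMINARY = "preliminary DOI, resolvable through doi.pangaea.de only"
    TECHNICAL = "globally resolvable technical DOI, data set is citable"
    OFFICIAL = "official publication DOI, citation in the TIB catalog"


def doi_stage(status: str, published_on: Optional[date], today: date,
              official_requested: bool = False) -> DoiStage:
    """Return the DOI stage of a data set (hypothetical status values)."""
    if status == "validated":
        return DoiStage.PRELIMINARY
    if status == "published" and published_on is not None:
        # Assumption: the ID remains preliminary until 4 weeks have passed.
        if today - published_on < TECHNICAL_DOI_DELAY:
            return DoiStage.PRELIMINARY
        return DoiStage.OFFICIAL if official_requested else DoiStage.TECHNICAL
    return DoiStage.NONE
```

For example, a data set published on 1 January is still reported as PRELIMINARY on 10 January and as TECHNICAL from 29 January on, unless an official publication DOI has been requested.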

PangaVista and the DOI resolver of PANGAEA can be used for any registered DOI (including preliminary DOIs of PANGAEA).
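A resolver link can be built by appending the DOI to the resolver's base address. The sketch below assumes that the PANGAEA resolver accepts the DOI as a URL path below doi.pangaea.de (the text names only the host, so the exact URL form is an assumption) and uses the public DOI proxy doi.org for globally registered DOIs. The function name resolver_url is hypothetical.

```python
from urllib.parse import quote

# Assumed URL form for the resolver host named in the text.
PANGAEA_RESOLVER = "https://doi.pangaea.de/"
# Public DOI proxy for globally registered DOIs.
GLOBAL_RESOLVER = "https://doi.org/"


def resolver_url(doi: str, preliminary: bool = False) -> str:
    """Build a resolver link for a DOI.

    Preliminary DOIs are resolvable through the PANGAEA resolver only,
    so they are always routed there.
    """
    base = PANGAEA_RESOLVER if preliminary else GLOBAL_RESOLVER
    # Keep the slash between prefix and suffix; escape anything unusual.
    return base + quote(doi, safe="/")
```

Since the PANGAEA resolver can also be used for any registered DOI, routing everything through it would be an equally valid choice; the split here only makes the restriction on preliminary DOIs explicit.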

Within the STD-DOI project, funded by the DFG, four agents (the German WDC cluster with WDC-Climate, WDC-MARE and WDC-RSAT, plus GFZ), together with TIB as agency, intend to establish the DOI system for data citations and publications.

Any data provider interested in assigning DOIs to data may use one of the agents listed below or become a new agent of TIB. New agents need to ensure the following when establishing a data system/center:
 * a metadata description is mandatory and should follow the ISO 19115 standard
 * the data description should be accompanied by an extended abstract and technical comments (optional)
 * long-term availability and full backup of the data system must be ensured
 * scientific parameters and their units should follow the standards of the related field, as far as these are established
 * data must be georeferenced in space and/or time (if applicable)
 * a citation for each data set must consist of bibliographic fields (author, year, title, source/publisher, reference) according to the STD-DOI application profile
 * data must be online in Open Access, ensuring direct access to the data/metadata via the DOI
 * data review and curation must include editorial supervision, with proofreading by the author
 * an external peer review of the data publication is recommended
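The bibliographic fields required for a citation can be sketched as a small record type. This is not the STD-DOI application profile itself: the class name DataCitation, the layout of the formatted string, the added doi field and the treatment of reference as an optional pointer to a related publication are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class DataCitation:
    author: str
    year: int
    title: str
    publisher: str       # "source/publisher" in the list above
    doi: str             # identifier of the data set (added for illustration)
    reference: str = ""  # assumed: optional pointer to a related publication

    def format(self) -> str:
        """Return a one-line citation (layout is an assumption)."""
        text = (f"{self.author} ({self.year}): {self.title}. "
                f"{self.publisher}, doi:{self.doi}")
        if self.reference:
            text += f", {self.reference}"
        return text
```

With hypothetical values, DataCitation("Doe, J", 2005, "Example data set", "PANGAEA", "10.1594/PANGAEA.123456").format() returns the author, year, title, publisher and DOI in a single citation line.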

Links
Agents for geoscientific primary data:
 * World Data Center for Climate (WDCC) for climate models
 * example 
 * World Data Center for Marine Environmental Sciences (WDC-MARE) with PANGAEA for any kind of georeferenced observational data
 * example: 
 * World Data Center for Remote Sensing of the Atmosphere (WDC-RSAT) for remote sensing
 * example in preparation
 * GeoForschungszentrum Potsdam (GFZ)
 * example 


 * DOI of the DOI system:
 * DOI of the Pangaea information system:


 * DOI resolver of Pangaea
 * Homepage of the DOI Project for scientific primary data http://www.std-doi.de