DOI
Digital Object Identifier (DOI)
DOIs provide persistent links to scholarly content, helping users get to the authoritative, published version of the content they are searching for, even when the content changes location or ownership. With about 35 Million registered DOI for publications (2009), the system is established and consequently used by scientific publishers and organisations.
Through the project STD-DOI, the TIB (German National Library of Science and Technology, Hannover) was established as an agency for data DOI. PANGAEA among four data providers was the first system using DOI for automated persistent identification of data sets. A data DOI has the prefix 10.1594 which is assigned to the publication of primary data through the TIB. The suffix, separated by a slash, is composed of the data system or center acronym and a system specific part. In a Pangaea DOI, this part is equivalent to the internal ID, automaticaly assigned to a data set by the relational database management system during import; thus the uniqueness of each DOI is assured.
A valid Pangaea-DOI has the syntax 10.1594/PANGAEA.738357
- spelled doi:10.1594/PANGAEA.738357
- and resolved as https://doi.org/10.1594/PANGAEA.738357
Data citation and DOI are defined in three steps during the publication process:
- registry status will be registered, then
- registry status registration is in the lead time with DOI registration in progress for 30 days followed by
- registry status registered after transfer of the DOI to the DOI-registry > DOI can be resolved globally, e.g. at https://doi.org/
- If a data set is imported and the status is set to validated, its internal ID can only be resolved as a preliminary DOI through doi.pangaea.de (PANGAEAs own DOI resolver). In the citation, the data set is identified as Dataset #738509
- If a data set status is set to published, the internal ID is changed to a global resolvable technical DOI 4 weeks after the last edit and the data set gets the status citable. In the citation, the data set is identified as Dataset #738509 (DOI registration in progress), changing after 4 weeks to doi:10.1594/PANGAEA.82361 which can be resolved globally.
- On request the dataset can be defined as an offical data publication and is thus added to the library catalog of the TIB, see citation.
PangaVista and the DOI resolver of PANGAEA can be used for any registered DOI, including preliminary DOIs of Pangaea.
If a data set to be archived in Pangaea already has a DOI from an other repository, this citation will be indicated with its citation as "other version" in the metadata header.
In case a registered data set has to be deleted, in the field other version the link/DOI to the substitute must be given before deletion.
- Example: doi:10.1594/PANGAEA.58757
Prerequisites to become an agent for the registration of scientific primary data
Any data provider, interested in assigning DOI for data may use one of the agents listed below or become a new agent of the data-DOI agency TIB. When establishing a data system/center new agents need to assure the following points defined through a concept and a data policy:
- Metadata
- metadata are mandatory and should follow standards of the specific scientific field the data are covering (e.g. ISO19115 for geo-data)
- data sets must be accompanied by a citation, consisting of bibliographic fields according to the STD-DOI application profile
- Access and availibility
- long-term availability must be assured, stable linking is provided by means of a DOI
- data must be available online, assuring Open Access to metadata; Open Access to data is highly recommended (access restrictions may appear for a moratorium period); data should be provided under a CC-license
- it is highly recommended, that data are machine readable, giving data in the repository an added values. This means, that
- (1) data are provided in a standard technical format (best is ascii and ISO formats)
- (2) data are organized in a way, that further processing of any part of the repository can easily be performed (data model, relational database)
- a full backup of the data repository must be assured
- Data review and integrity
- once registered, data sets are static
- versioning is allowed, different versions should be linked to each other
- data curation must include an editorial process with proofread by the author/principle investigator (the author is responsible for the scientific quality of the data!)
- an external peer-review of data publications is recommended
Links to agents for archiving geoscientific primary data with DOI
- World Data Center for Climate (WDCC) for climate models
- example doi:10.1594/WDCC/EH5-T63L31_OM_1CO2_2_MM
- contact Michael Lautenschlager
- PANGAEA data library for georeferenced observational data (including WDC-MARE)
- example: doi:10.1594/PANGAEA.484677
- Contact
- World Data Center for Remote Sensing of the Atmosphere (WDC-RSAT) for remote sensing
- example doi:10.1594/WDCRSAT.5Q6Q9Q9B
- contact Michael Bittner
- GeoForschungszentrum Potsdam (GFZ) with ICDP
- example doi:10.1594/GFZ/ICDP/KTB/ktb-geoch-gaschr-p
- contact Jens Klump
DOI provision service for reports and grey literature by PANGAEA/TIB
For reports and grey literature such as Master or PhD theses a DOI can be assigned by the TIB. The DOI prefix for these kinds of documents is 10.2312. Update 2020-12: a new prefix was provided by the TIB to account for inconsistencies with the DataCite policy: please use 10.48433 from now on.
For submitting documents as PDF to TIB:
1) Put all PDF files into one directory. The file names of the PDF files have to be the suffix of the DOI (case sensitive!).
2) Create a control file for Intern:PanXML. See also File:Metadata grey literature v3.pdf
3) Execute the control file with Intern:PanXML.
PanXML creates an XML file for each DOI. Send all PDF files together with XML files to TIB (Frauke Ziedorn or Britta Dreyer) in one zip archive.
Links
- DOI-Registrieragentur @ TIB
- The International DOI Foundation (IDF) https://doi.org/
- shortDOI service http://shortdoi.org
- DOI handbook: doi:10.1000/182
- DOI Project for scientific primary data http://www.std-doi.de
- DOI of the DOI system: doi:10.1000/1
- DOI of the Pangaea data library: doi:10.1594/PANGAEA
- DOI resolver of Pangaea
- What is a resource identifier?
- Publication using a child-DOI for an image: doi:10.1371/journal.pbio.0020449 , see Reconstruction of Neanderthal woman
- Handle resolver
- Handle system
- to check handle values, type in a DOI and check Don't Redirect to URLs
- provider für DOI im deutschsprachigen Raum http://www.mvb-online.de
- provider für DOI in Europa http://www.medra.org/
- kopal - Kooperativer Aufbau eines Langzeitarchivs digitaler Informationen
- nestor - Kompetenznetzwerk Langzeitarchivierung
- vascoda - Internet-Portal für wissenschaftliche Information
- PARSE.Insight - Permanent access to the records of science in Europe
Resolver
- The CNRI Handle Extension for Firefox is part of the official Resolver Add-ons for Firefox
- https://doi.org - priority in use!
- https://doi.pangaea.de - resolves doi and handle and unregistered "doi" of pangaea
- http://hdl.handle.net
- http://nbn-resolving.de
- http://nbn-resolving.de/urn:nbn:de:gbv:46-ep000103869
- http://nbn-resolving.de/urn/resolver.pl?urn=urn:nbn:de:tib-10.1594/PANGAEA.5479896 (special case with doi as part of the urn)
- http://www.sref.org (invented by publisher Copernicus, out of order, substituted by DOI in 2009)
- http://lsid.tdwg.org/ (life science identifier)
for ISBN there is no online-redirect and thus no direct resolver, see
- http://en.wikipedia.org/wiki/Special:BookSources?isbn=9783000050282
- http://en.wikipedia.org/wiki/Wikipedia:ISBN
IGSN (International Geological Sampling Number) development stage
- IGSN: ODP010MEY