Intern:Project data management/IODP

Workflow for published ODP/DSDP/IODP data
1) Reference search in GeoRef
 * via http://odp.georef.org/dbtw-wpd/qbeodp.htm
 * enter the journal name in the "Source" field
 * select the "Brief record" option

2) Open reference homepage/journal homepage
 * if available, open the article page via the URL given in the "Brief record" (in a new window)
 * otherwise search for the article via the journal homepage (in a new window)

3) Import of relevant information
 * Older references may have a DOI that is NOT given in the PDF version but only in the online version; the same applies to the issue number. Therefore extract all information from the online version into an Excel sheet:

Example from Marine Micropaleontology

Columns: GeoRef reference no., author/year, volume and issue, no. of tables with georeferenced data, no. of supplements with georeferenced data, state of work, DOI

4) Download PDF
 * Download the PDF file and the supplements of a reference.
 * Rename the PDF file to the GeoRef reference number followed by the author name, for example: 001Agnini.pdf.
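
The naming convention can be sketched as a small helper. This is only an illustration: the function name `georef_pdf_name` is hypothetical, and the three-digit zero padding is an assumption inferred from the single example 001Agnini.pdf.

```python
def georef_pdf_name(ref_no: int, author: str) -> str:
    """Build a file name such as 001Agnini.pdf (hypothetical helper)."""
    # Keep only letters and digits of the author name, so that blanks or
    # punctuation in a surname do not end up in the file name.
    surname = "".join(ch for ch in author if ch.isalnum())
    # Assumption: the GeoRef reference number is zero-padded to three digits,
    # as in the example 001Agnini.pdf.
    return f"{ref_no:03d}{surname}.pdf"


print(georef_pdf_name(1, "Agnini"))
```

If a subset contains more than 999 references, the padding width would have to be increased.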

5) Only after ALL references from one defined GeoRef subset have been checked and downloaded, start the import to Pangaea. Check the GeoRef report first, because new references are added continuously, which can shift the numbering and disrupt a consistent workflow.
 * Start with the most recent reference. Modern PDFs are in better condition than older ones.

6) Preparation of data
 * First check whether datasets for the reference already exist in Pangaea
 * Supplementary data are often provided as an Excel sheet; if not, import them into Excel
 * There are several ways to convert a PDF table to an Excel file:
 * 1) Save the PDF as a Microsoft Word document. Open the *.doc file, copy the table and paste it into an Excel sheet.
 * 2) If this does not work, select the table in the PDF file with the text cursor, copy it and paste it into a text editor (Word). Replace the blanks with tab stops. Copy the document and paste it into an Excel sheet. Columns or rows are sometimes misaligned and must be corrected, as must invalid numbers, names etc. The older the original document, the messier the resulting Excel sheet.
 * 3) If this does not work either, ask a data typist for help ...
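
The "replace the blanks with tab stops" step of option 2 can also be scripted instead of done by hand in Word. The following is a minimal sketch, not part of the established workflow; the function name `blanks_to_tabs` and the `min_blanks` parameter are hypothetical. Setting `min_blanks=2` is an assumption that helps when column headers themselves contain single blanks.

```python
import re


def blanks_to_tabs(text: str, min_blanks: int = 1) -> str:
    """Turn text copied from a PDF table into tab-separated rows."""
    # Match runs of at least `min_blanks` spaces or tabs.
    separator = re.compile(r"[ \t]{%d,}" % min_blanks)
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip empty lines between table rows
        rows.append(separator.sub("\t", line))
    return "\n".join(rows)


copied = "Depth (mbsf)   Age (Ma)\n1.50   0.12\n\n3.00   0.25\n"
print(blanks_to_tabs(copied, min_blanks=2))
```

The result can be pasted directly into an Excel sheet. Misaligned columns and invalid values still have to be checked manually.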

7) Import of data
 * Once all tables of a reference are ready, start the import to Pangaea. If there is more than one table, start the dataset title with the table/supplement/appendix number in parentheses: "(Table 1) Age model of ODP Hole 113-698B".
 * Set the publication year in Pangaea to the publication year of the reference
 * Once all datasets are imported, create a parent that includes all (child) datasets (editor only!)
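
The title rule for references with several tables can be written down as a short sketch. The helper `dataset_title` and its parameters are hypothetical and only illustrate the convention; they are not a Pangaea function.

```python
def dataset_title(label: str, title: str, n_tables: int) -> str:
    """Prefix the title with the table label if the reference has several tables."""
    # Only references with more than one table get the "(Table N)" prefix.
    if n_tables > 1:
        return f"({label}) {title}"
    return title


print(dataset_title("Table 1", "Age model of ODP Hole 113-698B", n_tables=3))
```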

8) Example:

Statistics
Springer
 * 14 journals with 197 publications (153 digitally available)
 * 44 with primary data = 25 % (41 imported)

Elsevier
 * Marine Micropaleontology with 214 publications (all digitally available)
 * 143 with primary data = 67 % (143 imported)
 * Marine Geology: under evaluation
 * Palaeogeography, Palaeoclimatology, Palaeoecology: under evaluation