Intern:Advanced retrieval tool



ART is the acronym for the Advanced Retrieval Tool which is a data mining tool, written in Java, to retrieve individually configured subsets of metadata and/or data from the tables of the relational data model. With ART the user is able e.g. to compare time series from different sites, parameters from different profiles, plotted on a common axis versus time or space or even allows to extract slices in time and/or space for a specific parameter. The use of ART needs some training but provides a powerful tool for the scientific work with data. A context sensitive help system is supplied. ART may only be used by experienced users with knowledge about the Pangaea data model.

The default screen of ART to access the different tables shows a graphic of the simplified data model (see Fig.). Each box in the data model reflects a table of the relational system; arrows reflect some of the relations in a simplified way. Clicking on a box will open its retrieval window, which is configured for its specific type of information. The number of tabs depends on the tables context.

An example is given to retrieve analytical data. After clicking on Data, the user interface will open a window containing five tabs:


 * 1) ‘Retrieval’ – to define the query;
 * 2) ‘List config’ – to configure the output table;
 * 3) ‘List’ – will list the data in a result table after retrieval;
 * 4) ‘Map’ – will plot the data related sites on a map;
 * 5) ‘Plot’ – to visualize data versus space/time (optional).

Retrieval tab

 * In the field Criteria: the list of search criterions, i.ew. names of fields that might be used in this context. This list will vary depending on the table which was selected in the data model.
 * In the middle a list called Operator: is given
 * On the right the field Value: will accept free text (e.g. if Operator=contains) or provide content by selecting Choices... (e.g. if operator=is equal to).

If you click on a row of the search expression (1), the components from which it is made of are highlighted resp. displayed in the lower fields. You might then change this row by selecting different items in the lower fields. Different rows of the search expression are combined by the Boolean operators ‘AND’ and ‘OR’, thus giving you the possibility to do complex retrievals. The operators are located on the left side (5). You can delete rows of the search expression by clicking on that row and subsequently clicking on the ‘Delete line’ button (6). Values (4) are normally typed in. However, in many cases you can also use choice lists, or both, or only choices. Choice lists are useful, if you do not have an idea on how the value could be written. You can enter choices for a search criterion, if the choices button is highlighted or the add choices button is displayed. There are search criterions for which you will get immediately the complete list of choices after clicking the choices button. Other criterions, though, would result in hundreds or thousands of choice items. In this case you will get a further retrieval window, where you can first reduce the choices list by a subretrieval. If you are using the object ‘DATA’ you can press the ‘Add’ button to add parameters to the ‘Criteria’ list (2). You will get another retrieval window, where you can do a subretrieval for the parameters that you want to retrieve data for. The subretrieval window tries to reflect the previously defined search, thus confining the parameter retrieval to what can be expected in this context. E.g., if you have defined ‘Reference is equal to publication xy’, the subretrieval window will show the same search expression and clicking on the ‘Fetch’ button brings back exactly those parameters which are relevant to this publication. Search expressions can be saved and reloaded (7). In the ‘Options’ panel (8) you can set up the number of items to fetch (rows for metainformation, data items for DATA retrievals). On the lower part find the ‘Fetch’ button (9) to start a retrieval.

￼ Figure 13. ART list configuration card for Event retrievals. The configuration card presents a choice list of items to be displayed on the ‘List’ card as the final result of a search. On the left you find a choice list subscribed ‘Metainformation’ with all items that are available for display in the current context (1). On the right a default configuration is shown - named ‘Columns to display’ (2). The lists will vary depending on the object that you selected in the data model. You can add items to the default configuration either by clicking on the item and using the arrow (3) or by double clicking on an item in the left list. Items can also be removed (4). Items in the default configuration can be rearranged through the buttons ‘Up’ and ‘Down’ (5). Each item in the default list is associated with a specific format. Text items can not be configured. For numerical items the total number of digits, the precision, and leading zeros can configured. For date/time choose between a variety of formats (6). Use the ‘Save’ and ‘Load’ buttons (7) for custom configurations. ￼ Figure 15. ART list configuration of a Data retrieval. When using the retrieval for data there are two more fields to select ‘Parameters’ and ‘Geocodes’. Use the ‘Add’ button (8) to add parameters to the ‘Parameters’ list. A second retrieval window, will allow to extract the parameters of interest from the parameter dictionary. The subretrieval window may reflect the previously defined search expression on the ‘Retrieval’ card, thus confining the parameter retrieval to what can be expected in this context. From the list of ‘Geocodes’ you can choose the type of geocoding of your data. The special geocode ‘Age calculated’ can be used for data from geological profiles, which are related to an age model. Using Agem calculate the age is calculated as part of the retrieval, based on an existing age model. The ‘Split by events’ option (9) will return data laterally grouped by events, thus allowing a comparison of profiles between events (max 255 columns). Checking the ‘Split by versions’ option will split different versions of the same parameter. A retrieval is started by using the ‘Fetch’ button (10).

￼ Figure 15. ART List card showing the result of a data retrieval. List card This card displays the result of your retrieval as a text matrix, formatted as give on the ‘List config’ card. button have the functionality (1) to save the list, (2) to open the results in a browser window, (3) to select all rows in the list, (4) to gain more details on list items. Just click on single rows or SHIFT click to select multiple rows. Then click the ‘Details’ button and a further window shows cards with detailed information for each selected row. Sorting of a specific column by values is enabled through clicking in the header row above that column; dragging the delimiters between columns lets you change the horizontal size of the columns. ￼ Figure 16. ART Map card displaying the position of the retrieved data sets. Map card This card gives you an overview on the geographical context of the retrieved information; only available for objects having geographical coordinates. (1) zoom in, (2) geographical retrieval mode’ to define a geographical selection as an added constraint to the ‘Retrieval’ card with latitudes/longitudes as search criterions, (3) detail info opens a card with detailed information for each selected point, (4) zoom out, (5) back to global view, (6) map properties button opens a configuration to choose projection and center, (7) save grafic as Postscript, (8) save as pixel grafic ￼ Figure 17. ART plot card for displaying data columns in a line or scatter graph. Plot card If the first column of a result set contains numerical values and there is at least one further numerical column the ‘Plot’ card displays x-y-plots for each pair of numerical columns. If the values in the first column are continuously increasing or decreasing a line graph is drawn, otherwise a scatter plot. The detail info window ￼ Figure 18. ART detailed info window. The detail info window shows cards with detailed information for each selected. Only one card is displayed, the others are in the background as indicated by tabs. Bring other cards in front by clicking on its tab. Click on the ‘Save’ button to download the information of all cards to your harddisk. Global buttons are "Fetch": To start a retrieval. It is highlighted as soon as the search expression and the configuration is valid, "Close": to close the retrieval window, "?": to switch on context sensitive help.

Retrieval example for events in ART: Looking for gravity cores in the Mediterranean Sea

This example explains the way how to define a retrieval with ART (Advanced Retrieval Tool). Any retrival in one of the tables of the data model works in a similar way.

> start on http://www.pangaea.de
 * click on Data
 * click on ART (the data mining tool of PANGAEA is a Java applet which is now loaded to the local computer; if using ART for the first time, accept requester for the certificate)
 * click on Event (the Event table includes all samples/observations with its metadata)
 * click on the List config card
 * configure your table in Columns to display. You can select the searched criterions by double clicking on an item; sort with the up and down options. (e.g. include elevation and recovery from the "Meta Information" menu, exclude gear type).
 * click on the Map card.
 * select Geographical retrieval mode in the tool bar; select the derived area by dragging the cursor in the global map keeping the mouse button pressed (e.g. Mediterranean Sea). The coordinates of the selected area are shown in the status line below the map.
 * click on Retrieval card (the search area is now defined)
 * click on new line
 * click on and
 * click on gear type in Criteria and on is equal to in the Operator list
 * Choices... gives you a list of all gear types defined in the system.
 * select Gravity corer and click ok
 * click on Fetch - the job is beeing processed.
 * the List card shows the results
 * the Map card shows the sites; use the Zoom mode to zoom in
 * select the Detail info mode i and click on one of the dots to get detailed information.

You may save the list from the List card to your harddisk by clicking on the disk symbol in the tool bar or open the results in a new browser window which can also be saved to disc.

You can import the results into the program PanMap to draw your individual map. This implies that you have Latitude and Longitude in the first two columns of the list.