Technology

The technology of Pangaea is based on a three-tiered client/server architecture with a data set cache:


 * 1) A relational database serves as the backend and central archiving system, using SYBASE as the RDB management software on a multiprocessor computer.
 * 2) The middleware is an application server with open server components for import and other CTLib/Java tools for retrieval and editing. The search interface "PANGAEA Search" is driven by the open source software panFMP, which is based on the full-text search engine Lucene. All components are encapsulated and communicate through standard interfaces.
 * 3) On the frontend, different clients provide access to the system. The graphical user interface (GUI) for data upload and metadata definitions uses 4th Dimension software (ACI) running on a Windows server (4D client for Mac OS X and Windows). A web server hosts the domains and web services for data retrieval, download and harvesting. Middleware and frontend components follow a generic model to ensure flexible functionality and easy modification. The system has an Internet connection of 1 Gbit/s.
 * 4) Data sets are stored in their original configuration in the database cache in XML format. The cache is updated by background services.
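
The relationship between the relational backend and the XML data set cache can be illustrated with a minimal sketch. It is not the actual Pangaea implementation: the in-memory `DATABASE` dictionary stands in for the relational backend, and the XML layout as well as the names `to_xml`, `refresh_cache` and `fetch` are assumptions made purely for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical stand-in for the relational backend: data set id -> rows.
DATABASE = {
    "ds-1": [
        {"depth_m": "0.5", "temperature_c": "3.2"},
        {"depth_m": "1.0", "temperature_c": "3.0"},
    ],
}

# The cache maps a data set id to its serialized XML document.
CACHE = {}


def to_xml(dataset_id, rows):
    """Serialize one data set into an (assumed) XML layout."""
    root = ET.Element("dataset", id=dataset_id)
    for row in rows:
        record = ET.SubElement(root, "record")
        for name, value in row.items():
            ET.SubElement(record, "value", name=name).text = value
    return ET.tostring(root, encoding="unicode")


def refresh_cache():
    """One pass of a background service: rebuild every cached document."""
    for dataset_id, rows in DATABASE.items():
        CACHE[dataset_id] = to_xml(dataset_id, rows)


def fetch(dataset_id):
    """Serve a data set from the cache only; returns None if not cached."""
    return CACHE.get(dataset_id)
```

In this sketch, clients never query the backend directly: a download request calls `fetch`, while `refresh_cache` would be run periodically by a scheduler so that the cached XML follows changes in the database.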

Data sets of small to medium size (up to a few million items) are stored in two tables, one for numeric values and one for string values, both organized through an index tree. Larger data sets or binary objects (e.g. photos, seismic data) are stored as files linked to their metadata description and georeference in Pangaea. The file archive consists of two hierarchical storage systems (HS), each combining hard disks and tape drive silos, located in different buildings 1 km apart. The two HS mirror each other and have a capacity of 60 TB. An incremental backup runs every day and a full backup once per week. The system is technically operated by the AWI Computer Center.
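
The split into one table for numeric values and one for string values can be sketched as follows. This is an illustration under stated assumptions, not the real schema: SQLite (whose indexes are B-trees) replaces SYBASE, and all table, column and function names are hypothetical.

```python
import sqlite3


def create_store():
    """Create the two value tables, each with an index for lookups."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE numeric_values (
            dataset_id INTEGER, row_no INTEGER, parameter TEXT, value REAL);
        CREATE TABLE string_values (
            dataset_id INTEGER, row_no INTEGER, parameter TEXT, value TEXT);
        CREATE INDEX idx_numeric ON numeric_values (dataset_id, row_no);
        CREATE INDEX idx_string ON string_values (dataset_id, row_no);
        """
    )
    return conn


def store_value(conn, dataset_id, row_no, parameter, value):
    """Route a value to the numeric or the string table by its type."""
    is_number = isinstance(value, (int, float)) and not isinstance(value, bool)
    table = "numeric_values" if is_number else "string_values"
    stored = value if is_number else str(value)
    conn.execute(
        f"INSERT INTO {table} VALUES (?, ?, ?, ?)",
        (dataset_id, row_no, parameter, stored),
    )


def load_dataset(conn, dataset_id):
    """Merge both tables back into one list of rows, ordered by row_no."""
    rows = {}
    for table in ("numeric_values", "string_values"):
        query = (
            f"SELECT row_no, parameter, value FROM {table} "
            "WHERE dataset_id = ?"
        )
        for row_no, parameter, value in conn.execute(query, (dataset_id,)):
            rows.setdefault(row_no, {})[parameter] = value
    return [rows[key] for key in sorted(rows)]
```

A data set with mixed columns is thus written cell by cell with `store_value` and reassembled with `load_dataset`; the index on `(dataset_id, row_no)` keeps retrieval of a single data set from scanning the whole table.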