NOTICE: Historic website, not updated.

The presentations from the E-ARK Final Conference are all available for download from here: http://www.eark-project.com/resources/conference-presentations/finconfpres (click on hyperlink to go directly to the Presentations Folder)

 

General Model (External Site)

      

E ARK overall arch 

 

 E ARK Benefits

 

 

 

 

 

 

 

 

 

 

 

 

All E-ARK tools are available via our GitHub site.

Pre-Ingest Tools 

This section lists the tools which are capable of preparing SIPs according to the E-ARK SIP specification. 

ERMS Export Module

The ERMS Export Module is a tool which allows for the controlled and selective export of records and their metadata from any CMIS 1.0 compliant records or content management system.

The tool allows users to set up a CMIS connection to the source system, acquire an overview of the records and their aggregations, select required records, and finally export selected records and their metadata into the structure defined by the E-ARK Common Specification. As of now, the module supports the mapping of metadata into EAD3 format. Technically it’s possible to create mappings into any destination format.

The module consists of two separate technical components, the E-ARK ERMS Export Bridge as the backend component and the ERMS Export UI Module for the user interface.

RODA-in

RODA-in is a tool specially designed for producers and archivists to create Submission Information Packages (SIP) ready to be submitted to an Open Archival Information System (OAIS). The tool creates SIPs from files and folders available on the local file system.

In version 2 we revolutionized the way SIPs are created to satisfy the need for mass processing of data. In this version you can create thousands of valid SIPs with just a few clicks, complete with data and metadata.

Installable package of RODA-in is available at http://rodain.roda-community.org/ 

ETP (ESSArch Tools for Producer)

ETP is a tool designed to support the creation of Submission Information Packages (SIPs) by the data producer, and their delivery to archival repositories. The tool is highly configurable and allows for the setup of complex archival description profiles and validation. The tool is fully compliant with the E-ARK SIP specification.

Download and installation instructions of ETP are available at http://etp.essarch.org/.

ETA (ESSArch Tools for Archives)

ETA is the archives counterpart of ETP, developed to receive and validate packages created according to the E-ARK SIP Specification.

 

Database preservation tools

This section lists the tools and components which allow for the migration of live databases into the SIARD2 preservation format and reuse of databases stored in the SIARD2 format.

Database Preservation Toolkit (DBPTK)

The Database Preservation Toolkit allows conversion between Database formats, including connection to live systems, for purposes of digitally preserving databases. The toolkit allows conversion of live or backed-up databases into preservation formats such as SIARD 2.0, a XML-based format created for the purpose of database preservation. The toolkit allows also to upload the archived database into a live DBMS (Oracle, MySQL etc.).

Further information and the executable application can be found at http://www.database-preservation.com/.

dbptk-gui-backend

RESTful service to control the DBPTK by a graphical user interface.

db-visualization-toolkit

Relational Database Viewer for databases based on SIARD 2

 

Digital repository solutions 

This section lists the repository solutions provided by E-ARK partners which implement the E-ARK Information Package specifications. 

RODA

RODA is a complete digital repository that delivers functionality for all the main units of the OAIS reference model. RODA is capable of ingesting, managing and providing access to the various types of digital objects produced by large corporations or public bodies. RODA is based on open-source technologies and is supported by existing standards. 

Further information about RODA can be found at http://www.roda-community.org/.

ESSArch Preservation Platform

ESSArch Preservation Platform - an practical implementation of the OAIS (Reference Model for an Open Archival Information System).

Further information about the platform is available at http://www.essolutions.se/ESSArch

 

E-ARK Integrated Platform components 

The E-ARK Integrated Platform is a reference implementation of all E-ARK specifications. The main component of the Integrated Platform is the E-ARK Web which is a digital preservation system supporting all OAIS functions. The E-ARK Web can be extended with additional components for scalable computing, data mining and natural language processing.

E-ARK Web

E-ARK Web is an open source web-based archiving, digital preservation and access system. It is OAIS-oriented which means that data ingest, archiving and dissemination functions operate on information packages bundling content and metadata in contiguous containers.

Access to a demo installation of E-ARK Web is available upon request. If interested, please contact us!

The following components listed in this section are optional add-ons which primarily provide support for scalable infrastructures (i.e. Lily and Hadoop technologies).

dm-etl

Tool of the full-scale E-ARK deployment to extract content from AIPs and load content into Lily.

dm-parent

Common Maven configuration of the Java-based tools used in the E-ARK full-scale deployment.

dm-nlp

The data mining and natural language processing modules used in a cluster environment (Hadoop).

dm-text-classification

MapReduce job to run text classifier

dm-hdfs-storage

A tool of the full-scale E-ARK deployment which provides a REST service for transfering large files into HDFS.

dm-file-ingest

The map/reduce file ingest tool of the full-scale E-ARK deployment. It unpackages TAR packaged E-ARK information packages and initiates the indexing of the individual files using the Lily API. The Java-based tool runs as a service and consumes RabbitMQ messages notifying about new packages being available for indexing in HDFS.

end_user_gui

Prototype for end user's interface 

dm-hdfs-storage-client

Java example client for the file upload service (dm-hdfs-storage)

global-configuration

Shared Maven and other global configuration parameters of the Java-based tools.

denormalize-db

E-ARK WP6 - reference implementation: generate csv from archived db

 

Access tools or components 

Peripleo

A search engine for the Pelagios universe, with a comprehensive JSON API.

E-ARK CMIS Viewer

The E-ARK CMIS Viewer facilitates remote access to any CMIS 1.0 compatible digital preservation system.

Access Software Platform

The E-ARK Order Management Tool allows for querying, ordering and viewing of E-ARK Dissemination Information Packages from any compliant repository. The tool consists of a backend, the Order Management Service, and a frontend called the E-ARK Platform UI.

 

Other tools or components 

This section lists all other E-ARK sub-projects available to date. 

EAD Editor

The EAD Editor is a tool which allows for simple form-based editing and creation of EAD3 compliant archival descriptions

Data Warehouse and OLAP

This project includes all documentation about the E-ARK data warehousing and OLAP pilot. 

commons-ip

The RODA Commons IP provides a generic API to manipulate Information Packages in the E-ARK Common Specification format.

E-ARK Information Package examples

Specification and example of the Submission, Archival and Dissemination Information Packages.

SIARD2 format

SIARD 2 Format specification and examples.

Latest News

RSS 2.0 Feed - Latest News

@EARKProject on Twitter

For the latest EARK news follow us on Twitter @EARKProject