210 likes | 239 Views
This presentation discusses the implementation of digital preservation at the European Commission, focusing on the transition from physical records to digital archives. It covers topics such as the preservation actions, technical setup, repository structure, metadata model, and storage policy. The presentation also provides information on the complexity of the ingest process and the sources involved in the digital preservation efforts.
E N D
From records to archivesImplementation of digital preservation at the EC Lieven BAERT European Commission SIO meeting @ NATO 21/05/2019
The Archives team since 1983 RÈGLEMENT (CEE, EURATOM) N° 354/83 DU CONSEIL du 1er février 1983 concernant l'ouverture au public des archives historiques de la Communauté économique européenne et de la Communauté européenne de l'énergie atomique
Movie https://audiovisual.ec.europa.eu/en/video/I-152983
History – 1967 MergerTreaty EuropeanCommunities European Union
Right of initiative • Policy and budget implementation • Guardian of Treaties • International representation
Digital Preservation Analysis start 2012 Proof of Concepts 2013 Licences obtained 2014 Production release 2017 Part of the e-Domecpolicy (HPS)
Digital Preservation Technical setup Repository structure Metadata model Information package Access Source systems Storage policy Preservation actions Security/visibility Pre-ingest RM integration Ingest AMS integration Reporting DP plan
HPS metadata model Generic concepts SIP <…> Preservica XIP XML </…> <…> HPS metadata XML </… RM metadataanalysis
General ingest workflow Commission service ← →Historical Archives Service Ingest actions Ingestsuccess Pre-ingest actions • Structure • Preservationmetadata • FT search • Source data • Validate • [accept Paper] • Accept SIP • Runingest • [New source system ?] • Select content • Validate • Requesttrf Digital objects Size (GB) 5.157.635 2.083
in production Sources - HAN 90ies today 45M docs, 65M digital objects, 70TB data 60ies
in production Sources - HAN +1.000 transfers ca. 600.000 documents ca. 9.000 dossiers Ingestprocessbuilt-in Complexity of 2-way communication
in production Sources – Shared drives Only for Cabinets (Barroso I and Barroso II) 67 GB + 876 GB 220.000 + 2.200.000 objects Ingest via specifictool Limited « quality » Limited access
in production Sources - Sybil Application to manage the President’s mail 2004-2011 ca. 40.000 records + 100.000 objects Mapping to metadata model SIP creation by local IT team
being tested Sources - Adonis Former RM system (1995-2010) – ca. 150 databases 20.000.000 records 15.000.000 objects 600.000 dossiers « Retro » scanning High appraisal effort Specific SIP creator
being tested Sources – ARCHIS-Scanning • Internaldigitisation • At dossier level: 16.500 • At image (page): 4.400.000 • Around 35 TB of data • Complexmapping: • Scanning technical data • Archival description data BAC-0101-1999-0311, p.29
to be analysed Sources – External scanning • Externaldigitisationsince 2013 • COM numbers: 33.000 • Images (pages): 4.500.000 • roughly 50 TB of data • Complexmapping: • Publications Office XML format • Archival description data
to be analysed Sources – future sources • Bilateral discussion to discuss major IS of DG’s • Ongoing • In parallel: Digital Preservation Plan • Definevarious aspects of digital preservation • Implement (technical) archiving solutions • Createoverview of IS 1120 Commission information systems are registered in the portfolio management system, among them, 754 are “operational”. (source : GOVIS – feb 2018)
Thankyou lieven.baert@ec.europa.eu http://ec.europa.eu/historical_archives/index_en.htm