NetarchiveSuite at the BNE
Juan Carlos García Arratia – Chief of IT Development Service, NLS
Mar Pérez Morillo – Chief of Web Service, NLS
IIPC GA Paris 2014, 22nd May
Table of contents
• Starting point
• Non-print Legal Deposit regulation
• Agreement with Red.es
• Why NAS?
• Installation of test environment
• First test crawls
• Specifications and needs
• What we expect
1. Starting point
• With Internet Archive (2009-2013):
  • 8 domain crawls
  • 2 selective crawls:
    • General Elections 2011
    • Humanities
  • Total: ± 100 TB
  • Collection delivery (Red.es)
2. Non-print Legal Deposit Regulation
• Allegations answered
• Ministry of Education and Culture
• Stakeholders (publishers and content providers)
• Public information
• Last step of the process
• Enactment expected by the end of the year
3. Red.es
• Strong investment:
  • Network
  • Storage
  • Servers
4. Why NetarchiveSuite?
• Other tools considered
• Whole life cycle covered
• First workshop on NAS in Vienna
• BnF, as a model:
  • Legal deposit law
  • Similar starting point with IA
  • National domain
  • Size of the French web
• Ability to share tools and experiences
• Community of users and developers (Denmark, Austria)
• Modularity
5. Installation of test environment
• Summer 2013: pilot installation on 6 servers
• 1st crawl: internal administrative network
• Lack of documentation
• Problems understanding configuration profiles
• Need for strong network security
6. First test crawls
• Mining Historical Archive (Archivo Histórico Minero)
• Death of Adolfo Suárez
• Death of Gabriel García Márquez
6.1. Archivo Histórico Minero
• www.archivohistoricominero.org
6.3. Ongoing crawls
• Regional governments' proposals
• European Elections
NAS at the BNE http://bns08.bne.local/HarvestDefinition/Definitions-selective-harvests.jsp
7.1. National cooperation
• Designing a workflow in a collaborative environment
• Legal deposit purposes
• Using the administrative network
• Need for a common interface for proposals
7.2. Web content curators at the Library
• Internal and external curators
• Interface to share
• Easier to use for librarians
• New reason to choose NAS…
8. What we expect
• Better understanding of templates and configuration
• Dashboard for Quality Assurance:
  • to show key figures for monitoring crawl status
• Modularity
• Fine tuning