1 / 7

NetarchiveSuite

NetarchiveSuite. Sabine Schostag The Netarchive sas@statsbiblioteket.dk. How we use NetarchiveSuite. Questions and answers on NetarchiveSuite :

alice
Download Presentation

NetarchiveSuite

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. NetarchiveSuite Sabine Schostag The Netarchive sas@statsbiblioteket.dk

  2. How we use NetarchiveSuite Questions and answers on NetarchiveSuite: • lifecycle: Whataspects of the web archivinglifecycle model does the tool cover? Whataspects of the model wouldyoulike to/do youintend to buildinto the tool? Whatfunctionalitydoes the tool provide thatisn'treflected in the model? • development: Whatresourcesarecommitted to the tool'songoingdevelopment? Whatare major features in the roadmap? Is the code open source? • adoption: What is the user base for the tool? How environment-specific is the tool as opposed to readilyreusable by otherorganizations? • functionality: Whatare the tool'sunique features? Whatareitsshortcomings?

  3. NetarchiveSuite Lifecycle What aspects of the web archiving life cycle model does the tool cover? What aspects of the model would you like to/do you intend to build into the tool? Extended documentation, Search functions, time schedules ≤ 1 hour What functionality does the tool provide that isn't reflected in the model? Time schedules min: once an hour – max ??

  4. NetarchiveSuite • development: • What resources are committed to the tool's ongoing development? • 2,6 MP • What are major features in the roadmap? • Technical improvements, • Upgrade to or support Heritrix 3, • Replacing current NetarchiveSuite Archive module • Better integration of documentation • Is the code open source?https://sbforge.org/display/NASDOC42/NetarchiveSuite+Overview

  5. NetarchiveSuite • adoption: What is the user base for the tool? How environment-specific is the tool as opposed to readily reusable by other organizations? • Even though the NetarchiveSuite software is developed in Java, and therefore is mostly platform independent, we do have a couple of external calls to the Unix sort command. The parts of our software using this external command therefore only run on Linux/Unix, or Windows with Cygwin installed. • Se installation manual: https://sbforge.org/display/NASDOC42/Installation+Overview

  6. NetarchiveSuite • Functionality: What are the tool's unique features? What are its shortcomings? • Multifaceted aplication • Selective Harvests • Snapshot Harvests • Domains • Schedules • Extended fields • Heritrix GUI Access • Global Crawler Traps • Harvest History • Harvester Templates • Quality Assurance • System State • Bit Preservation • See: https://sbforge.org/display/NASDOC42/User+Manual

  7. NetarchiveSuite Netarchiveuse of NAS /overview • Broad crawls • Selective crawls • ”Selective crawls” • Event crawls • Special crawls (e.g. upon a scholarswish) • Focused crawls: Social media (special templates), verybig sites,..

More Related