1 / 29

Node Recovery

Node Recovery. Day 3, Session 6 MetaArchive Distributed Digital Preservation Workshop. Presented by Chris Helms. Session Learning Objectives. Restoring a lost cache. Hardware/Software fault recovery Archival Units Archival Unit Configuration. Restoring a lost primary node.

maryharris
Download Presentation

Node Recovery

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Node Recovery • Day 3, Session 6 • MetaArchive Distributed Digital Preservation Workshop Presented by Chris Helms

  2. Session Learning Objectives

  3. Restoring a lost cache • Hardware/Software fault recovery • Archival Units • Archival Unit Configuration

  4. Restoring a lost primary node • Preparing for Configuration Node Recovery • Recovering the Title Database • Recovering the Keystore • Reconfiguring the Nodes in the Network

  5. Restoring data from the cache • Viewing Cached Content • Using your LOCKSS node as a cache proxy • Proxy configuration options • Cached AU retrieval and proxy testing

  6. Restoring a Lost Cache • By design, LOCKSS stores harvested material in a cache directory. LOCKSS will maintain the cache’s integrity through peer polling and voting with other members of your LOCKSS private network. Occasionally an outside occurrence such as a power failure, problematic hardware, or a piece of errant coding can corrupt a file system leaving the LOCKSS cache in an unusable state.

  7. Archival Units • Archival Units may be likened to a run of journals or a web-based collection containing images, videos, and text. Long term access to these AU’s is insured via LOCKSS harvesting and the polling of loyal peers. The concept of having loyal peers insures that data may be gained from other peer nodes if the original content is no longer available from the publisher.

  8. Archival Configuration Adding Titles (manual)

  9. Title Selection

  10. Archival Configuration Adding Titles (backup file) Default cache config file = BatchAuConfig

  11. Restoring a Lost Primary Node • In order for LOCKSS to harvest and preserve data, certain configuration files must remain available via a primary node. These are the title database and public keystore. As a convenience, these files are stored within the conspectus database and are readily available from any participating LOCKSS node.

  12. Recovering the Title Database

  13. Recovering the Keystore

  14. Reconfiguring the Nodes in the Network Creation and setup of the new web root directory and Apache configuration files. Modify the following line: Apache graceful restart

  15. Update the lockss.xml file

  16. Run hostconfig on all member nodes and modify the “Configuration URL:”

  17. Restart LOCKSS • /etc/init.d/lockss stop • /etc/init.d/lockss start

  18. Restoring Data From the Cache • One of the key benefits of participating in a LOCKSS private network is having reliable, long-term access to your material. When available access to the original content is no longer available, the LOCKSS cache proxy server can be used to serve its harvested content. This harvested content may be pulled from any node participating in your LOCKSS private network. As we have seen in the previous section, harvested content may span archival units, key configuration files, and plugins.

  19. Viewing cached content

  20. Using your LOCKSS node as a cache proxy • A LOCKSS node runs a cache proxy that will serve cached content if the original content is no longer available. If the original content is still available, the LOCKSS node will proxy the request to the original site.

  21. Proxy access control

  22. Proxy Options Notes: 1. The audit proxy serves only cached content, and never forwards requests to the publisher or any other sites. By configuring a browser to proxy to this port, you can view the content stored on the cache. All requests for content not on the cache will return a “404 Not Found” error. 2. The ICP server responds to queries sent to other proxies and caches to let them know if this cache has the content they are looking for. This is useful if you are integrating this cache into an exiting network structure with other proxies and cache that support ICP.

  23. Proxy Info

  24. Excerpt from a PAC file:

  25. Adding a PAC file to Mozilla Firefox 1. Windows: Tools/Options MAC OS X: File/Preferences 2. Advanced/Network/Settings

  26. Questions ?

More Related