OAIS. Rathachai Chawuthai rathachai.chawuthai@live.com . Information Management CSIM / AIT. Agenda. Preface Overview Data Model Function Model Architecture Model OAIS in use. Preface. Starter. I want to build own restaurant . What should I do?. Starter. What you should have.
OAIS Rathachai Chawuthai rathachai.chawuthai@live.com Information Management CSIM / AIT
Agenda • Preface • Overview • Data Model • Function Model • Architecture Model • OAIS in use
Starter I want to build my own restaurant. What should I do?
Starter What you should have
Starter What you should plan
Starter How you should run
Starter I won’t give you a blueprint or a concrete model for running a restaurant. Instead, I will guide you on WHAT and HOW you have to consider when you plan to run a restaurant business.
Introduction I want to build an archival information system. What should I do?
Introduction Understand OAIS reference model
Overview • Open Archival Information System • In 2000 the Research Libraries Group (RLG) and Online Computer Library Center (OCLC) discussed how both organizations could build an infrastructure for archiving digital objects. • It guides you in building an archival information system
Overview • Purpose • Model a system for archiving information represented in digital format, for long-term preservation • Scope • Framework for long-term preservation and access • Terminology • Architectures and operation • Preservation strategies and techniques • Data model
Overview • Primary functions • To preserve digital resources over an extended period of time • To provide user access to the information in the archive
High Level Concept • Consumer: person(s) or client systems who interact with the OAIS system and services • Producer: person(s) or client systems who provide the information to be preserved • Management: person(s) who set the overall policy of the OAIS. Management is separate from administrative functions
Archive External Data Workflow • Producer • Ingests digital resources into the system • Management • Monitors and verifies digital resources, does preservation planning, migrates digital resources, etc. • Consumer • Searches and accesses digital resources in the repository
Package model • 4 simple information concepts • Content Info • A digital resource that needs to be preserved, e.g. text, image, video, sound, … • Preservation Description Info (PDI) • Contains preservation metadata that tells humans or machines what they need to consider when they access, render, or otherwise act on the digital resource. • Archive Packaging Info • A package that wraps both Content Info (1) and PDI (2) so they are stored as one object • Info of Archive Package • Acts as metadata for the Archive Packaging Info (3). • It lets a search engine answer queries without the costly step of extracting the archive package to read Content Info or PDI directly
Package Model • Content Information: • The original target of preservation. • A physical or digital object and its Representation Information. [Figure: Package 1 holds Content Information and Preservation Description Information (PDI), bound by Archive Packaging Information, with Descriptive Information about Package 1 alongside]
Package Model • Preservation Description Information (PDI): • What is needed to preserve the Content Information • Provenance • For reliability • Source of content • History • Context • Environment needed to render • Reference • Refers to things outside, e.g. ISBN • Fixity • Checksum, MD5, …
Package Model • Descriptive Information: • Information used to discover which package has the Content Information of interest • Full set of attributes that are searchable in the catalog service • Indexing this information may improve search performance
Package Model [Figure: Package 1 and the Descriptive Information about Package 1]
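The package model slides above can be sketched as a small data structure. This is only an illustration under stated assumptions: the class names, field names, and sample values (such as `hypothetical-id-001`) are invented here and are not defined by OAIS.

```python
from dataclasses import dataclass, field


@dataclass
class ContentInformation:
    data_object: bytes          # the bits to preserve
    representation_info: str    # how to interpret the bits, e.g. a format name


@dataclass
class PreservationDescriptionInformation:
    reference: str                                        # identifier pointing outside the archive
    provenance: list[str] = field(default_factory=list)   # history of the content
    context: str = ""                                     # relationship to its environment
    fixity: str = ""                                      # checksum used to detect changes


@dataclass
class InformationPackage:
    content: ContentInformation
    pdi: PreservationDescriptionInformation
    packaging_info: str         # how content and PDI are bound together (assumed: a container name)


@dataclass
class DescriptiveInformation:
    package_id: str
    attributes: dict[str, str]  # searchable catalog attributes


# Hypothetical example values, for illustration only.
package = InformationPackage(
    content=ContentInformation(b"\xff\xd8\xff", "JPEG image"),
    pdi=PreservationDescriptionInformation(reference="hypothetical-id-001"),
    packaging_info="zip",
)
description = DescriptiveInformation("package-1", {"title": "Thailand Map"})
```

Keeping `DescriptiveInformation` outside the package mirrors the slide: the catalog can be searched without opening the package itself.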
Information Model • Content Information • A basic unit of information that contains data and its Representation Information. • For example, it can be a “Thailand Map”
Information Model • Data Object • The object that needs to be preserved. • It can be either a physical thing in the real world or a digital object whose content is a bit string. • In this case, it can be the file content (1010101010101110101 000101010101001010…..) of an image file of the Thailand Map • On its own, it is just a string of bits that has no meaning unless someone can interpret it.
Information Model • Representation Information • A bit string (100111010…) may be useless if no one knows its meaning. The Representation Information states what the structure of “100111010…” is and how to interpret it. • It may itself need Representation Information if the data object has a complex structure or is encoded at several levels [Figure: a raw bit string is formatted into bytes, parsed into the JPEG format structure, and interpreted as pixel colors to form a picture]
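To make the point about Representation Information concrete, here is a minimal sketch in which the same bytes only become meaningful once a layout is supplied. The layout string, the field names `width` and `code`, and the sample bytes are all assumptions chosen for illustration.

```python
import struct

# Six bytes with no inherent meaning.
raw = bytes([0x00, 0x00, 0x01, 0x00, 0x48, 0x69])

# Hypothetical representation information: a binary layout plus field names.
representation_info = {
    "layout": ">IH",  # big-endian: one 4-byte unsigned int, then one 2-byte unsigned int
    "fields": ("width", "code"),
}


def interpret(data: bytes, rep_info: dict) -> dict:
    """Decode the bytes according to the supplied representation information."""
    values = struct.unpack(rep_info["layout"], data)
    return dict(zip(rep_info["fields"], values))


record = interpret(raw, representation_info)

# A different representation reads the last two bytes as ASCII text instead.
text = raw[4:].decode("ascii")
```

The two results differ even though the bits are identical, which is why the archive has to keep the Representation Information alongside the Data Object.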
Information Model • PDI contains • Reference Info • Identifier that links to something outside the system or to a real-world resource, such as an ISBN • Provenance Info • To indicate how reliable the digital resource is • To record the origin or source of the content • To record the history of changes • To record the migration process • Context Info • To record why, where, and how the digital resource was created • Including the software and environment that created it • Fixity Info • To provide the information needed to verify that the digital resource has not been altered • E.g. a checksum such as MD5
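The Fixity Info item can be illustrated with Python's standard `hashlib` module. This is a minimal sketch: the function names and the sample bytes are assumptions, and SHA-256 is used as the default here where the slide mentions MD5.

```python
import hashlib


def compute_fixity(data: bytes, algorithm: str = "sha256") -> str:
    """Return the hex digest stored as Fixity Info for the given content."""
    return hashlib.new(algorithm, data).hexdigest()


def verify_fixity(data: bytes, expected: str, algorithm: str = "sha256") -> bool:
    """Recompute the digest and compare it with the stored value."""
    return compute_fixity(data, algorithm) == expected


original = b"Thailand Map image bytes"      # stand-in for real file content
stored_checksum = compute_fixity(original)  # recorded at ingest time

intact = verify_fixity(original, stored_checksum)
corrupted = verify_fixity(original + b"\x00", stored_checksum)
```

Here `intact` is true and `corrupted` is false, since a single extra byte changes the digest.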
Information Model Example of PDI components
Information Model Big picture of the information model of a package after all components are combined
Information Package • SIP (Submission Information Package) • The form of package that is suitable for the producer to ingest into the system. • Typically, a SIP contains Content Info and PDI. • Multiple SIPs may be associated with the same PDI. • AIP (Archival Information Package) • The form of package that is suitable for storage in the system. • One or more SIPs are transformed into an AIP that has a complete set of PDI for the associated Content Info. • An AIP may be a collection of AIPs. • DIP (Dissemination Information Package) • The form of package that is suitable for dissemination to the consumer. • An AIP is transformed into a DIP for sharing purposes. • A DIP may contain one or more AIPs and may not have the complete set of PDI.
Information Package [Figure: the Producer ingests a SIP; the Administrator stores it as an AIP; the Consumer queries or accesses the archive and receives a disseminated DIP]
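The SIP, AIP, and DIP transformations above could be sketched as two small functions. Everything named here is hypothetical: the classes, the `REQUIRED_PDI` tuple, and the rule that ingest fills in a missing fixity value are assumptions for illustration, not behavior mandated by OAIS.

```python
import hashlib
from dataclasses import dataclass


@dataclass
class SIP:
    content: bytes
    pdi: dict


@dataclass
class AIP:
    content: bytes
    pdi: dict


@dataclass
class DIP:
    content: bytes
    pdi: dict


# Assumed definition of a "complete set of PDI" for this sketch.
REQUIRED_PDI = ("reference", "provenance", "context", "fixity")


def ingest(sip: SIP) -> AIP:
    """Turn a SIP into an AIP, completing and checking its PDI."""
    pdi = dict(sip.pdi)
    # Assumption: the archive computes fixity itself when the producer omits it.
    pdi.setdefault("fixity", hashlib.sha256(sip.content).hexdigest())
    missing = [key for key in REQUIRED_PDI if key not in pdi]
    if missing:
        raise ValueError(f"SIP lacks PDI fields: {missing}")
    return AIP(sip.content, pdi)


def disseminate(aip: AIP, wanted=("reference",)) -> DIP:
    """Build a DIP that carries only the PDI fields the consumer asked for."""
    return DIP(aip.content, {key: aip.pdi[key] for key in wanted})


sip = SIP(
    b"map",
    {
        "reference": "hypothetical-id-001",
        "provenance": "scanned by producer",
        "context": "part of a map collection",
    },
)
aip = ingest(sip)
dip = disseminate(aip)
```

The DIP deliberately carries less PDI than the AIP, matching the slide's note that a DIP may not have the complete set.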
1) Ingest • Accept SIPs from Producers • or from internal elements under Administration control • Prepare the AIPs for archive storage
2) Archival Storage • Storage of AIPs • Maintenance of AIPs • Retrieval of AIPs
3) Data Management • Populate • Descriptive Information • Administrative Data • Maintain • Descriptive Information • Administrative Data • Access • Descriptive Information • Administrative Data
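The populate, maintain, and access duties of Data Management can be sketched as a tiny catalog with a word index over Descriptive Information. The `Catalog` class, its method names, and the package identifiers are assumptions made up for this example.

```python
from collections import defaultdict


class Catalog:
    """Toy store of Descriptive Information with a word-level index."""

    def __init__(self):
        self._records = {}              # package_id -> attributes
        self._index = defaultdict(set)  # word -> set of package_ids

    def populate(self, package_id, attributes):
        """Add a record, or replace it if the package is already cataloged."""
        self.remove(package_id)
        self._records[package_id] = dict(attributes)
        for value in attributes.values():
            for word in value.lower().split():
                self._index[word].add(package_id)

    def remove(self, package_id):
        """Drop a record and its index entries, if present."""
        old = self._records.pop(package_id, None)
        if old:
            for value in old.values():
                for word in value.lower().split():
                    self._index[word].discard(package_id)

    def search(self, word):
        """Return the ids of packages whose attributes contain the word."""
        return sorted(self._index.get(word.lower(), set()))


catalog = Catalog()
catalog.populate("aip-1", {"title": "Thailand Map"})
catalog.populate("aip-2", {"title": "Bangkok Map"})
before = catalog.search("map")

# Maintaining a record updates the index as well.
catalog.populate("aip-2", {"title": "Bangkok Guide"})
after = catalog.search("map")
```

Searching goes through the index only, so no archive package has to be opened to answer a query.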
4) Administration • Solicit and negotiate submission agreements • With producers • Audit submissions • To ensure that they meet the standards • Maintain configuration management of • System hardware • Software • Day-to-day governance of the other OAIS functional entities
5) Preservation Planning • Monitor the environment of the OAIS • Provide recommendations so that information • Remains accessible • Over the long term • Even if the original computing environment becomes obsolete
6) Access • Determine the • Existence • Description • Location • Availability of information in the OAIS • Allow Consumers to • Request • Retrieve information products
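One way to picture the monitoring and recommendation duties is a check of stored formats against a watch list. This is a sketch under assumptions: the format names, the watch list, and the two recommendation labels are invented for illustration, and a real archive would define its own.

```python
# Hypothetical watch list of formats considered at risk of obsolescence.
AT_RISK_FORMATS = {"legacy-format-x", "legacy-format-y"}


def recommend(holdings: dict[str, str]) -> dict[str, str]:
    """Map each package id to a recommendation based on its format."""
    recommendations = {}
    for package_id, fmt in holdings.items():
        if fmt.lower() in AT_RISK_FORMATS:
            recommendations[package_id] = "migrate"
        else:
            recommendations[package_id] = "keep"
    return recommendations


holdings = {"aip-1": "Legacy-Format-X", "aip-2": "current-format"}
plan = recommend(holdings)
```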
Summary What you should have
Summary What you should plan
Summary How you should run
References • http://www.oclc.org • http://www.oclc.org/research/pmwg/pm_framework.pdf