220 likes | 241 Views
The PDS4 Information Model (IM) Steve Hughes steve.hughes@jpl.nasa.gov NASA Jet Propulsion Laboratory (JPL), California Institute of Technology. Information Model Overview. Developed using: lessons learned from 20 years worth of archiving
E N D
The PDS4 Information Model(IM) SteveHughes steve.hughes@jpl.nasa.gov NASA Jet Propulsion Laboratory(JPL), California Institute ofTechnology
Information Model Overview • Developedusing: • lessons learned from 20 years worth of archiving • best practices for information modeldevelopment • Foundational principles adoptedfrom: • Open Archival Information System (OAIS) Reference Model - ISO 14721 - FoundationalPrinciples • ISO/IEC 11179 - Volume 3 – Metadata Registry Specification - Hierarchy of data dictionaries and governancemodel. • W3C XML (ExtensibleMarkupLanguage) - Rules forencoding documentselectronically. • Drives the PDS4 infrastructure byproviding: • the representation of concepts and their relationships, constraints, rules, andoperations • a sharable, stable, and organized structure of information requirements. • formal definitions that are suitable for configuring and generating code.
Views RepositoryView Community’sView Product Tagged DataObject (InformationObject) <local_identifier>MPFL_M_IMP_IMAGE</local_identifier> <offsetunit="byte">0</offset> <axes>2</axes> <axis_index_order>Last_Index_Fastest</axis_index_order> <encoding_type>Binary</encoding_type> <Element_Array> <data_type>SignedMSB4</data_type> <unit>pixel</unit> </Element_Array> <Axis_Array> <axis_name>Line</axis_name> <elements>248</elements> <sequence_number>1</sequence_number> </Axis_Array> <Axis_Array> <axis_name>Sample</axis_name> <elements>256</elements> <sequence_number>2</sequence_number> </Axis_Array> </Array_2D_Image> Describes Data Object Information Modeler’sView
Roles • Definesthe: • data structure(format) • science interpretation of thedata • context within which the data was captured, processed,and • archived • relationships between thedata acquired from domain experts from each of the sciencedisciplines • Provides a multilevel governanceframework • Common dictionary provides coredefinitions • Discipline and mission levelextensions • Governance authority resides with dictionaryStewards • Provide a single authoritative source for the datastandards • Remains independent of itsimplementation.
IMSoftware • Information Model Tool(IMTool) • Builds the IM master database from a protégé object model, protégé data dictionary database, and configurationfiles. • Extracts the database content and converts it tosystem files in various formats for systemconfiguration. • Local Data Dictionary Tool (LDDTool) -- IMTool–l • Accepts one or more Local Data Dictionary(LDD) definition files(Ingest_LDD). • Validates the syntax, semantics, and modelingapproach • Ingests the LDD(s) into the masterdatabase • Checks consistency against the Commondictionary • Writes systemfiles
Version1.8.0.0- Build7b • Delivered to I&T on March 31,2017. • Documents and artifacts available on PDS4 web site • Information Model Specification(HTML) • ReleaseNotes • Data Dictionary (HTML andPDF) • XML Schema andSchematron • System files in variousformats • JSON, XMI/UML,RDF/OWL • Updated NamespaceRegistry
Changes • CommonDictionary • Made the ASCII numeric data types boundaries moreprecise • Otherchanges • Product_SIP_Deep_Archive • PropertyMaps • JSONfile • Documents
SCR metrics overtime CommonDictionary
DictionaryGovernance • Registration Authority is an entity that has the authorityand responsibility to govern one or moredictionaries. • Steward is a entity consisting of a group of domain experts that is given authority and responsibility by a Registration Authority to govern (i.e.., create and manage) one ormore dictionaries within a domainarea. • One member of the group is identified as aprimary • steward. • The steward in general can create one or morenamespaces within adomain. • Namespace is a “container” for a logical grouping under a single identifier of classes andattributes.
IMGovernance governs RegistrationAuthority Steward Common pds: Discipline rings: img: disp: geom: cart: Mission mgs: ladee: mvn: insight: orex: 1 ISO 14721:2003 - Open Archival Information System (OAIS) ReferenceModel
ModelComponents Common, Discipline and MissionDictionaries
DictionaryGovernance • Dictionary • Name: Common • Registration Authority:0001_NASA_PDS_1 • Steward_id:pds • Steward1: PDS EN Node • Namespace_id:pds • Namespace: http://pds.nasa.gov/pds4/pds/v1 • Dictionary • –Name: Geometry – Registration Authority:0001_NASA_PDS_1 geo PDS GEONode geom http://pds.nasa.gov/pds4/geom/v1 • Steward_id: • Steward1: • Namespace_id: • Namespace: 1 The primary governance entity within the stewardshipgroup
IM V1.9.0.0 ReleaseSchedule • Aug 17, 2017 – All SCRs to be implemented have beenidentified • Aug 17 - 30, 2017 – Finishimplementation • Release IM to developmentdirectory • Internaltesting • Fixbugs • Aug 31 – 13, 2017 – EN, DN, and IPDAtesting • Release IM to developmentdirectory • Perform regression testing atEN • Sep 14 - 28, 2017 – Prepare IM for release to SystemI&T • Sep 29, 2017 – Release IM to releasedirectory • “Start of Build” and System I&T • IM isfrozen • System I&T - SystemTesting • System Release Build
Summary • The PDS4 Information Model and Standards are beingused successfully • Have been adopted by the International PlanetaryData • Alliance(IPDA). • Still work to bedone • DisciplineDictionaries • Leverage the IM in systemdevelopment • Configuration of search engines and otherservices • Development of model-drivensoftware • Desk Assessment of PDS4 against ISO 16363 found that92% of the metrics of the ISO 16363 standard weresatisfied • 80% of the metrics for Governance and OrganizationalViability • 95% of the metrics for Digital ObjectManagement • 96% of the metrics for Infrastructure and Security RiskManagement.
ThankYou Questions andAnswers PDS homepage:https://pds.nasa.gov/ Acknowledgements - This research was carried out at the Jet Propulsion Laboratory, California Institute of Technology, under a contract with the National Aeronautics and SpaceAdministration.
Definitions • “An information model is a representation of concepts, relationships, constraints, rules, and operations to specify data semantics for a chosen domain of discourse.”1 • It provides a sharable, stable, and organized structure of information requirements or knowledge for the domaincontext. 1 Lee, Y. T. 1999. Information Modeling: From Design To Implementation. In Proceedings of the Second World Manufacturing Congress, ed. S. Nahavandi and M. Saadat, 315-321. Canada/Switzerland: International Computer ScienceConventions.
Summary ofChanges to Standards Reference(SR) ALL CHANGES were captured in Change Log and reviewed byDocument ReviewTeam
Summary ofChanges to Data Provider’s Handbook (DPH)
Dictionary Governance ISO1 Standard GovernanceEntities • RegistrationAuthority • Steward • Namespace Common Discipline Mission 1 ISO 14721:2003 - Open Archival Information System (OAIS) ReferenceModel