1 / 13

Measurement Data Object Descriptor Specification - Present Status

Measurement Data Object Descriptor Specification - Present Status. Giridhar Manepalli Corporation for National Research Initiatives. Measurement Data - Metadata. Measurement Data Object Descriptor (MDOM) describes I&M data captured or streamed during GENI experiments

loc
Download Presentation

Measurement Data Object Descriptor Specification - Present Status

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Measurement Data Object Descriptor Specification - Present Status Giridhar Manepalli Corporation for National Research Initiatives

  2. Measurement Data - Metadata • Measurement Data Object Descriptor (MDOM) describes • I&M data captured or streamed during GENI experiments • Data from various phases & levels of an experiment • I&M Data Object is loosely defined as an aggregation of data units that is self contained and makes sense by its own • MDOM classified into • Identifiers • Descriptors • Holders

  3. MDOD • Identifiers (mandatory elements) • Identifier • Primary and any secondaries • URN or other kinds • Source (as in owner/holder of the object) • Note: Primary ID must be a URN • Domain:sub-domain+object_type+object_name • Object_name is unique for a given domain & sub-domain

  4. MDOD • Identifier (optional elements) • Title • Abstract • Subject • Keywords • Annotations

  5. MDOD • Descriptors • Descriptor (mandatory elements) • Level (numbering scheme to support descriptions of both objects & their components within the same MDOD) • Object Type (stored file, stream, etc.) • Slice ID • Locator • View (scope of availability) • Type (via web or file system, etc.) • Object format (perf sonar api, oml db, gui, etc.) • Interpretation method • Is Encrypted

  6. MDOD • Descriptor (optional elements) • Collection • Geographic location • Start and end date • Project Id • Experiment Id • Run Id • Target • Category • Encryption method • Annotations

  7. MDOD • Holders • Holder (mandatory elements) • Id • Order • Domain • Sub-domain • Slice Id • User Id • Contact Info • Collection & inheritance status • Anonymization • Sharing • Disposal • Transactions (what & when)

  8. MDOD • Holder (optional elements) • Collection policy • Sharing Policy • Anonymization Method • Annotations on transactions

  9. Good Things • Excellent start • Collaborative Specification • Great Coverage • Nicely broken down into elements • Mandatory vs. optional elements identified • Genuine Use Cases • Gathering, transferring, and sharing

  10. Net Karma’s Proposal

  11. Zurawski’s Comments • Too many secondary identifiers • Descriptors should be contextualized • Variations based on the type of object • GENI specific descriptions should be clearly marked and separated • Slight changes to names & enclosing elements recommended

  12. Metadata Practices • Too many optional elements • Too many choices given to users • Users bound to take the path of least resistance • Keep the scope restricted to only mandatory elements – at least in the beginning • Try those out. Implement them. • One size fits all ---- No! • Capturing descriptions, formats, policies, transactions, etc. in a monolithic fashion • Register individual components separately • E.g., Capture legal formats & interpretations in their own records, and reference them here • E.g., Same with accepted policies • Identifiers cannot be semantic • Domain, sub-domain, and object-type are part of an ID • World view changes frequently • Non-semantic Ids are worth every penny • Search engines & registries mask the opaqueness • After all, IDs are just entities behind the scenes

  13. Metadata Practices • Object Type controlled vocabulary enumerates apples and oranges • Collection, flow, directory, file, database, gui are not mutually exclusive • Doesn’t help the recipient make any decision looking at the descriptor • Bundle type & format into format interpretation method • Covers too many corner cases, e.g., flow-rate • Expects too many details, e.g., locator (type, access method, etc.)

More Related