Registry
Global Biodiversity Information Facility (GBIF) 2012
Éamonn Ó Tuama
A shared registry
Primary aims:
• To allow the registration and discovery of a growing number of entities: institutions, networks, datasets, schemas, vocabularies, etc.
• To direct clients on how to access network resources
• To accurately model the complex relationships between entities, enabling correct attribution (e.g. recognizing data hosting partnerships)
• To provide a reliable identifier “minting” service, allowing distributed systems to connect on common resources
A shared registry
Secondary aims:
• Provide network monitoring services, e.g. alerts on new resources or on technical failures (servers going offline)
• Offer search capabilities through indexing of metadata
• Enable external classification of registered objects through tagging (both private and public)
AGENT table: holds information on all entities inside the GBIF Network: Organizations, Datasets, Technical Installations, Nodes and Networks.
AGENT_RELATION table: holds the relations between these entities. For example, this table models relations like "OrganizationXYZ owns DatasetABC" or "NodeLMP endorses OrganizationXYZ".
Four tables hold information related to each agent: SERVICE, CONTACT, IDENTIFIER, TAG.
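The AGENT / AGENT_RELATION idea can be illustrated with a minimal sketch. It is not the actual GBIF schema: the slides name only the tables, so every column name, the relation labels ("OWNS", "ENDORSES") and the helper functions are assumptions, and SQLite (in memory) stands in for MySQL so that the example is self-contained.

```python
import sqlite3

# Hypothetical schema: only the table names come from the slides.
# SQLite is used here in place of MySQL to keep the sketch runnable.
SCHEMA = """
CREATE TABLE agent (
    id   INTEGER PRIMARY KEY,
    type TEXT NOT NULL CHECK (type IN (
        'ORGANIZATION', 'DATASET', 'TECHNICAL_INSTALLATION',
        'NODE', 'NETWORK')),
    name TEXT NOT NULL
);
CREATE TABLE agent_relation (
    from_agent_id INTEGER NOT NULL REFERENCES agent(id),
    relation      TEXT    NOT NULL,
    to_agent_id   INTEGER NOT NULL REFERENCES agent(id)
);
"""


def build_registry():
    """Create an in-memory registry holding the two example relations."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany(
        "INSERT INTO agent (id, type, name) VALUES (?, ?, ?)",
        [
            (1, "NODE", "NodeLMP"),
            (2, "ORGANIZATION", "OrganizationXYZ"),
            (3, "DATASET", "DatasetABC"),
        ],
    )
    # Each row is one directed edge in the network graph.
    conn.executemany(
        "INSERT INTO agent_relation VALUES (?, ?, ?)",
        [
            (1, "ENDORSES", 2),  # NodeLMP endorses OrganizationXYZ
            (2, "OWNS", 3),      # OrganizationXYZ owns DatasetABC
        ],
    )
    conn.commit()
    return conn


def related(conn, agent_name, relation):
    """Return names of agents that agent_name points to via relation."""
    rows = conn.execute(
        """
        SELECT target.name
        FROM agent_relation AS r
        JOIN agent AS source ON source.id = r.from_agent_id
        JOIN agent AS target ON target.id = r.to_agent_id
        WHERE source.name = ? AND r.relation = ?
        ORDER BY target.name
        """,
        (agent_name, relation),
    ).fetchall()
    return [name for (name,) in rows]
```

Calling `related(build_registry(), "OrganizationXYZ", "OWNS")` looks up the datasets owned by that organization. Keeping relations in their own table, rather than as columns on AGENT, is what lets one pair of tables describe an arbitrary graph of entity types.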
System Architecture
• A MySQL database (modeling the network graph)
• Solr search server
• RabbitMQ message broadcasting
• XML files stored on the filesystem
• RESTful (JSON) web services
• A SPARQL endpoint is under consideration