190 likes | 426 Views
STELLAR Introduction Douglas Tudhope Hypermedia Research Unit, University of Glamorgan. STELLAR. 12 month AHRC funded project Hypermedia Research Unit, University of Glamorgan Archaeology Data Service, University of York English Heritage Centre for Archaeology, Portsmouth
E N D
STELLAR Introduction Douglas Tudhope Hypermedia Research Unit, University of Glamorgan
STELLAR • 12 month AHRC funded project • Hypermedia Research Unit, University of Glamorgan • Archaeology Data Service, University of York • English Heritage Centre for Archaeology, Portsmouth • Builds on previous 3 year AHRC funded STAR Project • Acknowledgments Ceri Binding (University of Glamorgan) Andreas Vlachidis (University of Glamorgan) Keith May, English Heritage (EH) Stuart Jeffrey, Julian Richards, Michael Charno, Tim Evans, Holly Wright Archaeology Data Service (ADS) Archaeology Department, University of York
STAR – Aims and background • Investigate semantic technologies for integrating and cross searching datasets and associated grey literature • Current situation - fragmented datasets with different terminology • Lack of semantic interoperability and cross search • Need for integrative metadata framework CIDOC CRM (ISO standard) as high level, core ontology together with the CRM-EH archaeological extension of the CRM along with relevant EH thesauri and glossaries
STAR Project - General Architecture Applications – Server Side, Rich Client, Browser Web Services, SQL, SPARQL RDF Based Semantic Layer (CRM / CRMEH / SKOS) Data Mapping / Normalisation Indexing Conversion STAN RRAD MoLAS LEAP RPRE EH thesauri, glossaries Grey literature
Natural Language Processing (NLP) of archaeological grey literature • Extract key concepts in same semantic representation as for data. • Allows unified searching of different datasets and grey literature • in terms of same underlying CRM-based conceptual structure • Output as RDF triples in Demonstrator and as XML with greylit “ditch containing prehistoric pottery dating to the Late Bronze Age”
STAR Demonstrator – search for a conceptual pattern An Internet Archaeology publication on one of the (Silchester Roman) datasets we used in STAR discusses the finding of a coin within a hearth. -- does the same thing occur in any of the grey literature reports? Requires comparison of extracted data with NLP indexing in terms of the ontology.
STAR Demonstrator – search for a conceptual pattern • Research paper reports finding a coin in hearth – exist elsewhere?
STELLAR aims and outcomes • Make it easier to map and extract datasets to CIDOC CRM ontology in a consistent manner • Generalise the data extraction tools produced by STAR so third party data providers can use them • Develop methods for mapping and extraction of archaeological datasets into RDF/XML conforming to CIDOC CRM-EH ontology with unique global identifiers for entities and concepts (http URIs) for publication as linked data • Freely available tools and guidelines/tutorials
STELLAR background • In practice mapping to CRM has tended to require specialist knowledge of the ontology and been resource intensive • Given the wide scope of the CRM, it is possible to make multiple valid mappings depending on the intended purpose and focus of the mappings • STELLAR tools convert archaeological data to CRM/RDF in a consistent manner, without requiring detailed knowledge of the underlying ontology • User chooses a template for a particular data pattern and supplies the corresponding input from their database (combination of optional elements with a mandatory ID) • STELLAR templates for • CRM-EH archaeological extension to the CIDOC CRM • Some more general CIDOC CRM templates conforming to the CLAROS Project format • SKOSifying a glossary/thesaurus connected with the dataset
STELLAR applications http://hypermedia.research.glam.ac.uk/resources/STELLAR-applications/ STELLAR.Console STELLAR.Web
STELLAR.Console • Command line application • Freely downloadable • Data import / export / conversion
STELLAR.Web • Subset of STELLAR.Console functionality • CSV2RDF • RDFStats • Using same templates • Input CSV from any external application • No registration or login
Data Conversions CSV TAB TAB2DB CSV2DB SQL2CSV SQL2TAB Database TAB2RDF CSV2RDF SQL2RDF Data Template RDF
Consistent URI Construction • Namespace prefix (user defined) • http://stellar/silchester/ • Entity type (generated by template) • EHE0007 (i.e. Context) • Value (From data, URL encoded if necessary) • 1015 • URI pattern: {prefix}{entity type}_{value} http://stellar/silchester/EHE0007_1015
Using the RDF data RDF data output from STELLAR RDF application / triple store SPARQL queries RDF enabled applications Linked data browsers
Linked data publication by ADS • Selected range of archived archaeological excavation datasets (academic and commercial sectors) converted to RDF using STELLAR tools and ingested into a repository (triple store) • The SPARQL endpoint allows consumption by semantic technologies including Pubby (an open source linked data front end) used for publishing linked data • Content negotiation presents data in formats appropriate for the requesting application (eg RDF/XML/HTML browsers). • Effort devoted to ensure URI construction appropriate for the domain. For ADS archives this includes use of existing DOI identifier codes in the target URI. For external data sets (not already archived with the ADS, eg from commercial contractors) site naming conventions validated by the ADS adopted. • The linked data outputs (and the frontend) are available from ADS website http://data.archaeologydataservice.ac.uk
Contact Information Douglas Tudhope Faculty of Advanced Technology University of Glamorgan Pontypridd CF37 1DL Wales, UK dstudhope@glam.ac.uk http://hypermedia.research.glam.ac.uk/kos/STAR/ http://hypermedia.research.glam.ac.uk/resources/star-demonstrator/ STAR Research Demonstrator http://intarch.ac.uk/journal/issue30/tudhope_index.html STAR Internet Archaeology paper (open access) http://andronikos.kyklos.co.uk/aboutus.php NLP work - see reports with CRM and CRM-EH composite annotations in Sample Documents http://hypermedia.research.glam.ac.uk/kos/STELLAR/ http://hypermedia.research.glam.ac.uk/resources/STELLAR-applications/ STELLAR tools, templates and documentation http://data.archaeologydataservice.ac.uk STELLAR linked data