The European Legislation Identifier Using semantic technologies to identify legislation
The problem Legislation is digitally available. Yet ... it can be hard to access, exchange and reuse! Because different legal systems and legislation publishers... organise and classify this information in different ways.
The solution What if we could... ... change that? ... uniquely identify your legislation online? ... guarantee easier access, exchange and reuse of legislation?
Introducing: The European Legislation Identifier
The European Legislation Identifier: our goals • Help to make legal data transparent and reliable • Promote interoperability and cooperation on legal matters • Improve search mechanisms for legal professionals and citizens
How does it work? 1. HTTP URIs built on common patterns identify legislation 2. A common ontology helps to describe legislation • RDFa: a machine-readable format for metadata exchange
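As an illustration of point 1, the sketch below splits an ELI-style URI into its components. The pattern `http://data.europa.eu/eli/{type}/{year}/{number}/oj` is an assumption modelled on ELI URIs as exposed on EUR-Lex; it is not stated on the slide, and national ELI implementations use their own templates. The function and field names are hypothetical.

```python
import re

# Assumed URI pattern (not taken from the slides):
#   http://data.europa.eu/eli/{type}/{year}/{number}/oj
ELI_PATTERN = re.compile(
    r"^http://data\.europa\.eu/eli/"
    r"(?P<doc_type>[a-z_]+)/(?P<year>\d{4})/(?P<number>\d+)/oj$"
)


def parse_eli(uri: str) -> dict:
    """Split an ELI-style URI into document type, year and number.

    Raises ValueError when the URI does not follow the assumed pattern.
    """
    match = ELI_PATTERN.match(uri)
    if match is None:
        raise ValueError(f"not a recognised ELI URI: {uri}")
    return {
        "doc_type": match.group("doc_type"),
        "year": int(match.group("year")),
        "number": int(match.group("number")),
    }


if __name__ == "__main__":
    # Regulation (EU) No 1384/2014, the act used as an example in this deck.
    print(parse_eli("http://data.europa.eu/eli/reg/2014/1384/oj"))
```

Because the identifier itself carries the type, year and number, a client can recognise which act a link points to without fetching any metadata first.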
ELI – a joint project Driven and coordinated by governmental legislation publishers in cooperation with the Publications Office of the EU. A dedicated team shares expertise and helps with the implementation. Together we make legislation easy to access, exchange and reuse.
ELI access points and documentation ELI on EUR-Lex: http://eur-lex.europa.eu/ ELI ontology: http://publications.europa.eu/mdr/eli/index.html ELI register: http://eur-lex.europa.eu/eli-register/about.html ELI guides: • ELI: A technical implementation guide http://data.europa.eu/doi/10.2830/74251 • ELI implementation methodology: Good practices and guidelines http://data.europa.eu/doi/10.2830/813167
Persistent URIs Enabling permanent availability of your information assets on the web
The problem • Broken links • No guarantee of persistence of URIs. Would you build your app on non-persistent data? • Namespace fragmentation leads to confusion • No common persistent URI policy towards citizens, businesses and public administrations
The solution: URI Committee • Common URI pattern: http://data.europa.eu/{collection.id}/{local.id} • Interinstitutional governance: URI Committee • Common persistent URI policy for EU institutions • URI redirection service managed by Publications Office
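The common URI pattern above can be sketched as a small helper. This is a minimal illustration, not the redirection service itself: the function name, the validation rules and the choice to percent-encode unsafe characters are assumptions made for the example.

```python
from urllib.parse import quote

BASE = "http://data.europa.eu"


def persistent_uri(collection_id: str, local_id: str) -> str:
    """Fill in http://data.europa.eu/{collection.id}/{local.id}.

    Assumption for this sketch: the collection id is a single path
    segment, while the local id may contain further '/' separators.
    """
    if not collection_id or "/" in collection_id:
        raise ValueError("collection_id must be one non-empty path segment")
    if not local_id:
        raise ValueError("local_id must not be empty")
    return f"{BASE}/{quote(collection_id, safe='')}/{quote(local_id, safe='/')}"


if __name__ == "__main__":
    # "eli" as collection id, with an ELI-style local id (illustrative values).
    print(persistent_uri("eli", "reg/2014/1384/oj"))
```

Keeping the collection id as a single segment lets a redirection service route each collection to its current host without parsing the local part.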
Persistent URIs for EU Institutions Our service for all EU Institutions and agencies: • data.europa.eu: One subdomain for all EU related data • Additional visibility for your datasets • Your data remain persistent through redirection service Questions or suggestions? Contact us at OP-URI@publications.europa.eu
IMMC Exchange your data and documents in a standardised and secure way
The problem: How to send data in a standardised and secure manner? • Many manual interventions needed (e-mail) • Costly mappings and transformations • No automatic validation • Are transmissions secure?
The solution: IMMC • Standardised exchange format agreed between EU institutions • Common core schema with business specific extensions • Use of common metadata elements and values • Secure transmission using e-TrustEx • Under governance of Interinstitutional Metadata Maintenance Committee (IMMC) • Used for exchange of data between • EU institutions • EU institutions and Member States
IMMC for EU Institutions Our service for all EU Institutions and agencies: • IMMC governance • Maintenance of IMMC exchange protocol on MDR website: http://publications.europa.eu/mdr/core-metadata/index.html • Business and technical consultancy • IMMC builder tool: Facilitates IMMC implementation in your environment Questions or suggestions? Contact us at OP-METADATA-REGISTRY@publications.europa.eu
Exploiting harmonised structures of legislative documents Interinstitutional Formats Committee
Context: The Ordinary Legislative Procedure REGULATION (EU) No 1384/2014 OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL of 18 December 2014 on the tariff treatment for goods originating in Ecuador THE PARLIAMENT AND THE COUNCIL OF THE EUROPEAN UNION Having regard to the Treaty on the Functioning of the European Union, and in particular Article 207(2) thereof, Having regard to the proposal from the European Commission, [...]
The problem: Multiple format conversions REGULATION (EU) No 1384/2014 has to be converted between: • EC / LegisWrite (.docx) • EP / Akoma Ntoso (XML) • Formex (XML)
The challenge: Time, cost, quality Repeated conversions between EC / LegisWrite (.docx), EP / Akoma Ntoso (XML) and Formex (XML)
The solution: Harmonising formats for easy exchange
• Common Exchange Model
• Based on LegalDocML (Akoma Ntoso)
• Business rules
• Technical validation rules
Interinstitutional Formats Committee
Common Vocabulary
<regulation><title_of_the_regulation> <type_of_act>REGULATION</type_of_act> (EU) No 1384/2014 OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL of 18 December 2014 on the tariff treatment for goods originating in Ecuador </title_of_the_regulation>
Formex
<act xmlns="http://formex.publications.europa.eu/" [...]
<title><ti><p> <ht type="upper">Regulation</ht>(EU) No 1384/2014 <ht type="upper">of the European Parliament and of the Council</ht></p>
<p>of <date iso="2014-12-18">18 December 2014</date> </p>
<p>on the tariff treatment for goods originating in Ecuador</p> </ti> </title>
Akoma Ntoso
<akomaNtoso xmlns="http://www.akomantoso.org/2.0" xmlns:leos="urn:eu:europa:ec:leos">
<bill contains="originalVersion"> <meta> [...] <preface id="preface">[...]
<docTitle id="preface__longtitle_1__p__docTitle">
<docStage id="preface__longtitle_1__p__docTitle__docstage_1" refersTo="~proprietary__docstage">Proposal for a</docStage>
<docType id="preface__longtitle_1__p__docTitle__doctype_1" refersTo="~proprietary__doctype">REGULATION OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL</docType>
<docPurpose id="preface__longtitle_1__p__docTitle__docpurpose_1" refersTo="~proprietary__docpurpose">on the tariff treatment for goods originating in Ecuador</docPurpose>
</docTitle>
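To show why structured markup helps exchange, the sketch below reads the date and title paragraphs from a Formex-style title element using Python's standard XML parser. The fragment is a trimmed copy of the Formex sample on this slide; as an assumption for the example, the enclosing act element and its namespace declaration (elided in the slide) are left out, so element names are matched without a namespace. The function name is hypothetical.

```python
import xml.etree.ElementTree as ET

# Trimmed Formex-style fragment; the surrounding <act> wrapper and its
# namespace are omitted here (assumption made for this sketch).
FORMEX_TITLE = """
<title><ti><p>
<ht type="upper">Regulation</ht>(EU) No 1384/2014
<ht type="upper">of the European Parliament and of the Council</ht></p>
<p>of <date iso="2014-12-18">18 December 2014</date> </p>
<p>on the tariff treatment for goods originating in Ecuador</p>
</ti></title>
"""


def read_title(xml_text: str) -> dict:
    """Return the ISO date and the whitespace-normalised title paragraphs."""
    root = ET.fromstring(xml_text.strip())
    date = root.find(".//date")
    paragraphs = [
        " ".join("".join(p.itertext()).split())  # collapse runs of whitespace
        for p in root.iter("p")
    ]
    return {
        "date": date.get("iso") if date is not None else None,
        "paragraphs": paragraphs,
    }


if __name__ == "__main__":
    info = read_title(FORMEX_TITLE)
    print(info["date"])
    print(info["paragraphs"][-1])
```

The date comes from the machine-readable `iso` attribute, not from parsing "18 December 2014", which is the kind of information that tends to get lost when a text is converted through a word-processing format.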
The benefits • Speeding up the process • Contributing to better law-making • ... also applicable to other publications Interinstitutional Formats Committee
Questions or suggestions? Contact us at OP-CUSTOMER-SUPPORT@publications.europa.eu Exploiting harmonised structures of legislative documents Interinstitutional Formats Committee
VocBench Easy way to manage your controlled vocabularies with semantic technologies
The problem: How to manage my controlled vocabularies? • What tool? What format? • I want to use semantic technologies, but do not have the expertise • I'd prefer to avoid using a commercial solution • Certain solutions not supported by my infrastructure
The solution: VocBench • Open source web-based multilingual vocabulary management tool • Large scale thesaurus management (e.g. EuroVoc) • Access through Testa network • Specialised standard-based format for controlled vocabularies: SKOS-AP-EU • No need to install in your infrastructure or invent your own formats
VocBench for EU Institutions Our service for all EU Institutions and agencies: • Access to your own VocBench instance on Testa: https://webgate.ec.testa.eu/publications/vocbench/ • No need to reinvent the wheel, manage your vocabularies in SKOS-AP-EU format • Consultancy Questions or suggestions? Contact us at OP-EUROVOC@publications.europa.eu
Long-term digital preservation What's in it for the EU institutions?
The problem Unless the vexatious problem of digital preservation is solved, all texts “born digital” belong to an endangered species. Robert Darnton, The Research Library in the Digital Age, Harvard University Library, 2008
The solution: long-term digital preservation The long-term preservation of digital assets ensures their authenticity, access and use over time. It is (rather) new • Think about personal pictures (we never had to) It is (all about) active planning and standards • Planning: which formats were/will be standard in 1990/2030? • Active: define/refine policies, transform/migrate, automate flows, acquire new collections, error handling, document and generate metadata, technology watch, standards… • Standards: ISO 14721/16363, FRBR, XML, PDF/A, METS, PREMIS Regregex@Wikipedia CC <https://goo.gl/55xfQy>
What long-term digital preservation is not • It does not need to be a public repository • It is not a box to fill and forget • Obsolescence will double your work • Loss of data • It requires active management • Repository • Access: public, variable, urgent, short term • Objectives: give public access, prevent crashes • Archive • Access: restricted, slow, long term • Objectives: prevent litigation, comply with policies Rev.Xanatos@Flickr CC <https://goo.gl/rcS1Fe>
OP Archive collections Legal publications • (Digital) Authentic Official Journal • Legislative collections: EU and national legislation, prelegislation, consolidation, case law etc. General publications • EU Bookshop • Websites (more info in a moment) Master data • Ontologies, vocabularies, schemas • Logs, configuration, software • Documentation
What is in it for the EU institutions? Archive service • CT 10361, Lot 1 Digitization • CT 10579 Long-term preservation expertise • Cellar and Archive team @ OP, and also CT 10361, Lot 2 Questions or suggestions? Contact us at OP-LONG-TERM-PRESERVATION@publications.europa.eu United Nations CC <https://goo.gl/3zLye6>
The problem: Losing web content • Changing web content? • Deleting web pages? • Getting a new online visual identity? • Moving to another domain name? BUT: Where does the old content go? How to keep track?
The solution: Web preservation • Quality control • Harvest • Description • Crawl • Access • Long-term preservation • Selection • Living web
Example: EC digital transformation (snapshots from 31 October 2013 and 9 June 2016)
Web preservation for the EU Institutions Our service for all EU Institutions and agencies: • Your website archived 4 times per year • Web archive accessible online: http://collections.internetmemory.org/haeu Questions or suggestions? Contact us at op-web-preservation@publications.europa.eu
CELLAR Enabling reuse of EU information by machines
Publications Office's central content and metadata store - CELLAR Diagram labels: Contractors …, IMMC, Reception, Validation, Conversion, METS, CELLAR, ELI. Legend: • ELI: European Legislation Identifier • IMMC: Standardized XML transmission envelope • METS: Metadata Encoding and Transmission Standard
How are things structured in CELLAR? • Thesauri / authority tables • Ontologies / Common Data Model • Control Data • Instance Data …
How can you retrieve data from CELLAR? • Dublin Core (core metadata) • Linked Open Data (LOD) • Web-friendly ("RESTful") Interface • Resource Description Framework (RDF) • Standard Query Language (SPARQL) • FRBR model • URIs: http://publications.europa.eu/resource/{ps-id}/{obj-id} Access channels from the Internet, EUR-Lex and the OP Portal: • Direct access / RESTful WS • Notification / RSS • SPARQL: http://publications.europa.eu/webapi/rdf/sparql
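A minimal sketch of how a client might address the SPARQL endpoint listed above. It only builds the HTTP request and does not send it. The query is a generic placeholder, not one based on the CELLAR data model; passing the query as a `query` URL parameter follows the standard SPARQL protocol, while the `Accept` value and the function name are assumptions for the example.

```python
from urllib.parse import urlencode
from urllib.request import Request

ENDPOINT = "http://publications.europa.eu/webapi/rdf/sparql"

# Generic placeholder query: any five triples from the store.
SAMPLE_QUERY = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 5"


def build_request(query: str) -> Request:
    """Build (but do not send) a GET request carrying a SPARQL query."""
    url = f"{ENDPOINT}?{urlencode({'query': query})}"
    # Asking for JSON results is an assumption; other formats may be offered.
    return Request(url, headers={"Accept": "application/sparql-results+json"})


if __name__ == "__main__":
    request = build_request(SAMPLE_QUERY)
    print(request.full_url)
    # To execute it: urllib.request.urlopen(request) and read the response.
```

Encoding the query with `urlencode` keeps characters such as `?`, `{` and spaces from breaking the URL.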
How much is CELLAR used? • 8 million requests per day served on average, peaks > 20 million • > 100k SPARQL queries / day • > 1 million different "works" in > 10 million linguistic versions ("expressions"), > 23 million formats ("manifestations") in > 34.5 million "items" • > 230 million persistent identifiers • > 1500 million triples in Oracle RDF store • Ca. 5000 resources treated each day (most in 23 languages) • Sizes: 4.2 TB Oracle DB (compressed); content (in Fedora repository) > 17.5 TB; 120 million files in Fedora Charts: Requests / day (2016-09) • Requests from internet / country (2016-09) • SPARQL requests / day (2016-09)
Questions or suggestions? Contact us at: OP-CELLAR-HELPDESK@publications.europa.eu
EU Open Data Portal The single access point to data from EU institutions
What do we mean by data? • Sets of bibliographic records of documents (publications, legal documents, reports, recommendations, studies…) • Data about projects, programmes, actions, measures, grants … • Reference Data (thesauri, vocabularies, ontologies…) • Geographic, geospatial, meteorological data • Statistics and time series Data is: all objective, factual, non-personal, non-aggregated information that we collect or generate during our activities
Why an EU Open Data Portal? • Bring together resources from all EU institutions (65 so far) • Single access point to all of the European Union's Open Data • Aligned terms of use (implementing EU reuse policy) • Centralise engagement via a data consumer community • Harmonised metadata / Multilingualism