1 / 10

CLARIN-NL/VL procedure

CLARIN-NL/VL procedure. Project workspace. Become a member of the CLARIN-NL/VL group Create a group for your project add members related to your project Create one or more DCSs add existing and usable DCs preferably from the CLARIN-NL/VL view add new DCs Share the DCSs with

cian
Download Presentation

CLARIN-NL/VL procedure

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. CLARIN-NL/VL procedure CLARIN-NL ISOcat workshop

  2. Project workspace • Become a member of the CLARIN-NL/VL group • Create a group for your project • add members related to your project • Create one or more DCSs • add existing and usable DCs • preferably from the CLARIN-NL/VL view • add new DCs • Share the DCSs with • the project group • can edit shared DCs en DCSs • the CLARIN-NL/VL group • can also edit, so ask permission (!) if you want to change something in a DC owned by another project • changes are logged and attributed • Make one or more DCSs public when the group and Ineke finds the DCs mature • the project group becomes visible in the public part of ISOcat • also make the DCs public CLARIN-NL ISOcat workshop

  3. CLARIN-NL/VL workspace • Every project should have a representative in the CLARIN-NL/VL group • this representative should own the project DCS so (s)he can share it with the CLARIN-NL/VL group • Once DCs are mature for CLARIN-NL/VL use add them to the admin-proposed DCS • this is still in the private/shared part of ISOcat • Ineke will • review the (new) DCs • add them to CLARIN-NL/VL thematic DCSs • If an accepted DC is changed add them to the admin-changed DCS • Ineke will review the changed DCs CLARIN-NL ISOcat workshop

  4. Workflow to pick a DC • Look for a matching DC in • an approved one within our view • look in the thematic DCSs • if not, one under construction • look in the project DCSs • if not, look in ISOcat at large • include in your own DCS • create a DC stub if there is a very near one, Ineke will contact the owner or clone the DC • if not, create a new one • give reason in the justification • Add to the admin-proposed DCS CLARIN-NL ISOcat workshop

  5. BeyondISOcat • Use CMDI profiles annotated with ISOcat DCs for your metadata • Annotate your project resources with the selected/created Data Categories • preferably annotate a schema • to be stored in SCHEMAcat (under construction) • use @dcr:datcat • for XML (schemata) • for EBNF grammars • ... (contact isocat@mpi.nl) • Remember (ontological) relationships between the selected/created Data Categories and other Data Categories/concepts • to be stored in RELcat (under construction) • there is a spreadsheet template CLARIN-NL ISOcat workshop

  6. XML resource <lmf:lexiconxml:lang=“jp” alphabet=“ipa”> <lmf:entry> <lmf:lemma> <lmf:writtenForm>nihongo</…> … </…> … </…> … </…> CLARIN-NL ISOcat workshop

  7. XML Relax NG schema <rng:attribute name=“alphabet” dcr:datcat=“http://www.isocat.org/datcat/…”> <rng:valuedcr:datcat=“http://www.isocat.org/datcat/…”> ipa </…> … </…> CLARIN-NL ISOcat workshop

  8. CGN/DCOI grammar with DC references tag = pos '(' feat* ')' # @datcat ‘WW’ http://www.isocat.org/datacat/DC-1424 # @datcat ‘TW’ http://www.isocat.org/datacat/DC-1334 # @datcat ‘VG’ http://www.isocat.org/datacat/DC-1226 # @datcat ‘TSW’ http://www.isocat.org/datacat/DC-2717 pos = 'N' | ' ADJ' | 'WW' | 'TW' | 'VNW' | 'LID' | 'VZ' | 'VG' | 'BW' | 'TSW' feat = 'NTYPE' | 'GETAL' | 'GRAAD | 'GENUS | 'NAAMVAL' | 'POSITIE' | 'BUIGING | 'GETAL-N' | 'WVORM | 'PVTIJD | 'PVAGR' | 'NUMTYPE' | 'VWTYPE' | 'PDTYPE' | 'PERSOON' | 'STATUS' | 'NPAGR' | 'LWTYPE' | 'VZTYPE’ | 'CONJTYPE' | 'SPECTYPE' NTYPE = 'soortnaam' | 'eigennaam' GETAL = 'enkelvoud' | 'meervoud' | 'getal' GRAAD = 'basis' | 'comparatief' | 'superlatief' | 'diminutief' GENUS = 'genus' | 'zijdig' | 'masculien' | 'feminien' | 'onzijdig' NAAMVAL = 'standaard' | 'nominatief' | 'oblique' | 'bijzonder' | 'genitief' | 'datief' POSITIE = 'prenominaal' | 'nominaal' | 'postnominaal 'vrij' BUIGING = 'zonder' | 'met-e' | 'met-s' GETAL-N = 'zonder-n' | 'meervoud-n' WVORM = 'persoonsvorm' | 'buigbaar' | 'innitief' | 'onvdw' | 'voltdw‘ # @datcat PVTIJD http://www.isocat.org/datacat/DC-1286 # @datcat ‘verleden’ http://www.isocat.ord/datacat/DC-1347 # @datcat ‘conjunctie’ http://www.isocat.ord/datacat/DC-1843 PVTIJD = 'tegenwoordig' | 'verleden' | 'conjunctief' PVAGR = 'enkelvoud' | 'meervoud' | 'met-t' NUMTUPE = 'hoofdtelwoord' | 'rangtelwoord' VWTYPE = 'pr' | 'persoonlijk' | 'reexief' | 'reciprook' | 'bezittelijk' | 'vb' | 'vragend' | 'betrekkelijk' | 'exclamatief' | 'aanwijzend' | 'onbepaald' PDTYPE = 'pronomen' | 'adv-pronimen' | 'determiner' | 'gradeerbaar' PERSOON = 'persoon' | '1' | '2' | '2v' | '2b' | '3' | '3p' | '3' | '3v' | '3o' STATUS = 'vol' | 'gereduceerd' | 'nadruk' NPAGR = 'agr' | 'evon' | 'rest' | 'evz' | 'mv' | 'agr3' | 'evmo' | 'rest3' | 'evf' | 'mv' LWTYPE = 'bepaald' | 'onbepaald' VZTYPE = 'initieel' | 'versmolten' | 'naal' CONJTYPE = 'nevenschikkend' | 'onderschikkend' SPECTYPE = 'afgebroken' | 'onverstaanbaar' | 'vreemd' | 'deeleigen' | 'meta' | 'commentaar' | 'achtergrond' | 'afkorting' | 'symbool' | 'dialect' CLARIN-NL ISOcat workshop

  9. Relation type taxonomy • rel:related • rel:sameAs • rel:almostSameAs • rel:broaderThan • rel:superClassOf • rel:hasPart • rel:hasDirectPart • rel:narrowerThan • rel:subClassOf • rel:partOf • rel:directPartOf • Extensible taxonomy inspired by OWL and SKOS. Any other predicates should be added at the appropriate place in the hierarchy: • rel:sameAs • owl:sameAs • skos:exactMatch • rel:almostSameAs • skos:closeMatch CLARIN-NL ISOcat workshop

  10. Relation Registry - RELcat • The Relation Registry is basically a triple store: • Subject: resource 1 • Predicate: relationship • Object: resource 2 • # /first plural exclusive vernacular/ is-a /vernacular/ • isocat:DC-1234 rel:subClassOf isocat:DC-4 . • # /first person/ part-of /first plural exclusive vernacular/ • isocat:DC-1 rel:partOf isocat:DC-1234 . • # /plural/ part-of /first plural exclusive vernacular/ • isocat:DC-2 rel:partOf isocat:DC-1234 . • # /exclusive/ part-of /first plural exclusive vernacular/ • isocat:DC-3 rel:partOf isocat:DC-1234 . • The Relation Registry is implemented as a quad store: • every user can have its own set(s) of relationships CLARIN-NL ISOcat workshop

More Related