100 likes | 213 Views
CLARIN-NL/VL procedure. Project workspace. Become a member of the CLARIN-NL/VL group Create a group for your project add members related to your project Create one or more DCSs add existing and usable DCs preferably from the CLARIN-NL/VL view add new DCs Share the DCSs with
E N D
CLARIN-NL/VL procedure CLARIN-NL ISOcat workshop
Project workspace • Become a member of the CLARIN-NL/VL group • Create a group for your project • add members related to your project • Create one or more DCSs • add existing and usable DCs • preferably from the CLARIN-NL/VL view • add new DCs • Share the DCSs with • the project group • can edit shared DCs en DCSs • the CLARIN-NL/VL group • can also edit, so ask permission (!) if you want to change something in a DC owned by another project • changes are logged and attributed • Make one or more DCSs public when the group and Ineke finds the DCs mature • the project group becomes visible in the public part of ISOcat • also make the DCs public CLARIN-NL ISOcat workshop
CLARIN-NL/VL workspace • Every project should have a representative in the CLARIN-NL/VL group • this representative should own the project DCS so (s)he can share it with the CLARIN-NL/VL group • Once DCs are mature for CLARIN-NL/VL use add them to the admin-proposed DCS • this is still in the private/shared part of ISOcat • Ineke will • review the (new) DCs • add them to CLARIN-NL/VL thematic DCSs • If an accepted DC is changed add them to the admin-changed DCS • Ineke will review the changed DCs CLARIN-NL ISOcat workshop
Workflow to pick a DC • Look for a matching DC in • an approved one within our view • look in the thematic DCSs • if not, one under construction • look in the project DCSs • if not, look in ISOcat at large • include in your own DCS • create a DC stub if there is a very near one, Ineke will contact the owner or clone the DC • if not, create a new one • give reason in the justification • Add to the admin-proposed DCS CLARIN-NL ISOcat workshop
BeyondISOcat • Use CMDI profiles annotated with ISOcat DCs for your metadata • Annotate your project resources with the selected/created Data Categories • preferably annotate a schema • to be stored in SCHEMAcat (under construction) • use @dcr:datcat • for XML (schemata) • for EBNF grammars • ... (contact isocat@mpi.nl) • Remember (ontological) relationships between the selected/created Data Categories and other Data Categories/concepts • to be stored in RELcat (under construction) • there is a spreadsheet template CLARIN-NL ISOcat workshop
XML resource <lmf:lexiconxml:lang=“jp” alphabet=“ipa”> <lmf:entry> <lmf:lemma> <lmf:writtenForm>nihongo</…> … </…> … </…> … </…> CLARIN-NL ISOcat workshop
XML Relax NG schema <rng:attribute name=“alphabet” dcr:datcat=“http://www.isocat.org/datcat/…”> <rng:valuedcr:datcat=“http://www.isocat.org/datcat/…”> ipa </…> … </…> CLARIN-NL ISOcat workshop
CGN/DCOI grammar with DC references tag = pos '(' feat* ')' # @datcat ‘WW’ http://www.isocat.org/datacat/DC-1424 # @datcat ‘TW’ http://www.isocat.org/datacat/DC-1334 # @datcat ‘VG’ http://www.isocat.org/datacat/DC-1226 # @datcat ‘TSW’ http://www.isocat.org/datacat/DC-2717 pos = 'N' | ' ADJ' | 'WW' | 'TW' | 'VNW' | 'LID' | 'VZ' | 'VG' | 'BW' | 'TSW' feat = 'NTYPE' | 'GETAL' | 'GRAAD | 'GENUS | 'NAAMVAL' | 'POSITIE' | 'BUIGING | 'GETAL-N' | 'WVORM | 'PVTIJD | 'PVAGR' | 'NUMTYPE' | 'VWTYPE' | 'PDTYPE' | 'PERSOON' | 'STATUS' | 'NPAGR' | 'LWTYPE' | 'VZTYPE’ | 'CONJTYPE' | 'SPECTYPE' NTYPE = 'soortnaam' | 'eigennaam' GETAL = 'enkelvoud' | 'meervoud' | 'getal' GRAAD = 'basis' | 'comparatief' | 'superlatief' | 'diminutief' GENUS = 'genus' | 'zijdig' | 'masculien' | 'feminien' | 'onzijdig' NAAMVAL = 'standaard' | 'nominatief' | 'oblique' | 'bijzonder' | 'genitief' | 'datief' POSITIE = 'prenominaal' | 'nominaal' | 'postnominaal 'vrij' BUIGING = 'zonder' | 'met-e' | 'met-s' GETAL-N = 'zonder-n' | 'meervoud-n' WVORM = 'persoonsvorm' | 'buigbaar' | 'innitief' | 'onvdw' | 'voltdw‘ # @datcat PVTIJD http://www.isocat.org/datacat/DC-1286 # @datcat ‘verleden’ http://www.isocat.ord/datacat/DC-1347 # @datcat ‘conjunctie’ http://www.isocat.ord/datacat/DC-1843 PVTIJD = 'tegenwoordig' | 'verleden' | 'conjunctief' PVAGR = 'enkelvoud' | 'meervoud' | 'met-t' NUMTUPE = 'hoofdtelwoord' | 'rangtelwoord' VWTYPE = 'pr' | 'persoonlijk' | 'reexief' | 'reciprook' | 'bezittelijk' | 'vb' | 'vragend' | 'betrekkelijk' | 'exclamatief' | 'aanwijzend' | 'onbepaald' PDTYPE = 'pronomen' | 'adv-pronimen' | 'determiner' | 'gradeerbaar' PERSOON = 'persoon' | '1' | '2' | '2v' | '2b' | '3' | '3p' | '3' | '3v' | '3o' STATUS = 'vol' | 'gereduceerd' | 'nadruk' NPAGR = 'agr' | 'evon' | 'rest' | 'evz' | 'mv' | 'agr3' | 'evmo' | 'rest3' | 'evf' | 'mv' LWTYPE = 'bepaald' | 'onbepaald' VZTYPE = 'initieel' | 'versmolten' | 'naal' CONJTYPE = 'nevenschikkend' | 'onderschikkend' SPECTYPE = 'afgebroken' | 'onverstaanbaar' | 'vreemd' | 'deeleigen' | 'meta' | 'commentaar' | 'achtergrond' | 'afkorting' | 'symbool' | 'dialect' CLARIN-NL ISOcat workshop
Relation type taxonomy • rel:related • rel:sameAs • rel:almostSameAs • rel:broaderThan • rel:superClassOf • rel:hasPart • rel:hasDirectPart • rel:narrowerThan • rel:subClassOf • rel:partOf • rel:directPartOf • Extensible taxonomy inspired by OWL and SKOS. Any other predicates should be added at the appropriate place in the hierarchy: • rel:sameAs • owl:sameAs • skos:exactMatch • rel:almostSameAs • skos:closeMatch CLARIN-NL ISOcat workshop
Relation Registry - RELcat • The Relation Registry is basically a triple store: • Subject: resource 1 • Predicate: relationship • Object: resource 2 • # /first plural exclusive vernacular/ is-a /vernacular/ • isocat:DC-1234 rel:subClassOf isocat:DC-4 . • # /first person/ part-of /first plural exclusive vernacular/ • isocat:DC-1 rel:partOf isocat:DC-1234 . • # /plural/ part-of /first plural exclusive vernacular/ • isocat:DC-2 rel:partOf isocat:DC-1234 . • # /exclusive/ part-of /first plural exclusive vernacular/ • isocat:DC-3 rel:partOf isocat:DC-1234 . • The Relation Registry is implemented as a quad store: • every user can have its own set(s) of relationships CLARIN-NL ISOcat workshop