The development of PDT 3.0: Introduction to the discussion
Malá Skála, September 2012
The goals as defined in the project proposal (March 2009)
(A) To broaden and deepen the information on the tectogrammatical level: annotation of intra-sentential and inter-sentential phenomena and of how they reflect the cognitive content
(B) To enrich the methodology of the annotation (an account of disputable and marginal phenomena)
The goals as defined in the project proposal (March 2009), cont.
(C) To elaborate instructions for the annotation of the newly introduced phenomena
(D) To increase the amount of linguistically annotated data
Results achieved (August 2012)
(1) A new schema of verbal grammatemes (grammatemes diatgram, factmod, sentmod)
(2) A noun grammateme for pair/group meaning (implemented in PDT 2.5; for PDTSC, in svn)
(3) Continuation of TGTS annotation of PDTSC (in progress)
Results achieved (August 2012), cont.
(4) Coreferential arrows for 1st and 2nd person (in progress)
(5) Verbal Pattern Sample of selected English verbs
(6) Theoretical conclusions and proposals contained in the book by M. Mikulová, the book on some types of noun valency by V. Kolářová, new parts of the manuscript of Syntax (Grammar of Contemporary Czech II), and many papers
Plans for October – December 2012 and January – October 2013
(i) Final version of PDT 3.0 – managed by J. Štěpánek
How will PDT 3.0 be defined? As a set of Czech treebanks consisting of three parts (improved PDT 2.5, PDTSC, PCEDT), or as an “extended” PDT 2.0/2.5? In fact, we do not plan an extension of the data.
Plans for October – December 2012 and January – October 2013, cont.
(ii) Preparation of manuals describing the new phenomena and the approach to the annotation of spoken data (including “reconstruction”) – J. Panevová, M. Ševčíková, M. Mikulová (Technical Report)
Proposal: an appendix to the report showing how to formulate more sophisticated queries in TrEd – J. Štěpánek, E. Bejček, J. Mírovský
Plans for October – December 2012 and January – October 2013, cont.
(iii) Dictionary of verbo-nominal expressions and revision of QCor (including the deletion of the lemma #Benef, its revision and change into the lemma #Gen with the functor BEN) – V. Kettnerová, V. Kolářová
(iv) List of deverbal and deadjectival nouns with non-regular valency (with specific shifts in the forms of their participants and possible reflections in sempos values); their treatment in PDT-VALLEX – V. Kolářová
Plans for October – December 2012 and January – October 2013, cont.
(v) Dictionary of multiword expressions – E. Bejček
(vi) Coreference additions: completion of 1st and 2nd person – A. Nedolužko
(vii) Introducing t/f articulation, coreference and verbal grammatemes in PDTSC – M. Mikulová, M. Rysová (K. Rysová), J. Mírovský
Plans for October – December 2012 and January – October 2013, cont.
(viii) Implementation of Context Extractor, a Treex module for extracting verb collocates – S. Cinková, L. Smejkalová (M. Holub)
(ix) Intensification of verbal actions (one type of “Aktionsart”): search criteria, tagging and annotation – J. Hlaváčová, A. Nedolužko
(x) Final version of the book on Syntax – J. Panevová and others