1 / 8

LING 581: Advanced Computational Linguistics

LING 581: Advanced Computational Linguistics. Lecture Notes March 9th. Scheduling. Spring break next week Due date: Wednesday 23 rd March. Homework Task. Treebank There are 180487 VPs in the Wall Street Journal section Q: what kinds of verb frames are attested? EVCA Project

leroy
Download Presentation

LING 581: Advanced Computational Linguistics

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. LING 581: Advanced Computational Linguistics Lecture Notes March 9th

  2. Scheduling • Spring break next week • Due date: Wednesday 23rd March

  3. Homework Task Treebank • There are 180487 VPs in the Wall Street Journal section • Q: what kinds of verb frames are attested? EVCA Project • Pick verbs that exist in EVCA (evca93.index) and also in the PTB • Produce a report that compares EVCA with what is present in the corpus

  4. Additional Annotation for the Treebank An annotated corpus for the analysis of VP ellipsis • http://www.let.rug.nl/bos/vpe/ • Verb Phrase Ellipsis (VPE) has been studied in great depth in theoretical linguistics, but empirical studies of VPE are rare. We extend the few previous corpus studies with an annotated corpus of VPE in all 25 sections of the Wall Street Journal corpus (WSJ) distributed with the Penn Treebank. […] • Our annotation is theory neutral, and has better coverage than earlier efforts that relied on automatic methods, e.g. simply searching the parsed version of the Penn Treebank for empty VP's achieves a high precision (0.95) but low recall (0.58) when compared with our manual annotation.

  5. New Topic From stochastic parsing to the (near) latest in syntax • Paper: • Derivation by Phase (DbP) (Chomsky) • (manuscript 1999, published 2001) • 4 files on usb drive • Reading Homework over Spring Break • dpb.pdf(the unlocked published version) you may find the following notes very useful • JU_DbP_1.pdf (DbyP with inline notes from Juan Uriagereka) • JU_DbP_2.pdf (part 2) • Yoon_DbP.pdf (notes from James Yoon, UIUC)

  6. Software (next time) Goal: linguistic theory an action

  7. Software (next time) Graphical Version SWI-Prolog TCL/TK Definite Clause Grammar (DCG) implementation Of DbyP

  8. Introduction • Derivation by Phase (DbP) (Chomsky) • (manuscript 1999, published 2001)

More Related