1 / 9

Progress report on SRL

Progress report on SRL. Abdul- Lateef Yussif 11-03-2011. Agenda. CoNLL 2004 Shared Tasks Data Set Format of Data Set. Example of annotated sentence. IOB2 Format. The IOB2 format represents chunks which do not overlap nor embed. Words outside a chunk receive the tag O.

shina
Download Presentation

Progress report on SRL

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Progress report on SRL Abdul-LateefYussif 11-03-2011

  2. Agenda • CoNLL 2004 Shared Tasks • Data Set • Format of Data Set

  3. Example of annotated sentence

  4. IOB2 Format • The IOB2 format represents chunks which do not overlap nor embed. • Words outside a chunk receive the tag O. • For words inside a chunk of type $k, the first word receives the “B-$k” tag (Begin), and the remaining words receive the tag “I-$k” (Inside).

  5. Find potential Arguments • An argument can be any consecutive words • Restrict potential arguments • BEGIN(word) = word begins argument • END(word) = word ends argument • Argument • (wi…..wj) is a potential argument iff • BEGIN(wi) = 1 and END(wj) = 1

  6. Classifiers & Features • I intend to use support vector with the following features • Words • Predicate lemmas • POS • Token Position • Path • Headword • length

  7. Data and evaluation Metrics • CoNLL 2004 dataset • Part of the Propbank Corpus • Consists from the Wall Street Journal of the Penn Treebank • Training (Section 15-18) • Development (Section 20) • Testing data (Section 21)

  8. Hypothesis • Target is to replicate and improved on Best System performance

  9. Questions Thank you

More Related