10 likes | 102 Views
candidate text. phrase. candidate text. phrase. most coherent text. sentence. candidate text. paragraph. …. sentence. candidate text. ADGEN: Advanced Generation for Question Answering Univ. of Southern California (Information Sciences Institute). OBJECTIVES
E N D
candidate text phrase candidate text phrase most coherent text sentence candidate text paragraph … sentence candidate text ADGEN: Advanced Generation for Question AnsweringUniv. of Southern California (Information Sciences Institute) • OBJECTIVES • Determine, empirically, what makes a text coherent • Focus on ordering, redundancy, contradictory • information understanding, and answering • Use sentence-based bag generation • algorithm to fuse phrases into grammatical • sentences and text-based bag-generation • algorithm to order sentences as coherent text; • system backend for answer generation • Apply probabilistic rewriting operations on • existing text to produce alternate version with fewer • grammatical errors and incoherencies • PLAN • Data preparation and processing (texts totaling 1 billion English words) • Problem preparation, initial feature study, baseline system, error analysis, prepare and evaluate normalized problem for Sentence Ordering, Redundancy, Contradiction • Package and test; work with systems focused on answer retrieval and apply coherence-judgments to retrieved text Principal Investigator: Daniel Marcu, Kevin Knight Topic Area: Component Data Dimension: English News Text