1 / 8

Where is future QA?: One pilot and two main proposals

Cross-Language Evaluation Forum, CLEF 2006. Where is future QA?: One pilot and two main proposals. Diana Santos Luís Costa Linguateca, www.linguateca.pt and SINTEF ICT. Alicante, September 22th. QolA: collaboration among Web services. Proposal for a pilot in QA

Download Presentation

Where is future QA?: One pilot and two main proposals

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Cross-Language Evaluation Forum, CLEF 2006 Where is future QA?:One pilot and two main proposals Diana Santos Luís Costa Linguateca, www.linguateca.pt and SINTEF ICT Alicante, September 22th

  2. QolA: collaboration among Web services • Proposal for a pilot in QA • Pilot: a different setup that should not replace the main track, but whose (positive or negative) results hopefully influence future main tracks • Rationale (generic) • Semantic Web technology is a buzzword (funding, visibility, available technolog(ies)) • An evaluation contest should also promote collaboration

  3. QA rationale • most interesting questions have multiple answers • the path to get the answer is often more informative than the answer alone • most (naive?) people consult the Web (i.e, they don’t care whether it was system X or system Y or person A or B) • integration of different methodologies, different answer snippets, different answer rankings is the key to future QA technology (not monolithical systems with a proprietary database) • the size/form of the perfect answer depends on the questioner (and on the answer set)

  4. QA@CLEF rationale • We have 30-36-40 separate systems which provide answers together with snippets that justify them • The systems do not know which kind of question they are answering • don’t need to know, either, where did the answer came from (people, or other systems) • Some systems even provide their own confidence judgement • Wide variety of approaches (and languages) BUT • There does not seem to have been significant individual progress • There is very little crosslinguality • There is no cooperation

  5. Practical setup • Some systems would have to be “refactored” as Web services • eventually only invocable by CLEF registered participants • A basic infrastructure of communication among QA agents has to be deployed • communication protocol • service identification • Enough participants (both systems and groups, for more than one language) interested Is there any interest out there?

  6. Two additions for the main track: the Web • Add the Web to our collections! • If you can justify your answer with a Web document (cached), do it! • there is absolutely nothing that changes in the main track • assessors still have only to look at snippets • Who cares which Web you have indexed or which major search engine API you invoke behind the scenes?

  7. Two additions for the main track: reasoning • Ask non-trivial questions! • Devise relational questions, that is, that require the system (the answerer) to understand and use a particular relationship between 2 (or more) facts • Examples: • Whose politician’s wife runs for the White House? • Who was the first African Nobel prize winner? • How long was X’s murderer in prison? • Systems can always find more (factoid) answers when we thought there was only one in our collection... so a reanalysis of question types is always necessary.

  8. Summing up Main track • 30 difficult questions next year • with the Web (in whatever format) as an additional/alternative justification source The QolA pilot • a pilot exploring cooperation between different systems • comparing answers • validating answers • merging different information • in a way, the logical conclusion/extension of WiQA and AVE

More Related