NACLO 2009
North American Computational Linguistics Olympiad
University of North Texas site
Rada Mihalcea, Genene Murphy, Michael Mohler
What is Computational Linguistics?
• Linguistics is the study of language itself – how languages are formed, how we learn and understand languages, how languages work, etc.
  • Syntax
  • Semantics
  • Phonology
  • Morphology
  • Pragmatics
• Computational linguistics is the science of designing algorithms and programming computers to “understand” the rules of languages that we intuitively know.
Language Subtleties
• Adjective order and placement
  • A big black dog
  • A big black scary dog
  • A big scary dog
  • A scary big dog
  • *A black big dog
• Antonyms: which sizes go together?
  • Big and little
  • Big and small
  • Large and small
  • *Large and little
Headline Ambiguity
• DRUNK GETS NINE YEARS IN VIOLIN CASE
• FARMER BILL DIES IN HOUSE
• PROSTITUTES APPEAL TO POPE
• STOLEN PAINTING FOUND BY TREE
• INCLUDE CHILDREN WHEN BAKING COOKIES
• MINERS REFUSE TO WORK AFTER DEATH
Computational Linguistics in the World:
• Internet Search Engines/Social Networks
• Machine Translation
• Speech Recognition
• Artificial Intelligence
• Cryptography
• Education Technology
So… what is this Olympiad?
• The Olympiad will test your logical and analytical thinking skills to solve problems similar to the one you just saw
• You will be given a set of problems to solve individually within a three-hour period
• Success at the two state-level competitions will give you the opportunity to participate in the International Linguistics Olympiad in Poland
A Sample Problem… Doing Math in Indonesian
Definition: satu = 'one'
• Satu ditambah satu menjadi dua.
• Dua ditambah dua menjadi empat.
• Satu ditambah dua menjadi tiga.
• Dua dikalikan dua menjadi empat.
• Enam dikurangi tiga menjadi tiga.
• Sepuluh dikurangi enam menjadi empat.
• Sepuluh dibagi dua menjadi lima.
• Tiga dikalikan enam menjadi delapan belas.
• Delapan belas dikurangi satu menjadi tujuh belas.
• Tiga ditambah empat menjadi ___________.
• Tiga dikalikan tiga menjadi sembilan.
How did you solve that? • (Now imagine trying to get a computer to do the same thing!)
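One way a computer might approach the Indonesian problem is sketched below. This is only an illustration, not how the Olympiad expects you to work. It assumes the word meanings have already been worked out from the sample sentences (the `NUMBERS` and `OPERATIONS` tables are that hypothesized lexicon), treats "belas" as "plus ten" for 12 to 19, and uses integer division for "dibagi". All function names are made up for this sketch.

```python
from operator import add, sub, mul, floordiv

# Hypothesized lexicon, inferred from the sample sentences (an assumption).
NUMBERS = {
    "satu": 1, "dua": 2, "tiga": 3, "empat": 4, "lima": 5,
    "enam": 6, "tujuh": 7, "delapan": 8, "sembilan": 9, "sepuluh": 10,
}
OPERATIONS = {
    "ditambah": add,       # plus
    "dikurangi": sub,      # minus
    "dikalikan": mul,      # times
    "dibagi": floordiv,    # divided by (integer division assumed)
}


def parse_number(words):
    """Turn a list of words such as ['delapan', 'belas'] into an integer."""
    if len(words) == 1 and words[0] in NUMBERS:
        return NUMBERS[words[0]]
    # "<digit> belas" is read as digit + 10; only 12..19 are handled here.
    if len(words) == 2 and words[1] == "belas" and 2 <= NUMBERS.get(words[0], 0) <= 9:
        return NUMBERS[words[0]] + 10
    raise ValueError(f"cannot parse number: {' '.join(words)}")


def number_to_words(n):
    """Turn an integer back into words, for the range this sketch covers."""
    for word, value in NUMBERS.items():
        if value == n:
            return word
    if 12 <= n <= 19:
        return f"{number_to_words(n - 10)} belas"
    raise ValueError(f"number out of range for this sketch: {n}")


def evaluate(expression):
    """Evaluate an expression such as 'Tiga ditambah empat'."""
    words = expression.lower().rstrip(".").split()
    for i, word in enumerate(words):
        if word in OPERATIONS:
            left = parse_number(words[:i])
            right = parse_number(words[i + 1:])
            return OPERATIONS[word](left, right)
    raise ValueError(f"no operation word found in: {expression}")


def check(sentence):
    """Return True if 'X <op> Y menjadi Z.' is arithmetically consistent."""
    lhs, rhs = sentence.lower().rstrip(".").split(" menjadi ")
    return evaluate(lhs) == parse_number(rhs.split())


if __name__ == "__main__":
    print(check("Tiga dikalikan enam menjadi delapan belas."))
    print(number_to_words(evaluate("Tiga ditambah empat")))
```

The hard part, working out what each word means from the examples alone, is done by hand here. `check` only confirms that a guessed lexicon is consistent with every sentence, which is the same test a human solver applies to each hypothesis.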
Dates
• February 4, 2009: Qualifying Round @ UNT, Research Park
  • Registration starts at 8:00am, competition starts at 8:45am
• [Optional] January 29, 2009, 6pm, Research Park F223: Practice session
• March 11, 2009: Invitational Round @ UNT, Research Park
• Summer 2009: International Linguistics Olympiad Training Camp
• Summer 2009: International Linguistics Olympiad in Poland
Why Should You Participate?
• Get introduced to an ever-growing field
• Try a new challenge, with nothing to lose
• Looks great on college resumes
• A chance to travel world-wide
• No prior experience necessary – just a head for analytical thinking and a can-do attitude
Thanks!
• For more information:
  • http://lit.csci.unt.edu/index.php/NACLO_2009
  • email: naclo@cs.unt.edu
• Registration:
  • http://www.naclo.cs.cmu.edu/
• Additional problems can be found at:
  • http://www.naclo.cs.cmu.edu/practice.php