230 likes | 739 Views
Natural Language Processing. Daniele Quercia Fall, 2000. Presentation Outline. Introduction to the Natural Language Processing (NLP) Applications of NLP Discussion. Introduction to Natural Language Processing (NLP). Representation of the input meaning. Natural Language (English).
E N D
Natural Language Processing Daniele Quercia Fall, 2000
Presentation Outline • Introduction to the Natural Language Processing (NLP) • Applications of NLP • Discussion
Representation of the input meaning Natural Language (English) NLP • What is Natural Language processing (NLP)? • “The Natural Language Processing (NLP) is the engineering of systems that process or analyse written or spoken natural language.” • NLP • System • (computational model)
Psychology • Neuroscience • Natural • Language • Processing • Computer Science • Linguistics Related disciplines
A possible file: • PIPPO.TXT • Hallo Pippo. • Ciao Pippo. • Auf widersehen Pippo. • Unix wc program • total number of • bytes, • words, • lines • in a text file. • What is a word? Knowledge of language • These language processing applications use knowledge of language
Knowledge of language - 2 • Types of knowledge used in NLP • Phonetics and Phonology • Morphology • Syntax • Semantics • Pragmatics • Discourse • NLP can be viewed as resolving ambiguity at one of these levels.i
Knowledge of language - 3 • Example: HAL computer system. • capable of interacting with humans • Stanley Kubrik’s film • 2001: A Space Odissey • Conversation between HAL and Dave
Knowledge of language - 4 • Example(cont’d): HAL computer system. • capable of interacting with humans • Stanley Kubrik’s film • 2001: A Space Odissey • Dave: Open the pod bay doors, right now. • HAL: I’m sorry Dave, I’m afraid I can’t do that.
Syntax • The study of the structural relationship between words • but syntax don’t tell us much about the meaning … • e.g. : • I saw the Grand Canyon flying to New York • Time flies like an arrow Knowledge of language - 5 • Phonetics and Phonology • The study of linguistic sounds • How word are pronounced in colloquial speech
Semantics • The study of meaning • e.g. : • I work for J.F. Zich • and he’s a good boss • which is a good company • Pragmatics • The study of how language is used to accomplish goals • e.g. • Can youn tell me what time is it? • Could I have the salt? Knowledge of language - 6
Morphology • The study of the meaningful components of words. • e.g. : • Kick, kicks, kicked, kicking • Arm, army Knowledge of language - 7 • Discourse • The study of linguistic linguistic units larger than a single utterance
Machine translation • Information retrieval Applications of NLP • Text-based applications • Dialogue based applications • Speech recognition
Applications of NLP • Machine translation • Automatic machine translation • translates texts from one language to another • Important constraint • The text needs to be restricted to a limited range of subjects • Windows 98 deve essere distribuito • senza Internet Explorer o insieme al navigatore del concorrente Netscape • Windows 98 must be delivered either without Internet Explorer or in connection with the sailor of the competitor Netscape
Applications of NLP • Machine translation • even the best system produces poor translation… • … but Machine-assisted translation uses a computer to help a translator.
Applications of NLP • Information retrievial • concerns the retrieving of relevant information from databases.
Applications of NLP • Information retrievial (cont’d) • Inverted index • Retrieval depends only on how often each word appears in a document • Should we treat all words equally?
Applications of NLP • Information retrievial (cont’d) • Stop list: words to ignore • Many frequent words in English are function words, which are useless.
Applications of NLP • Speech recognition • Speech generation is comparatively easy, but recognition is hard ! • The acoustic signal is highly ambigous • Disambiguation using statistics • If …they buy… occurs more frequently than …they by …, it will chosen
Conclusions • Computers can process natural language in a variety of interesting ways • A computer can’t do anything close to “understanding” language … • NLP is almost never error-free