Natural Language Processing. Alexander Gelbukh (www.Gelbukh.com), Moscow, Russia; Computing Research Center (CIC), Mexico; Chung-Ang University, Korea, Electronic Commerce and Internet Application Lab. What language is. Better communication with computers.
Alexander Gelbukh Moscow, Russia
Chung-Ang University, Korea. Electronic Commerce and Internet Application Lab
Natural Language Processing Alexander Gelbukh www.Gelbukh.com
Better communication with computers: 0101011101010001101010111o101001011 vs. human language. People are more productive when speaking their own language
Accessibility of computers for all: it is easier to teach one computer how to speak than to teach generations of people how to use computers
Better knowledge management: computers are better than people at managing information
Applications • Information retrieval (Internet search, e.g., Google) • Question answering (Internet) • Information extraction (filling a database from newspapers) • Automatic translation • OCR, speech recognition • Natural language interfaces (robots, computers) • Interaction of agents • Thinking computers? • Think = speak
General scheme of text processing • Linguistic processor uses linguistic knowledge • Applied system uses other types of knowledge (e.g., Artificial Intelligence)
Language levels • Morphological: words • Syntactic: sentences • Semantic: meaning • Pragmatic: intention • ...?
Example of text “Science is important for our country. The Government pays it much attention.”
Textual representation Text is a sequence of letters. S c i e n c e i s i m p o r t a n t f o r o u r c o u n t r y . T h e G o v e r n m e n t p a y s i t m u c h a t t e n t i o n .
Morphological analysis
Morphological representation A sequence of words.
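To make the step from letters to words concrete, here is a minimal sketch in Python. It is only an illustration, not the method used in the talk: the regular expression, the `tokenize` and `analyze` helpers, and the tiny `LEXICON` are all assumptions introduced for this example.

```python
import re

# Hypothetical toy lexicon: word form -> (lemma, part of speech).
# A real morphological analyzer would use a full dictionary and rules.
LEXICON = {
    "is": ("be", "verb"),
    "pays": ("pay", "verb"),
    "science": ("science", "noun"),
    "government": ("government", "noun"),
}

def tokenize(text):
    """Split raw text into word tokens and punctuation tokens."""
    # \w+ matches a run of letters or digits; [^\w\s] matches one punctuation mark.
    return re.findall(r"\w+|[^\w\s]", text)

def analyze(token):
    """Look up a token in the toy lexicon; unknown words keep their own form."""
    lemma, pos = LEXICON.get(token.lower(), (token.lower(), "unknown"))
    return {"form": token, "lemma": lemma, "pos": pos}

text = "Science is important for our country. The Government pays it much attention."
tokens = tokenize(text)
analyses = [analyze(t) for t in tokens]

for a in analyses:
    print(a["form"], a["lemma"], a["pos"])
```

The result is a flat list with one entry per word or punctuation mark, which is the "sequence of words" that the next processing level takes as input.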
Syntactic parsing
Syntactic representation A sequence of syntactic trees.
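One simple way to hold a syntactic tree in a program is as nested tuples. The sketch below encodes one possible analysis of the first example sentence. The category labels (S, NP, VP, AP, PP) and the tree shape are assumptions chosen for illustration; the slides do not specify a particular grammar formalism.

```python
# A node is (label, child, child, ...); a leaf is (part_of_speech, word).
# The labels and the structure are one hypothetical analysis, not the only one.
tree = ("S",
        ("NP", ("N", "Science")),
        ("VP", ("V", "is"),
               ("AP", ("A", "important"),
                      ("PP", ("P", "for"),
                             ("NP", ("Det", "our"), ("N", "country"))))))

def leaves(node):
    """Return the words of a tree in left-to-right order."""
    label, *children = node
    if len(children) == 1 and isinstance(children[0], str):
        return [children[0]]          # leaf: (part_of_speech, word)
    words = []
    for child in children:
        words.extend(leaves(child))
    return words

def depth(node):
    """Return the number of levels in the tree, counting word leaves as level 1."""
    label, *children = node
    if len(children) == 1 and isinstance(children[0], str):
        return 1
    return 1 + max(depth(child) for child in children)

print(leaves(tree))
print(depth(tree))
```

A whole text would then be a list of such trees, one per sentence. Reading the leaves back gives the word sequence of the morphological level.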
Semantic analysis
Semantic representation Complex structure of whole text
The meaning. The Spanish text “La ciencia es importante para nuestro país. El Gobierno le pone mucha atención.” and the English text “Science is important for our country. The Government pays it much attention.” share the same meaning: There are good conditions for the development of science in our country.
Problems • Ambiguity of text, e.g., “I see a cat with a telescope” • Knowledge needed • Linguistic • About the world and life. Good news • Learning from texts • Plenty of texts on the Internet! • Good statistical methods
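The ambiguity of “I see a cat with a telescope” can be shown by counting parses. The sketch below uses a CKY-style chart over a tiny grammar in which “with a telescope” may attach either to the verb phrase (seeing by means of a telescope) or to the noun phrase (a cat that has a telescope). The grammar rules, the lexicon and the function name `count_parses` are assumptions made up for this illustration.

```python
from collections import defaultdict

# Hypothetical toy grammar in binary form: (parent, left child, right child).
BINARY_RULES = [
    ("S", "NP", "VP"),
    ("VP", "V", "NP"),
    ("VP", "VP", "PP"),   # the phrase "with ..." modifies the seeing
    ("NP", "Det", "N"),
    ("NP", "NP", "PP"),   # the phrase "with ..." modifies the cat
    ("PP", "P", "NP"),
]

# Hypothetical toy lexicon: word -> possible categories.
LEXICAL_RULES = {
    "i": ["NP"], "see": ["V"], "a": ["Det"],
    "cat": ["N"], "with": ["P"], "telescope": ["N"],
}

def count_parses(words, root="S"):
    """Count the distinct parse trees of a word list under the toy grammar."""
    n = len(words)
    chart = defaultdict(int)  # (start, end, label) -> number of trees
    for i, word in enumerate(words):
        for label in LEXICAL_RULES.get(word.lower(), []):
            chart[(i, i + 1, label)] += 1
    # Fill longer spans from shorter ones.
    for width in range(2, n + 1):
        for start in range(0, n - width + 1):
            end = start + width
            for split in range(start + 1, end):
                for parent, left, right in BINARY_RULES:
                    chart[(start, end, parent)] += (
                        chart[(start, split, left)] * chart[(split, end, right)]
                    )
    return chart[(0, n, root)]

sentence = "I see a cat with a telescope".split()
print(count_parses(sentence))
```

The sentence gets two parses under this grammar, one per attachment, while “I see a cat” gets one. Choosing between the two readings is where knowledge about the world, or statistics learned from texts, comes in.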
Current state Working...
Conclusions • Is it necessary? • Is it simple? • Is it possible? • Has something been done? • Has everything been done? • Where are people working on it?
Thank you! www.Gelbukh.com