370 likes | 387 Views
Explore the motivation, content, and conclusion of YAGO, a semantic knowledge core built from Wikipedia data. Learn how YAGO extracts ontology from text corpora for better coverage and accuracy.
E N D
YAGO – A Core of Semantic Knowledge Fabian M. Suchanek, Gjergji Kasneci, Gerhard Weikum (Max-Planck Institute for Computer Science Saarbrücken/Germany) YAGO - A Core of Semantic Knowledge
Overview ر Motivation ر The Yago ontology ر Content ر Model ر Conclusion YAGO - A Core of Semantic Knowledge
Motivation But: Maybe there are other Rock N'Roll geniuses in his generation? YAGO - A Core of Semantic Knowledge
Usual solution Which other singers were born when Elvis was born? I'm Feeling Lucky Google Search Elvis Presley - Wikipedia, the free encyclopedia Elvis Presley was born on January 8, 1935 at around 4:35 a.m. in a two-room ... Othersingers had been doing this for generations, but they were black. ... en.wikipedia.org/wiki/Elvis_PresleyCachedSimilar pages YAGO - A Core of Semantic Knowledge
Usual solution Which other singers were born in Elvis' year of birth? I'm Feeling Lucky Google Search Elvis Presley - Wikipedia, the free encyclopedia Elvis Presley was born on January 8, 1935 at around 4:35 a.m. in a two-room ... Othersingers had been doing this for generations, but they were black. ... en.wikipedia.org/wiki/Elvis_PresleyCachedSimilar pages YAGO - A Core of Semantic Knowledge
Usual solution Will you please give me IMMEDIATELY all singers that wer I'm getting angry Google Search Elvis Presley - Wikipedia, the free encyclopedia Elvis Presley was born on January 8, 1935 at around 4:35 a.m. in a two-room ... Othersingers had been doing this for generations, but they were black. ... en.wikipedia.org/wiki/Elvis_PresleyCachedSimilar pages Reason: Google does not search knowledge, but Web pages YAGO - A Core of Semantic Knowledge
Solution: An ontology singer is a born born ? 1935 YAGO - A Core of Semantic Knowledge
Solution: An ontology person subclass Classes singer Relations is a is a born born ? Individuals 1935 means means Words "Elvis Presley" "The King" YAGO - A Core of Semantic Knowledge
Where do we get the ontology from? Previous approaches: رAssemble the ontology manually (WordNet, SUMO, GeneOntology) Problem: Usually low coverage ر Extract the ontology from corpora (e.g. the Web) (KnowItAll, Espresso, Snowball, LEILA) Problem: Usually low accuracy (50%-92%) YAGO - A Core of Semantic Knowledge
Where do we get the ontology from? YAGO approach: Assemble the ontology from Wikipedia (=> good coverage) Use the category system of Wikipedia (=> good accuracy) YAGO - A Core of Semantic Knowledge
Exploiting the Wikipedia category system Elvis Pr born 1935 blah blah blub Elvis (don't read this! Better listen to the talk!) laber fasel suelz. Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Blub, aber blah! Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Insbesondere, blub, texte zu, und so weiter Exploit relational categories Categories: 1935_births YAGO - A Core of Semantic Knowledge
Exploiting the Wikipedia category system American_singer Elvis Pr is a born 1935 blah blah blub Elvis (don't read this! Better listen to the talk!) laber fasel suelz. Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Blub, aber blah! Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Insbesondere, blub, texte zu, und so weiter Exploit relational categories Exploit conceptual categories Categories: American_singers YAGO - A Core of Semantic Knowledge
Exploiting the Wikipedia category system Disputed_article American_singer Elvis Pr is a is a born 1935 blah blah blub Elvis (don't read this! Better listen to the talk!) laber fasel suelz. Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Blub, aber blah! Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Insbesondere, blub, texte zu, und so weiter Exploit relational categories Exploit conceptual categories Categories: Avoid administrational categories Disputed_articles YAGO - A Core of Semantic Knowledge
Exploiting the Wikipedia category system Rock'n_Roll_Music American_singer Elvis Pr is a is a born 1935 blah blah blub Elvis (don't read this! Better listen to the talk!) laber fasel suelz. Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Blub, aber blah! Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Insbesondere, blub, texte zu, und so weiter Exploit relational categories Exploit conceptual categories Categories: Avoid administrational categories Rock'n_Roll_Music Avoid thematic categories YAGO - A Core of Semantic Knowledge
Thematic vs Conceptual Categories conceptual: American singers of German origin thematic: Rock N'Roll music in America Shallow linguistic noun phrase parsing: Premodifier Head Postmodifier Heuristics: If the head is a plural word, the category is conceptual YAGO - A Core of Semantic Knowledge
The Upper Model entity ? person American_singer is a born 1935 YAGO - A Core of Semantic Knowledge
The Upper Model: From Wikipedia? Business Social_group ? People_by_occupation American_singer is a born 1935 YAGO - A Core of Semantic Knowledge
The Upper Model: From WordNet? Person#3 Singer#17 Singer#1 ... ? American_singer is a born 1935 YAGO - A Core of Semantic Knowledge
The Upper Model: From WordNet? Person#3 Singer#17 Singer#1 ... ! American_singer is a born 1935 YAGO - A Core of Semantic Knowledge
The YAGO ontology Person#3 subclass Singer#1 means subclass "singer" American_singer is a born 1935 "Elvis Presley" means YAGO - A Core of Semantic Knowledge
The YAGO ontology: Accuracy YAGO - A Core of Semantic Knowledge
The YAGO ontology: Number of Facts 6,000,000 Ontologies should not be judged purely by the number of facts! This is just an informational overview. 2,000,000 30,000 60,000 200,000 300,000 Yago KnowItAll SUMO WordNet OpenCyc Cyc YAGO - A Core of Semantic Knowledge
The Yago Model: Why binary is not enough singer (Elvis, is_a, singer) (But only from 1953 to 1977) is a (We know this from Wikipedia) YAGO - A Core of Semantic Knowledge
The Yago Model: Why binary is not enough singer #1 (Elvis, is_a, singer) #2 (#1, time, 1953-1977) #3 (#1, source, Wikipedia) time 1953-1977 is a source Wikipedia YAGO - A Core of Semantic Knowledge
The Yago model formally • A YAGO ontology over • a set of relations R • a set of common entities C • a set of fact identifiers I • is a function • I (RCI) R (RIC) #1 (Elvis, is_a, singer) #2 (#1, time, 1953-1977) #3 (#1, source, Wikipedia) • We can talk about • facts (#1, source, Wikipedia) • additional arguments (#1, time, 1953-1977) • relations (time, hasRange, time_interval) YAGO - A Core of Semantic Knowledge
The Yago model: Logical aspects Axioms: (x, is_a, y) (y, subclass, z) => (x, is_a, z) ... person subclass singer is a is a YAGO - A Core of Semantic Knowledge
The Yago model: Logical aspects finite, unique f1, f2, f3, f4, f5, f6, f7, f8, f9, f10 Axioms: (x, is_a, y) (y, subclass, z) => (x, is_a, z) ... derive facts f1, f2, f3, f4, f5 Eliminate facts f1, f2, f3 finite, unique YAGO - A Core of Semantic Knowledge
Other singers of Elvis' generation Which other singer was born in the same year as Elvis? http://www.mpi-inf.mpg.de/yago Enter your Yago Query: "Elvis Presley" bornInYear $year $other bornInYear $year $other isa singer YAGO - A Core of Semantic Knowledge
YAGO – A Core of Semantic Knowledge - Opera YAGO - A Core of Semantic Knowledge
I am a screenshot Subgraph of YAGO Singer born in the same year as Elvis: Utah Phillips YAGO - A Core of Semantic Knowledge
Applications What can you do with YAGO? http://www.mpi-inf.mpg.de/yago • Query it (See our WWW 2007 poster "How NAGA Uncoils") • Download it (use it as "WordNet + Individuals", e.g. for disambiguation) • Find other Elvises YAGO - A Core of Semantic Knowledge