170 likes | 721 Views
YAGO – A Core of Semantic Knowledge Fabian M. Suchanek, Gjergji Kasneci, Gerhard Weikum. Hari K Devulapally Motivation But: Maybe there are other Rock N'Roll geniuses in his generation? Usual solution Which other singers were born when Elvis was born? I'm Feeling Lucky Google Search
E N D
YAGO – A Core of Semantic KnowledgeFabian M. Suchanek, Gjergji Kasneci, Gerhard Weikum. Hari K Devulapally
Motivation But: Maybe there are other Rock N'Roll geniuses in his generation?
Usual solution Which other singers were born when Elvis was born? I'm Feeling Lucky Google Search Elvis Presley - Wikipedia, the free encyclopedia Elvis Presley was born on January 8, 1935 at around 4:35 a.m. in a two-room ... Othersingers had been doing this for generations, but they were black. ... en.wikipedia.org/wiki/Elvis_PresleyCachedSimilar pages
Usual solution Which other singers were born in Elvis' year of birth? I'm Feeling Lucky Google Search Elvis Presley - Wikipedia, the free encyclopedia Elvis Presley was born on January 8, 1935 at around 4:35 a.m. in a two-room ... Othersingers had been doing this for generations, but they were black. ... en.wikipedia.org/wiki/Elvis_PresleyCachedSimilar pages Reason: Google does not search knowledge, but Web pages
YAGO • YAGO is a ontology that combines high coverage with high quality. • Information is obtained from Wikipedia and WordNet. • YAGO is based on a data model of entities and binary relations.
YAGO Model • A slight variation to RDFS. • The YAGO model also expresses relations between facts and relations.
Examples of Entities - Relations • AlbertEinstein hasWonPrize NobelPrize • “Einstein” means AlbertEinstein • AlbertEinstein type physicist • physicist subClassOf scientist
Semantics • Fact • Fact Identifier • Common Entities • YAGO ontology is defined as a function over a finite set of common entities C, a finite set of relation names R and a finite set of fact identifiers I
Exploiting the Wikipedia category system Elvis Pr born 1935 blah blah blub Elvis (don't read this! Better listen to the talk!) laber fasel suelz. Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Blub, aber blah! Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Insbesondere, blub, texte zu, und so weiter Exploit relational categories Categories: 1935_births
Exploiting the Wikipedia category system American_singer Elvis Pr is a born 1935 blah blah blub Elvis (don't read this! Better listen to the talk!) laber fasel suelz. Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Blub, aber blah! Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Insbesondere, blub, texte zu, und so weiter Exploit relational categories Exploit conceptual categories Categories: American_singers
Exploiting the Wikipedia category system Disputed_article American_singer Elvis Pr is a is a born 1935 blah blah blub Elvis (don't read this! Better listen to the talk!) laber fasel suelz. Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Blub, aber blah! Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Insbesondere, blub, texte zu, und so weiter Exploit relational categories Exploit conceptual categories Categories: Avoid administrational categories Disputed_articles
Exploiting the Wikipedia category system Rock'n_Roll_Music American_singer Elvis Pr is a is a born 1935 blah blah blub Elvis (don't read this! Better listen to the talk!) laber fasel suelz. Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Blub, aber blah! Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Insbesondere, blub, texte zu, und so weiter Exploit relational categories Exploit conceptual categories Categories: Avoid administrational categories Rock'n_Roll_Music Avoid thematic categories
Thematic vs Conceptual Categories conceptual: American singers of German origin thematic: Rock N'Roll music in America Shallow linguistic noun phrase parsing: Premodifier Head Postmodifier Heuristics: If the head is a plural word, the category is conceptual
Relations subClassOf: • The hierarchy given in Wikipedia does not help. • Word Net is used to establish the hierarchy. Means Relation • Uses word meaning to establish Means Relation. • A class for each Synset and Each word in the Synset has a Means relation with the corresponding class • Using Wikipedia Redirects • Similarly, GIVENNAMEOF and FAMILYNAMEOF relations are established.
The YAGO ontology: Number of Facts 6,000,000 Ontologies should not be judged purely by the number of facts! This is just an informational overview. 2,000,000 30,000 60,000 200,000 300,000 Yago KnowItAll SUMO WordNet OpenCyc Cyc
Conclusion • YAGO, a large and extendable ontology of high quality. • YAGO contains 1 million entities and 5 million facts – more than any other publicly available formal ontology. • YAGO has a near-human accuracy around 95%.