
YAGO



Presentation Transcript


  1. YAGO Reporter: Qi Liu

  2. What is YAGO? • A semantic web • A knowledge base • A combination of WordNet and Wikipedia

  3. Semantic web • Advocated by the W3C (World Wide Web Consortium) • Aimed at reconstructing the WWW • A standard framework: RDF (Resource Description Framework)

  4. What is YAGO? • A semantic web • A knowledge base • A combination of WordNet and Wikipedia

  5. Knowledge base • To be: • A special database for knowledge management • To do: • Provides a means for collecting, organising, searching and utilising information • Three types: • Machine-readable knowledge bases (DBpedia) • Human-readable knowledge bases (Wikipedia) • Knowledge base analysis and design

  6. What is YAGO? • A semantic web • A knowledge base • A combination of WordNet and Wikipedia

  7. WordNet • To be: • A lexical database for English since 1985 • To do: • Groups words into synsets • Provides short, general definitions • Records the semantic relations between these synsets • 25 basic noun groups & 15 verb groups

  8. Key Concepts • Ontology vs Taxonomy • Lexicon: the bridge between a language and the knowledge expressed in that language • Syntactic (there vs their) • Semantic (sight vs site) • Pragmatic (infer vs imply)

  9. Semantics of YAGO • Five relations: • Domain • Range • subRelationOf • Type • subClassOf • Entities: • Domain • Relation • Range • Literal • ......

  10. Axiomatic rules

  11. Reasoning rules • correctness and completeness
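
The rule slides did not survive in this transcript, so the following is only a minimal sketch of how such rules can be applied, assuming three rules that follow from the relations on slide 9: transitivity of subClassOf and subRelationOf, type inheritance along subClassOf, and propagation of a fact to its super-relation. The function name `deductive_closure` and the sample facts are illustrative.

```python
from typing import Set, Tuple

Fact = Tuple[str, str, str]  # (subject, relation, object)


def deductive_closure(facts: Set[Fact]) -> Set[Fact]:
    """Apply the assumed rules repeatedly until no new fact can be derived."""
    closed = set(facts)
    while True:
        new: Set[Fact] = set()
        for (x, r, y) in closed:
            # Rule 1 (assumed): subClassOf and subRelationOf are transitive.
            if r in ("subClassOf", "subRelationOf"):
                for (y2, r2, z) in closed:
                    if r2 == r and y2 == y:
                        new.add((x, r, z))
            # Rule 2 (assumed): an instance of a class is an instance of its superclasses.
            if r == "type":
                for (c1, r2, c2) in closed:
                    if r2 == "subClassOf" and c1 == y:
                        new.add((x, "type", c2))
            # Rule 3 (assumed): a fact also holds for every super-relation of its relation.
            for (r1, r2, r_super) in closed:
                if r2 == "subRelationOf" and r1 == r:
                    new.add((x, r_super, y))
        if new <= closed:  # fixed point reached
            return closed
        closed |= new


facts = {
    ("Albert_Einstein", "type", "physicist"),
    ("physicist", "subClassOf", "scientist"),
    ("scientist", "subClassOf", "person"),
    ("givenNameOf", "subRelationOf", "means"),
    ("Albert", "givenNameOf", "Albert_Einstein"),
}
closure = deductive_closure(facts)
```

Because the fact set is finite and every rule only recombines existing names, the loop terminates; the result contains every fact the three rules can derive, which is the sense in which such a procedure is complete with respect to its rules.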

  12. The YAGO system • Knowledge extraction • YAGO storage • Enriching YAGO

  13. Knowledge extraction • TYPE relation • SUBCLASSOF relation • MEANS relation • Other relations • Meta-relations

  14. TYPE relation extraction • The Wikipedia Category System • Types: conceptual, administrative, relational, thematic • Identifying Conceptual Categories • Conceptual → TYPE • Administrative and relational ones: excluded by hand • Employ shallow linguistic parsing (Noun Group Parser) on the remaining two category types • E.g. Naturalized citizens of United States • Domain and range extracted at the same time
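
A rough sketch of the shallow parsing step above, assuming the heuristic that a category is conceptual when the head noun of its name is plural. The preposition list, the plural test, and the function names are simplifications introduced here for illustration; a real noun group parser is considerably more careful.

```python
from typing import Tuple

# Assumption: the first preposition marks where the post-modifier starts.
PREPOSITIONS = {"of", "in", "from", "by", "at", "for", "with", "on"}
# Assumption: a tiny list of irregular plurals is enough for a demo.
IRREGULAR_PLURALS = {"people", "men", "women", "children"}


def split_noun_group(category: str) -> Tuple[str, str, str]:
    """Split a category name into (pre-modifier, head, post-modifier)."""
    words = category.split()
    cut = next(
        (i for i, w in enumerate(words) if w.lower() in PREPOSITIONS),
        len(words),
    )
    before, post = words[:cut], words[cut:]
    head = before[-1] if before else ""
    return " ".join(before[:-1]), head, " ".join(post)


def looks_plural(word: str) -> bool:
    """Naive plural test; misfires on singular words ending in 's'."""
    w = word.lower()
    return w in IRREGULAR_PLURALS or (w.endswith("s") and not w.endswith("ss"))


def is_conceptual(category: str) -> bool:
    """Treat a category as conceptual when its head noun looks plural."""
    _, head, _ = split_noun_group(category)
    return looks_plural(head)


parts = split_noun_group("Naturalized citizens of United States")
```

For the slide's example the head is "citizens", so every page in that category would receive a TYPE fact for the class derived from it, while a name such as "History of France" has a singular head and would be treated as thematic.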

  15. SUBCLASSOF relation extraction • Wikipedia categories • DAG (directed acyclic graph) • Reflects merely the thematic structure • Use only the leaf categories of Wikipedia • Integrating WordNet Synsets • Match or prefer WordNet • Establishing subClassOf • American people in Japan • Exceptions • Correct manually

  16. MEANS relation extraction • Exploiting WordNet Synsets • A synset {urban center, metropolis, city} • Attach a class for the synset ‘city’ • Exploiting Wikipedia Redirects • Search “Einstein, Albert”, redirected to “Albert Einstein” • Parsing Person Names • givenNameOf subRelationOf means • familyNameOf subRelationOf means
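
The redirect and name-parsing bullets above can be sketched as follows. This is only an illustration: the function `means_facts` is hypothetical, and it assumes the simplest possible name parser, in which the first token of the page title is the given name and the last token is the family name.

```python
from typing import Iterable, List, Tuple

Fact = Tuple[str, str, str]  # (subject, relation, object)


def means_facts(entity: str, title: str, redirects: Iterable[str] = ()) -> List[Fact]:
    """Collect MEANS-style facts for one entity from its title and redirects."""
    # The page title and every redirecting title are names that mean the entity.
    facts: List[Fact] = [(title, "means", entity)]
    facts += [(alias, "means", entity) for alias in redirects]

    # Assumed name parser: first token is the given name, last is the family name.
    tokens = title.split()
    if len(tokens) >= 2:
        facts.append((tokens[0], "givenNameOf", entity))
        facts.append((tokens[-1], "familyNameOf", entity))
    return facts


einstein = means_facts(
    "Albert_Einstein",
    "Albert Einstein",
    redirects=["Einstein, Albert"],
)
```

Since givenNameOf and familyNameOf are declared as sub-relations of means, a query for everything that "Einstein" means would also reach this entity once sub-relation facts are propagated.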

  17. Other relations extraction • BornInYear & DiedInYear • EstablishedIn & LocatedIn • WrittenInYear • PoliticianOf • HasWonPrize • Filtering the Results

  18. Meta-relations extraction • Descriptions • Individual DESCRIBES URL • Witness • Fact FoundIn URL (of its witness page) • ExtractedBy • Context • Linkages between A and B: A Context B

  19. Knowledge extraction • TYPE relation • SUBCLASSOF relation • MEANS relation • Other relations • Meta-relations

  20. The YAGO system • Knowledge extraction • YAGO storage • Enriching YAGO

  21. YAGO storage • Model independent of storage • Storage: • Text files, XML, database tables, RDF

  22. Enriching YAGO • Add the fact (x, r, y) • Map x, y to existing entities (word sense disambiguation) • If mapping failed, add a new entity. • Map r to the YAGO ontology • If mapping succeeded, add a FoundIn relation • If mapping failed, add a new fact!
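
One possible reading of the steps above, written as a small sketch. It assumes that the last two bullets distinguish between a fact that is already stored (record one more FoundIn witness) and a fact that is not (store it as new). The class `TinyKB`, its method names, and the exact-match stand-in for word sense disambiguation are all hypothetical.

```python
from typing import Dict, Set, Tuple

Fact = Tuple[str, str, str]  # (subject, relation, object)


class TinyKB:
    """Toy knowledge base that keeps a set of witness URLs per fact."""

    def __init__(self, relations: Set[str]):
        self.entities: Set[str] = set()
        self.relations: Set[str] = set(relations)
        self.witnesses: Dict[Fact, Set[str]] = {}

    def map_entity(self, name: str) -> str:
        # Stand-in for word sense disambiguation: exact string match only.
        # An unknown name becomes a new entity.
        if name not in self.entities:
            self.entities.add(name)
        return name

    def add_fact(self, x: str, r: str, y: str, source_url: str) -> str:
        # The relation must map onto one that the ontology already defines.
        if r not in self.relations:
            raise ValueError(f"unknown relation: {r}")
        fact = (self.map_entity(x), r, self.map_entity(y))
        if fact in self.witnesses:
            # Known fact: only record an additional FoundIn witness.
            self.witnesses[fact].add(source_url)
            return "witness added"
        # Unknown fact: store it together with its first witness.
        self.witnesses[fact] = {source_url}
        return "fact added"


kb = TinyKB({"bornInYear"})
first = kb.add_fact("Albert_Einstein", "bornInYear", "1879", "http://example.org/a")
second = kb.add_fact("Albert_Einstein", "bornInYear", "1879", "http://example.org/b")
```

Keeping witnesses per fact means that repeated extraction of the same statement raises its support instead of duplicating it.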

  23. Summary on YAGO1 • 1M entities & 5M facts • Accuracy around 95%

  24. YAGO2: In Time, Space and Many Languages • YAGO: about 100 manually defined relations • Build the YAGO2 architecture on four kinds of rules: • Factual rules • E.g. exceptions, definitions of all relations, domains, ranges and classes • Implication rules • Infer new facts from the facts in the database • Replacement rules • Normalize numbers, tags and other formats • Extraction rules • Extract facts from a given source text

  25. Temporal Dimension • People: wasBornOnDate & diedOnDate • Groups: wasCreatedOnDate & wasDestroyedOnDate • Artifacts (buildings, songs, cities): [same as above] • Events: startedOnDate & endedOnDate => startExistingOnDate & endExistingOnDate • Facts • Entities in a fact => subjectStartRelation & objectStartRelation
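
The mapping on this slide, from type-specific date relations to a generic existence span, can be illustrated with a short sketch. The grouping of relations into `START_RELATIONS` and `END_RELATIONS` follows the slide; the function `existence_span` and the fact layout are assumptions made for the example.

```python
from datetime import date
from typing import Iterable, Optional, Tuple

DatedFact = Tuple[str, str, date]  # (entity, relation, date)

# Relations that the slide maps to startExistingOnDate / endExistingOnDate.
START_RELATIONS = {"wasBornOnDate", "wasCreatedOnDate", "startedOnDate"}
END_RELATIONS = {"diedOnDate", "wasDestroyedOnDate", "endedOnDate"}


def existence_span(
    entity: str, facts: Iterable[DatedFact]
) -> Tuple[Optional[date], Optional[date]]:
    """Return (start, end) of an entity's existence; None where unknown."""
    start: Optional[date] = None
    end: Optional[date] = None
    for subject, relation, when in facts:
        if subject != entity:
            continue
        if relation in START_RELATIONS:
            start = when
        elif relation in END_RELATIONS:
            end = when
    return start, end


dated_facts = [
    ("Albert_Einstein", "wasBornOnDate", date(1879, 3, 14)),
    ("Albert_Einstein", "diedOnDate", date(1955, 4, 18)),
]
span = existence_span("Albert_Einstein", dated_facts)
```

With a uniform span per entity, a fact can inherit a time interval from the entities it mentions, which is the idea behind the last two bullets of the slide.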

  26. GEO-SPATIAL Dimension • All physical objects have a location in space! • Define it with geographical coordinates, i.e. latitude and longitude => yagoGeoCoordinates, => hasGeoCoordinates • Two sources: • Wikipedia • GeoNames • locatedIn & hasGeoCoordinates & <location, TYPE, class>

  27. Textual Dimension • hasWikipediaAnchorText • hasWikipediaCategory • hasCitationTitle • subClassOf hasContext • Integrating UWN to include 200 languages
