130 likes | 221 Views
Searching for knowledge with UIMA. IBM Research J. William Murdock Christopher Welty David Ferrucci. Last Update: May 14, 2006. Within-document analysis. Relation: OwnerOf. Entity: Organization. Entity: Person. Relation: OwnerOf. Entity: Organization. Entity: Person.
E N D
Searching for knowledge with UIMA IBM Research J. William Murdock Christopher Welty David Ferrucci Last Update: May 14, 2006
Within-document analysis Relation: OwnerOf Entity:Organization Entity:Person Relation: OwnerOf Entity:Organization Entity:Person OwnerOf (Relation Annotation) OwnerOf (Relation Annotation) Person (Entity Annotation) Organization (Entity Annotation) Person (Entity Annotation) Person (Entity Annotation) Organization (Entity Annotation) Joseph Gradgrind, who is the owner of Gradgrind Foods, ... Joe Gradgrind, owner of GF, ... doc1.txt doc2.txt
Cross-document coreference Relation: OwnerOf Entity:Organization Entity:Person Relation: OwnerOf Entity:Organization Entity:Person Relation: OwnerOf Entity:Organization Entity:Person OwnerOf (Relation Annotation) OwnerOf (Relation Annotation) Person (Entity Annotation) Organization (Entity Annotation) Person (Entity Annotation) Person (Entity Annotation) Organization (Entity Annotation) Joseph Gradgrind, who is the owner of Gradgrind Foods, ... Joe Gradgrind, owner of GF, ... doc1.txt doc2.txt
EKDB: Extracted Knowledge Database(same information, in relational tables) Relation Arguments Referents * Not shown: component ID’s,confidences, etc. Annotations Spans Documents Names doc1.txt doc2.txt
Entity Search Subject of interest: User query: All persons named “Joe Gradgrind” All entities named “Joe Gradgrind” All persons
Entity Search in EKDB Interface EKDB User Query doc1.txt doc2.txt
Browsing entities found by Entity Search User query Entities matchingthe query Names ofthe entities doc1.txt doc2.txt doc88.txt Documents in whichthe entities occur ... Joseph Gradgrind, who is the owner of Gradgrind Foods, ... Spans inthe documents Facts (relations)involving the entities ... Browsing facts
Fact Search Subject of interest: User query: Some person named “Joe Gradgrind” owns some organization named “Gradgrind Foods” OwnerOf OwnerOf Some entity named “Joe Gradgrind” owns some organization OwnerOf Some person owns something Some relationship from some entity named “Joe Gradgrind” to some entity named “Gradgrind Foods” ...
Fact Search in EKDB Interface EKDB User Query OwnerOf doc1.txt doc2.txt
Browsing facts (relations) found by Fact Search User query ManagerOf Facts matchingthe query OwnerOf doc1.txt doc2.txt Documents in whichthe facts occur Joseph Gradgrind, who is the owner of Gradgrind Foods, Spans inthe documents Entities involvedin the facts ... ... Browsing entities
Fact chain search User query: Subject of interest: ??? Some (complex?) relationship between a person named “Joe Gradgrind” and a city named “Manchester” CitizenOf SubPlace OwnerOf BasedIn Near
Fact pattern search Subject of interest: A person that that resides in Leeds and owns an organization in Stockport User query: ResidesIn OwnerOf BasedIn
Status • Entity Search & Fact Search implemented in SAW 1 • But limited interaction between the two • Thus misses some of the recursive nature of browsing entities and facts (entities participate in facts, that contain entities, etc.) • Prototype of Fact Chain Search implemented in a SAW 1 variant • No metrics for “interestingness” of chains yet • Fact Search implemented in SAW 2 • More capabilities on the way • Fact Pattern Search: Future work