260 likes | 388 Views
Linguateca www.linguateca.pt. A Geographic Knowledge Base for Semantic Web Applications. Marcirio Silveira Chaves Mário J. Silva Bruno Martins. 20º Brazilian Symposium on Databases - SBBD 2005 Uberlândia - MG. Motivation/Context. GKB - Geographic Knowledge Base Geographic Network
E N D
Linguateca www.linguateca.pt A Geographic Knowledge Base for Semantic Web Applications Marcirio Silveira Chaves Mário J. Silva Bruno Martins 20º Brazilian Symposium on Databases - SBBD 2005 Uberlândia - MG
Motivation/Context • GKB - Geographic Knowledge Base • Geographic • Network • Information exported as ontologies • Geographic-aware Semantic Web applications • GREASE – Geographic Reasoning for Search Engines 20º Brazilian Symposium on Databases
Presentation Structure • Conceptual Design of GKB • Knowledge Integration • Using Geographic Knowledge in GKB • GKB as an Ontology • Statistics of the Ontologies Created • Applications using GKB • Final Remarks 20º Brazilian Symposium on Databases
Geo-Administrative and Geo-Physical Domain Administrative Postal Gazetteers Wikipedia Network Domain FCCN Web domains Web sites Information Sources used by GKB 20º Brazilian Symposium on Databases
Architecture of GKB 20º Brazilian Symposium on Databases
Feature concept in GKB • A meaningful object in the selected domain of discourse [ISO19109]. Ex.: • countries, cities and localities 20º Brazilian Symposium on Databases
Conceptual Design of GKB • GKB meta-model 20º Brazilian Symposium on Databases
Presentation Structure • Conceptual Design of GKB • Knowledge Integration • Using Geographic Knowledge in GKB • GKB as an Ontology • Statistics of the Ontologies Created • Applications using GKB • Final Remarks 20º Brazilian Symposium on Databases
Knowledge Integration in GKB • GKB hierarchy from different information sources • Algorithm: • It searches the lowest common features types in both hierarchies • If it holds, it identifies the common instances between the hierarchies • Once the common instances are identified, it goes up the hierarchy and searches for the lowest common ancestor • It verifies the distance (in number of relationships partOf) between the common instances of the features types and its ancestors. The ancestor, which has the small distance up to the common instances is merged through a relationship partOf with the ancestor in the another hierarchy. The existing relationships in both hierarchies are maintained. 20º Brazilian Symposium on Databases
H2 H1 Porto Norte NUT2 DISTRITO Grande Porto NUT3 Tâmega MUNICIPALITY MUNICIPALITY Vila Nova de Gaia Vila Nova de Gaia Penafiel Penafiel Matosinhos Matosinhos Knowledge Integration in GKB • GKB hierarchy from different information sources 20º Brazilian Symposium on Databases
H2 H1 Porto Norte NUT2 DISTRITO Grande Porto NUT3 Tâmega MUNICIPALITY MUNICIPALITY Vila Nova de Gaia Vila Nova de Gaia Penafiel Penafiel Matosinhos Matosinhos Knowledge Integration in GKB • GKB hierarchy from different information sources 20º Brazilian Symposium on Databases
H2 H1 Porto Norte NUT2 DISTRITO Grande Porto NUT3 Tâmega MUNICIPALITY MUNICIPALITY Vila Nova de Gaia Vila Nova de Gaia Penafiel Penafiel Matosinhos Matosinhos Knowledge Integration in GKB • GKB hierarchy from different information sources 20º Brazilian Symposium on Databases
Merged Hierarchy Norte Grande Porto Vila Nova de Gaia Porto Tâmega Penafiel Matosinhos Knowledge Integration in GKB 20º Brazilian Symposium on Databases
Presentation Structure • Conceptual Design of GKB • Knowledge Integration • Using Geographic Knowledge in GKB • GKB as an Ontology • Statistics of the Ontologies Created • Applications using GKB • Final Remarks 20º Brazilian Symposium on Databases
Using Geographic Knowledge in GKB • Geographic scopes • www.cm-lisboa.pt • Lisboa (municipality) • Rules • New relationships and knowledge • Description Logics (DLs) • Geo domain • Names composed of multiple words are represented in different ways • Network domain • Names of URLs are decomposed by the correspondent domain division 20º Brazilian Symposium on Databases
Using Geographic Knowledge in GKB • ABox in DLs for the: • municipality of Santiago do Cacém geoFeatureName(270,“santiagodocacem”) geoFeatureName(270,“santiagocacem”). geoFeatureName(270,“santiago-do-cacem”). geoFeatureName(270,“santiago-cacem”). geoFeatureType(270,“CON”). • web site: www.cm-santiago-do-cacem.pt netSiteSubDomain(33684,“www”). netSitePrefix(33684,“cm”). netSiteDomainToken(33684,“santiago-do-cacem”). netSiteTLD(33684,“pt”). 20º Brazilian Symposium on Databases
Using Geographic Knowledge in GKB • Terminology Description (TBox in DLs) • Municipalities hasScope(idN,idG) netSiteDomainToken(idN,X) ((netSitePrefix(idN,“cm”) netSitePrefix(idN,“mun”)) geoFeatureType(idG,“CON”) geoFeatureName(idG,X). 20º Brazilian Symposium on Databases
Using Geographic Knowledge in GKB • Ex.: hasScope(idN,idG) netSiteDomainToken(idN,X) (netSitePrefix(idN,“cm”) netSitePrefix(idN,“mun”)) geoFeatureType(idG,“CON”) geoFeatureName(idG,X). netSiteDomainToken(33684, “santiago-do-cacem”). netSitePrefix(33684, “cm”). geoFeatureType(270, “CON”). geoFeatureName(270, “santiago-do-cacem”). New knowledge: hasScope(33684, 270). 20º Brazilian Symposium on Databases
Using Geographic Knowledge in GKB • Rule-based assigned scopes by GKB to sites of Portugal • Scopes extended to the web pages under each one of the sites of matching subdomains 20º Brazilian Symposium on Databases
Presentation Structure • Conceptual Design of GKB • Knowledge Integration • Using Geographic Knowledge in GKB • GKB as an Ontology • Statistics of the Ontologies Created • Applications using GKB • Final Remarks 20º Brazilian Symposium on Databases
GKB as an Ontology • Geo-Net-PT01 <gn:Geo_Feature rdf:ID="GEO_238"> <gn:geo_id>238</gn:geo_id> <gn:geo_name xml:lang="pt">Porto</gn:geo_name> <gn:geo_type_id rdf:resource="#CON"/> <gn:info_source_id rdf:resource="#INE"/> <gn:related_to> <rdf:Bag> <rdf:li> <gn:Geo_Relationship> <gn:rel_type_id rdf:resource="#PRT"/> <gn:geo_id><rdf:Bag> <rdf:li rdf:resource="#GEO_130"/> <rdf:li rdf:resource="#GEO_3967"/> </rdf:Bag></gn:geo_id> </gn:Geo_Relationship> </rdf:li> <rdf:li><gn:Geo_Relationship> <gn:rel_type_id rdf:resource="#ADJ"/> <gn:geo_id> <rdf:Bag> <rdf:li rdf:resource="#GEO_127"/> <rdf:li rdf:resource="#GEO_156"/> <rdf:li rdf:resource="#GEO_162"/> <rdf:li rdf:resource="#GEO_331"/> </rdf:Bag> </gn:geo_id> </gn:Geo_Relationship></rdf:li> </rdf:Bag> </gn:related_to> <gn:population>263131</gn:population> </gn:Geo_Feature> 20º Brazilian Symposium on Databases
Statistics of the Ontologies Created 20º Brazilian Symposium on Databases
Presentation Structure • Conceptual Design of GKB • Knowledge Integration • Using Geographic Knowledge in GKB • GKB as an Ontology • Statistics of the Ontologies Created • Applications using GKB • Final Remarks 20º Brazilian Symposium on Databases
Applications using GKB • NERC tool for recognizing geographicalreferences in text • Classification tool for assigning documents to a corresponding geographical scope • Information retrieval interface for geographical queries 20º Brazilian Symposium on Databases
Applications using GKB 20º Brazilian Symposium on Databases
Final Remarks • A domain-independent model for storing geographic and network knowledge • Sharing of the collected knowledge as formal ontologies • Geo-Net-PT01: The first public geographic ontology of Portugal - http://xldb.fc.ul.pt/geonetpt • Future work • Augmenting the knowledge in GKB with geographic entities extracted from the texts of the Portuguese Web 20º Brazilian Symposium on Databases