Integration on the Web
Craig Knoblock
University of Southern California
XML, Semantic Web, Web Services
• The Internet and intranets are providing unprecedented access to information
• The move towards XML and Web Services will simplify the problem of accessing this information programmatically
• The recent push towards the Semantic Web will provide a semantic grounding for this information
• But the key challenge of dynamically integrating information across these different sources still remains
• XML, Web Services, and the Semantic Web do NOT solve the integration problem
Key Challenges in Integration
• Aligning ontologies/schemas across sources
  • Ontology alignment
• Aligning records across sources
  • Record linkage/object consolidation
• Dynamically composing sources and services
  • Query planning and service composition
• Integrating diverse types of sources (beyond structured data)
  • Geospatial data integration
Record Linkage Problem
Zagat’s Restaurant Guide: Art’s Deli; California Pizza Kitchen; Campanile; Citrus; Grill, The; Philippe The Original; Spago
Health Dept Restaurant Listings: Art’s Delicatessen; Ca’ Brea; CPK; The Grill; Patina; Philippe’s The Original; The Tillerman
How can records be linked when they are not named consistently?
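To make the naming problem concrete, here is a minimal Python sketch of one possible name-matching approach, not the method used in the talk. It assumes a few hand-picked normalization rules (moving a trailing ", The" to the front, dropping possessives and the word "the"), an acronym check, and the standard library's `difflib.SequenceMatcher` as the string similarity. The function names, the 0.9 acronym score, and the 0.6 threshold are all illustrative assumptions.

```python
import re
from difflib import SequenceMatcher

def normalize(name):
    """Lowercase, undo a trailing ', The', drop possessives, punctuation and 'the'."""
    name = name.lower().strip()
    moved = re.match(r"^(.*),\s*the$", name)
    if moved:
        name = "the " + moved.group(1)
    name = re.sub(r"['’]s\b", "", name)      # "philippe's" -> "philippe"
    name = re.sub(r"[^a-z0-9 ]", " ", name)  # remaining punctuation -> space
    return " ".join(t for t in name.split() if t != "the")

def is_acronym(short, long):
    """True if `short` is the initials of a multi-word `long` (both normalized)."""
    tokens = long.split()
    return len(tokens) > 1 and short == "".join(t[0] for t in tokens)

def similarity(a, b):
    """Score in [0, 1]; the 0.9 for acronyms is an arbitrary illustrative value."""
    na, nb = normalize(a), normalize(b)
    if na == nb:
        return 1.0
    if is_acronym(na, nb) or is_acronym(nb, na):
        return 0.9
    return SequenceMatcher(None, na, nb).ratio()

def link(left, right, threshold=0.6):
    """Map each name in `left` to its best match in `right`, or None below threshold."""
    links = {}
    for a in left:
        best = max(right, key=lambda b: similarity(a, b))
        links[a] = best if similarity(a, best) >= threshold else None
    return links

# Names taken from the slide.
ZAGAT = ["Art’s Deli", "California Pizza Kitchen", "Campanile", "Citrus",
         "Grill, The", "Philippe The Original", "Spago"]
HEALTH_DEPT = ["Art’s Delicatessen", "Ca’ Brea", "CPK", "The Grill",
               "Patina", "Philippe’s The Original", "The Tillerman"]

links = link(ZAGAT, HEALTH_DEPT)
for zagat_name, health_name in links.items():
    print(f"{zagat_name!r} -> {health_name!r}")
```

Hand-written rules like these cover only the variations someone anticipated, and a single global threshold is brittle, which is part of why record linkage is listed as an open challenge.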
Schema/Ontology Integration
Example from [Doan, Domingos, Levy, SIGMOD 2000]
• Problem: Automated techniques and tools for mapping a source and its corresponding ontology into an existing ontology
Existing ontology: house, with address, contact (name, phone), num-baths, amenities
Ontology for new source: house, with location, contact-info (agent-name, agent-phone), full-baths, half-baths, handicap-equipped
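As a rough illustration of why this mapping is hard to automate, the sketch below proposes correspondences purely from label overlap. It is a naive baseline under stated assumptions, not the technique from the cited paper: labels are split on hyphens, compared with Jaccard similarity, and accepted above an arbitrary `min_score`. The helper names and the threshold are hypothetical.

```python
def tokens(label):
    """Split a hyphenated label into a set of lowercase tokens."""
    return set(label.lower().split("-"))

def jaccard(a, b):
    """Jaccard similarity of two token sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def suggest_mapping(source_fields, target_fields, min_score=0.3):
    """For each source field, propose the best-scoring target field, or None."""
    mapping = {}
    for s in source_fields:
        scored = [(jaccard(tokens(s), tokens(t)), t) for t in target_fields]
        score, best = max(scored, key=lambda pair: pair[0])
        mapping[s] = best if score >= min_score else None
    return mapping

# Element names taken from the slide.
EXISTING = ["address", "contact", "num-baths", "amenities", "name", "phone"]
NEW_SOURCE = ["location", "contact-info", "full-baths", "half-baths",
              "handicap-equipped", "agent-name", "agent-phone"]

mapping = suggest_mapping(NEW_SOURCE, EXISTING)
for source_field, target_field in mapping.items():
    print(f"{source_field} -> {target_field}")
```

Label overlap finds pairs such as agent-name and name, but it proposes nothing for location or handicap-equipped, whose likely counterparts share no tokens. Closing that gap needs evidence beyond names, such as the data values themselves.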
Dynamically Composing Sources and Services
“Find laptops with wireless networking and at least 1GB RAM, 40GB HD, and a 2GHz processor, that I can buy with the funds remaining in Project X, along with product reviews”
[Figure: sources and services involved in the query: laptops, budget, Contracts, Reviews, Providers]
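The query above can be read as a small plan over several sources: select laptops by specification, filter by the project's remaining funds, then join with reviews. The sketch below hard-codes one such plan over in-memory stand-ins. All source functions, product names, prices, and the budget figure are hypothetical placeholders, and only three of the sources named on the slide are modeled. A real system would have to build this plan dynamically from source descriptions.

```python
from dataclasses import dataclass

@dataclass
class Laptop:
    model: str
    price: float
    wireless: bool
    ram_gb: float
    hd_gb: float
    cpu_ghz: float

# Hypothetical stand-ins for remote sources and services.
def laptop_source():
    return [
        Laptop("Model A", 1200.0, True, 1.0, 60.0, 2.4),
        Laptop("Model B", 1800.0, True, 2.0, 80.0, 2.8),   # over budget
        Laptop("Model C", 900.0, False, 1.0, 40.0, 2.0),   # no wireless
    ]

def budget_source(project):
    return {"Project X": 1500.0}[project]

def review_source(model):
    reviews = {"Model A": ["placeholder review text"]}
    return reviews.get(model, [])

def answer_query(project):
    """Fixed plan: spec selection -> budget filter -> join with reviews."""
    remaining = budget_source(project)
    results = []
    for laptop in laptop_source():
        meets_specs = (laptop.wireless and laptop.ram_gb >= 1
                       and laptop.hd_gb >= 40 and laptop.cpu_ghz >= 2)
        if meets_specs and laptop.price <= remaining:
            results.append((laptop.model, laptop.price, review_source(laptop.model)))
    return results

answers = answer_query("Project X")
print(answers)
```

The ordering of steps here is fixed by hand. Choosing which sources to call, in what order, and how to bind one source's output to another's input is the query planning and service composition problem.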
Geospatial Data Integration
[Figure: sources combined in the example: NIMA Gazetteer; NIMA streets vector data; GeoINT 1 meter/pixel satellite imagery; Tehran Phone Book (wrapper-extracted data); extracted features. Labels shown: Saadi Ave.; Elemi Pass.; Bakhtar Dabestane School, 275, Elemi Passage; Ashemi Dabestane School, 67, Saadi Ave.; B M Knitting Co; Or Namet Computers]
Summary
• Ontology mapping, record linkage, source and service composition are all critical technical areas
• Geospatial data integration provides a killer app for the Air Force that will drive integration research