The Document as a Model for Universal Information Integration

Tim Matthews, Ipedo, Inc. Open Publish 2002.


Presentation Transcript


  1. The Document as a Model for Universal Information Integration
     Tim Matthews, Ipedo, Inc.
     Open Publish 2002

  2. Topics
     • Data or Document?
     • XML – Where are We With Content?
     • XML Standards
     • Putting It To Practice
       • XML to HTML - Basics
       • XML Personalization - Mobile
       • Dynamic XML Catalogs - Web
     (c) 2002 Ipedo, Inc.

  3. Where is My Data?
     • Fact 1: 70-80% of all corporate data is in unstructured form – documents and “content”
     • Fact 2: One of the fastest growing sources of corporate data is Web pages – a.k.a. “content”
     • Fact 3: One of the best areas of ROI in corporate infrastructure is data and code re-use across multiple projects
     • Conclusion: One of the fastest growing sources of information in companies is potentially the hardest to re-use!

  4. XML Documents Can Be Data Too
     • XML is a document standard
       • You may call what you use messages, files or pages, but they are documents
     • XML tags are defined by DTDs or Schemas
       • So you know what you are getting, sending or storing
     • XML allows document-level views with data-level access
       • The unit of information is familiar, but the capabilities are sophisticated

  5. Universal Information Integration
     • Document-level views with data-level access
     • Tailored access to broad information
     • Every document (page, message, file) is a potential source of information
     • It can be done in real time and used by applications, not just people
     • XML is the magic

  6. Why XML?
     • It’s Open
       • Customers and smaller providers like open technology.
     • It’s Smarter
       • Combine presentation with data description. Makes sense.
     • It’s More Efficient
       • Better description, search, logic and transform are promising.

  7. The XML Numbers
     • META Group: “By 2003, 95% of the Global 2000 will deploy XML-based content management infrastructures across Internet, extranet and Intranet venues.”
     • Gartner: “By the end of 2000, 75% of Fortune 500 companies will be using XML in at least one prototype project.”
     • Almost every analyst has similar numbers
     • More importantly, it’s happening in real companies in a big way: Cisco, Dell and IBM, to name a few

  8. XML and Content Delivery … and (Some of) the Standards
     • Integrate (XML Schema, XSL)
       • Consolidate
       • Cleanse
       • Standardize
     • Manage (XML Schema, XML Query)
       • Organize
       • Discover
     • Deliver (XSL, SOAP)
       • Re-use
       • Customize/Personalize
       • Deploy

  9. XML Schema
     • XML Schema (Schema)
       • The XML Schema recommendation provides a means of specifying element content in terms of data types, so that document type designers can provide criteria for validating the content of elements as well as the markup itself.
     • Schemas are written as XML files
       • Avoids interoperability problems
     • What element types? How do they combine?
     • What constraints are there on date values? Integer values?
     • The UII “Model Descriptor”
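
The questions on slide 9 about date and integer constraints can be made concrete with a small sketch. Python's standard library does not include an XML Schema validator, so this is a hand-rolled stand-in for the kind of data type check a schema expresses, not an implementation of XML Schema itself. The element names `order`, `quantity` and `shipDate`, and the `TYPES` table, are hypothetical.

```python
import xml.etree.ElementTree as ET
from datetime import date

# Hypothetical "model descriptor": element name -> parser for its content.
# A real XML Schema would declare these as xs:integer and xs:date.
TYPES = {
    "quantity": int,
    "shipDate": date.fromisoformat,
}

def type_errors(xml_text):
    """Return (element name, text) pairs whose content fails its declared type."""
    errors = []
    for elem in ET.fromstring(xml_text).iter():
        parser = TYPES.get(elem.tag)
        if parser is None:
            continue  # no type declared for this element
        try:
            parser((elem.text or "").strip())
        except ValueError:
            errors.append((elem.tag, elem.text))
    return errors

GOOD = "<order><quantity>12</quantity><shipDate>2002-05-14</shipDate></order>"
BAD = "<order><quantity>twelve</quantity><shipDate>2002-13-40</shipDate></order>"

if __name__ == "__main__":
    print(type_errors(GOOD))
    print(type_errors(BAD))
```

The point of the sketch is that the content of an element, not just its markup, is checked before the document is accepted.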

  10. XSL/XSLT
      • XSL Transformations (XSLT)
        • XSLT is a language for transforming XML documents into other XML documents.
        • Designed for use as part of XSL, which is a stylesheet language for XML. In addition to XSLT, XSL includes an XML vocabulary for specifying formatting.
      • More than just a template and style language – can process information inside XML documents
      • The UII “Data Transformer”

  11. XML Query
      • XQuery
        • XQuery is a query language that derives features from query languages such as SQL, OQL, XQL and XPath.
      • Superset of XPath
      • Example:
          (: books published by Addison-Wesley after 1991 :)
          for $b in doc("books.xml")/bib/book
          where $b/publisher = "Addison-Wesley" and $b/@year > 1991
          return <book year="{ $b/@year }"> { $b/title } </book>
      • The UII “Data Finder”
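
For readers without an XQuery processor at hand, the query on slide 11 can be approximated with Python's standard `xml.etree.ElementTree`. This is a sketch of the same filter-and-construct logic, not XQuery itself. The slide does not show the contents of `books.xml`, so the `BOOKS_XML` sample and the function name `recent_books` are assumptions.

```python
import xml.etree.ElementTree as ET

# Hypothetical stand-in for books.xml; the slide does not show its contents.
BOOKS_XML = """
<bib>
  <book year="1994"><title>TCP/IP Illustrated</title><publisher>Addison-Wesley</publisher></book>
  <book year="1990"><title>Older Title</title><publisher>Addison-Wesley</publisher></book>
  <book year="2000"><title>Data on the Web</title><publisher>Morgan Kaufmann</publisher></book>
</bib>
"""

def recent_books(xml_text, publisher="Addison-Wesley", after_year=1991):
    """Mirror the FOR / WHERE / RETURN steps of the slide's query."""
    bib = ET.fromstring(xml_text)
    results = []
    for b in bib.findall("book"):                          # FOR each /bib/book
        if (b.findtext("publisher") == publisher
                and int(b.get("year")) > after_year):      # WHERE clause
            out = ET.Element("book", year=b.get("year"))   # RETURN <book year=...>
            ET.SubElement(out, "title").text = b.findtext("title")
            results.append(out)
    return results

if __name__ == "__main__":
    for book in recent_books(BOOKS_XML):
        print(ET.tostring(book, encoding="unicode"))
```

With the sample data above, only the 1994 Addison-Wesley book passes both conditions.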

  12. SOAP
      • Simple Object Access Protocol (SOAP)
        • SOAP is a protocol specification for invoking methods on servers, services, components and objects.
        • SOAP codifies the existing practice of using XML and HTTP as a method invocation mechanism.
      • A transport mechanism described in XML that carries XML
      • The heavy lifter of Web Services, where WSDL and UDDI play supporting roles
      • The UII “Data Transport”
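
To illustrate "a transport mechanism described in XML that carries XML", here is a minimal sketch that wraps a method call in a SOAP 1.1 envelope and reads it back. It only builds and parses the message; sending it over HTTP is left out. The service namespace `http://example.com/catalog`, the method `GetPrice` and the parameter `partId` are hypothetical.

```python
import xml.etree.ElementTree as ET

SOAP_NS = "http://schemas.xmlsoap.org/soap/envelope/"  # SOAP 1.1 envelope namespace
APP_NS = "http://example.com/catalog"                   # hypothetical service namespace

def build_request(method, **params):
    """Wrap a method call and its parameters in a SOAP envelope."""
    ET.register_namespace("soap", SOAP_NS)
    envelope = ET.Element(f"{{{SOAP_NS}}}Envelope")
    body = ET.SubElement(envelope, f"{{{SOAP_NS}}}Body")
    call = ET.SubElement(body, f"{{{APP_NS}}}{method}")
    for name, value in params.items():
        ET.SubElement(call, f"{{{APP_NS}}}{name}").text = str(value)
    return ET.tostring(envelope, encoding="unicode")

def read_request(message):
    """Recover the method name and parameters from a SOAP envelope."""
    envelope = ET.fromstring(message)
    call = envelope.find(f"{{{SOAP_NS}}}Body")[0]   # first child of Body is the call

    def local(tag):
        return tag.split("}", 1)[1]                 # strip the "{namespace}" prefix

    return local(call.tag), {local(p.tag): p.text for p in call}

if __name__ == "__main__":
    msg = build_request("GetPrice", partId="A-100")
    print(msg)
    print(read_request(msg))
```

The receiver needs nothing beyond an XML parser to find out which method was requested and with what arguments, which is the practice the slide says SOAP codifies.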

  13. Example: XML into HTML
      [Screenshots: an XML document profiling people in a Computer Sciences department; XSL documents describing how to transform the information]
      • Select XML document
      • Select XSL stylesheet
      • Initiate transformation

  14. Example: XML into HTML
      [Screenshots: the HTML document created from the XML document; the HTML document displayed by a web browser]
      • HTML document dynamically generated from XML
      • Original XML remains as-is in the database
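
The flow on slides 13 and 14 (pick a document, pick a set of rules, run the transformation, leave the source untouched) can be sketched in Python. Python's standard library has no XSLT processor, so the rules here are written directly with `xml.etree.ElementTree` rather than as an XSL stylesheet; this is a swapped-in technique for illustration. The `PEOPLE_XML` sample and the `to_html` function are hypothetical, since the slides' actual documents are not shown.

```python
import xml.etree.ElementTree as ET

# Hypothetical department document; the slide's actual XML is not shown.
PEOPLE_XML = """
<department name="Computer Sciences">
  <person><name>Ada Smith</name><role>Professor</role></person>
  <person><name>Lee Wong</name><role>Lecturer</role></person>
</department>
"""

def to_html(xml_text):
    """Generate an HTML page from the XML; the input string is not modified."""
    dept = ET.fromstring(xml_text)
    html = ET.Element("html")
    body = ET.SubElement(html, "body")
    ET.SubElement(body, "h1").text = dept.get("name")
    table = ET.SubElement(body, "table")
    for person in dept.findall("person"):       # one table row per person
        row = ET.SubElement(table, "tr")
        ET.SubElement(row, "td").text = person.findtext("name")
        ET.SubElement(row, "td").text = person.findtext("role")
    return ET.tostring(html, encoding="unicode", method="html")

if __name__ == "__main__":
    print(to_html(PEOPLE_XML))
```

Because the HTML is generated on request, a change to the stored XML shows up in the next page view without any edit to the presentation rules.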

  15. XML Personalization: Mobile Example
      • Business Problem: Provide relevant information to a variety of mobile devices.
      • Technology Challenges
        • Web content re-use
        • Variety of mobile markup formats
        • Personalization at scale
        • Performance
      • Tools of Choice
        • XSLT
        • DSML (LDAP)
        • XML Query

  16. XML Personalization: Mobile Example
      [Architecture diagram. Labels: Palm, Nokia, Samsung; HTML; XML; Web Site; CRM; XML Repository; XML Doc Cache; XSLT Cache; XML Index Cache; XML Profile Directory; Web DSML Interface]
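
The mobile example combines two decisions per request: which content a user should see, and which markup the user's device needs. The sketch below shows that combination in plain Python. It is an illustration only: the slides use XSLT for rendering and a DSML (LDAP) directory for profiles, while here both are replaced by small in-memory stand-ins. The users, topics, `PROFILES` table and the choice of HTML and WML as target formats are assumptions.

```python
import xml.etree.ElementTree as ET
from html import escape

# Hypothetical content; one XML source is re-used for every device.
NEWS_XML = """
<news>
  <item topic="sports"><headline>Cup final tonight</headline></item>
  <item topic="finance"><headline>Markets close higher</headline></item>
</news>
"""

# Hypothetical profile directory (the slides list DSML/LDAP for this role).
PROFILES = {
    "alice": {"markup": "html", "topics": {"finance"}},
    "bob": {"markup": "wml", "topics": {"sports", "finance"}},
}

def render_html(headlines):
    items = "".join(f"<li>{escape(h)}</li>" for h in headlines)
    return f"<html><body><ul>{items}</ul></body></html>"

def render_wml(headlines):
    lines = "".join(f"<p>{escape(h)}</p>" for h in headlines)
    return f"<wml><card>{lines}</card></wml>"

# One renderer per markup format (the slides use XSLT stylesheets here).
RENDERERS = {"html": render_html, "wml": render_wml}

def personalize(user, xml_text=NEWS_XML):
    """Filter content by the user's topics, then render for the user's device."""
    profile = PROFILES[user]
    headlines = [
        item.findtext("headline")
        for item in ET.fromstring(xml_text).findall("item")
        if item.get("topic") in profile["topics"]
    ]
    return RENDERERS[profile["markup"]](headlines)

if __name__ == "__main__":
    print(personalize("alice"))
    print(personalize("bob"))
```

Adding a new device format in this arrangement means adding one renderer, with no change to the content or the profiles.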

  17. Dynamic XML: eCatalog Example
      • Business Problem: Provide custom e-catalogs to a range of customers and partners
      • Technology Challenges
        • Collecting catalog content
        • Fluid catalog content
        • Dynamic generation of catalog views
      • Tools of Choice
        • XSLT
        • XQuery
        • SOAP

  18. Dynamic XML: eCatalog Example
      [Workflow diagram. Labels: Catalog Designers; XML Repository; Internet Browser; Buyers]
      • Step 1: Develop designs
      • Step 2: Manage design images and info (SVG images & searchable metadata)
      • Step 3: Make designs widely available (XSL transformations)
      • Step 4: Search numerous parts/designs
      • Problem: Buyers cannot search design images
      • Solution: Buyers search for design images by keywords or categories
      • How It Works: Data is in XML; Scalable Vector Graphics (SVG) images include metadata that can be searched

  19. Dynamic XML: eCatalog Example
      [Architecture diagram. Labels: Customer A, XML (A); Customer B, XML (B); Partner C, SOAP + XML (C); Internal D, SOAP + XML (D); HTML; XML; Web Site; ERP; Partner Catalogs; XML Repository; XML Doc Cache; XSLT Cache; XML Index Cache; XML Query; XML Updates]

  20. Dynamic XML: eCatalog Example (Extra Credit: Are these documents or data?)

  21. Dynamic XML: eCatalog Example
      XQuery Example:
        (: merge two supplier catalogs with different layouts into one format :)
        <catalog xmlns="http://www.parts.com"
                 xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
                 xsi:schemaLocation="http://www.parts.com catalog.xsd">
        {
          for $p1 in doc("Alexs_catalog.xml")/catalog/item
          return <part id="{$p1/partid}"> { $p1/desc, $p1/price } </part>,
          for $p2 in doc("Bobs_catalog.xml")/partList/part
          return
            <part id="{$p2/partNumber}">
              <desc>{string($p2/name)}</desc>
              <price>{string($p2/price)}</price>
            </part>
        }
        </catalog>
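
The catalog merge on slide 21 can also be sketched with Python's standard `xml.etree.ElementTree`, for readers who want to try the idea without an XQuery engine. The two inline documents are hypothetical stand-ins for `Alexs_catalog.xml` and `Bobs_catalog.xml`, whose contents the slide does not show, and the namespace declarations from the slide are left out to keep the sketch short.

```python
import xml.etree.ElementTree as ET

# Hypothetical contents for the two supplier files named on the slide.
ALEXS = ("<catalog><item><partid>A1</partid><desc>Bolt</desc>"
         "<price>0.10</price></item></catalog>")
BOBS = ("<partList><part><partNumber>B7</partNumber><name>Nut</name>"
        "<price>0.05</price></part></partList>")

def merge_catalogs(alexs_xml, bobs_xml):
    """Map both supplier layouts onto one <catalog> of <part> elements."""
    catalog = ET.Element("catalog")
    # Supplier A: /catalog/item with partid, desc, price
    for item in ET.fromstring(alexs_xml).findall("item"):
        part = ET.SubElement(catalog, "part", id=item.findtext("partid"))
        ET.SubElement(part, "desc").text = item.findtext("desc")
        ET.SubElement(part, "price").text = item.findtext("price")
    # Supplier B: /partList/part with partNumber, name, price
    for src in ET.fromstring(bobs_xml).findall("part"):
        part = ET.SubElement(catalog, "part", id=src.findtext("partNumber"))
        ET.SubElement(part, "desc").text = src.findtext("name")
        ET.SubElement(part, "price").text = src.findtext("price")
    return catalog

if __name__ == "__main__":
    print(ET.tostring(merge_catalogs(ALEXS, BOBS), encoding="unicode"))
```

Each supplier keeps its own layout; only the mapping into the shared format has to know about the differences.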

  22. Conclusion: Universal Information Integration
      • Reusable XML Document Infrastructure
      • Portals, eCatalogs, Wireless, Web Services
      • Integrate, Manage, Deliver
      • Simpler, Faster, Personal, Dynamic, Distributed, Reusable
      • SOAP, XSL, XML Schema, XML Query
      • XML Document/Index/Style Caching
