Explore the use of XML in databases, including XML technologies, querying approaches, and the potential for XML to replace relational databases. Learn about validation, query languages, parsing, and transformation, as well as the integration and distribution of data. Discover various XML database products and the benefits of using native XML storage. Consider current and future research issues in XML and distributed databases.
<XML /> and Distributed Databases on the Web Nathaniel Ayewah CSE 8330 Presentation SMU
Introduction • Why XML? • In Databases? Structured Content Data Model
Overview • XML Technologies • XML and Databases • Will XML supplant Relational DBs? Querying XML Different Approaches
XML Origins • SGML • HTML • XML
XML Example meta language
XML Technologies • XPointer • XML Signature • XQuery • XML Schema • XSLT • XSL-FO • XQL • XML Encryption • XML-QL • XPath • SOAP • DOM • UDDI • SAX • WSDL • DTD
XML Technologies • Validation and Structure • Query Languages** • Parsing and Processing • Transformation and Presentation
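The "Parsing and Processing" category covers the two parser styles named on the previous slide, DOM and SAX. The sketch below contrasts them using Python's standard library; it is only an illustration, and the sample document with its `faculty` and `member` elements is an assumption, not material from the presentation. DOM builds the whole tree in memory before it is queried, while SAX streams events to a handler.

```python
import xml.dom.minidom
import xml.sax

# Hypothetical sample document; element names are assumptions.
DOC = "<faculty><member dept='CS'>Ada</member><member dept='EE'>Bob</member></faculty>"


def names_via_dom(text):
    """DOM style: parse the full tree, then navigate it."""
    dom = xml.dom.minidom.parseString(text)
    return [m.firstChild.data for m in dom.getElementsByTagName("member")]


class NameHandler(xml.sax.ContentHandler):
    """SAX style: collect member names from a stream of parse events."""

    def __init__(self):
        super().__init__()
        self.names = []
        self._in_member = False
        self._buf = []

    def startElement(self, name, attrs):
        if name == "member":
            self._in_member = True
            self._buf = []

    def characters(self, content):
        # Text may arrive in several chunks, so buffer it.
        if self._in_member:
            self._buf.append(content)

    def endElement(self, name):
        if name == "member":
            self.names.append("".join(self._buf))
            self._in_member = False


def names_via_sax(text):
    handler = NameHandler()
    xml.sax.parseString(text.encode("utf-8"), handler)
    return handler.names


print(names_via_dom(DOC))
print(names_via_sax(DOC))
```

Both functions return the same names; the SAX version never holds more than one element's text at a time, which matters for large documents.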
Query Languages • Document Community [XPath/XQL] • Database Community [XML-QL] • W3C [XQuery]
“What is the size of the Computer Science Faculty?” http://support.x-hive.com/xquery/index.html
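The question on this slide amounts to counting the elements that match a condition. A minimal sketch of that idea follows, using the limited XPath subset in Python's standard library rather than an XQuery engine. The `university` document and its `member`/`dept` names are assumptions made for illustration; the sample data behind the linked page is not reproduced here. Against this assumed structure, the equivalent XPath would be roughly `count(//member[@dept='CS'])`.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample data; element and attribute names are assumptions.
FACULTY_XML = """
<university>
  <member dept="CS">Ada</member>
  <member dept="CS">Grace</member>
  <member dept="EE">Bob</member>
</university>
"""


def faculty_size(xml_text, dept):
    """Count <member> elements whose dept attribute equals dept."""
    root = ET.fromstring(xml_text)
    # ElementTree supports attribute predicates of the form [@attr='value'].
    return len(root.findall(f".//member[@dept='{dept}']"))


print(faculty_size(FACULTY_XML, "CS"))
```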
XQuery: Joins Source: http://www.w3.org/TR/xquery
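XQuery expresses joins with FLWOR expressions, typically nested `for` clauses combined with a `where` condition on a shared key. The sketch below shows the same operation in plain Python over two small documents. The documents, element names, and the `join_staff_to_depts` helper are all hypothetical; this is not the example from the W3C specification.

```python
import xml.etree.ElementTree as ET

# Hypothetical documents; all names are assumptions for illustration.
DEPTS_XML = (
    "<depts>"
    "<dept id='d1' name='Computer Science'/>"
    "<dept id='d2' name='Electrical Engineering'/>"
    "</depts>"
)
STAFF_XML = (
    "<staff>"
    "<person dept='d1'>Ada</person>"
    "<person dept='d2'>Bob</person>"
    "<person dept='d1'>Grace</person>"
    "</staff>"
)


def join_staff_to_depts(depts_text, staff_text):
    """Inner join of staff to departments on the department id."""
    depts = ET.fromstring(depts_text)
    staff = ET.fromstring(staff_text)
    # Build a lookup table on the join key (a simple hash join).
    name_by_id = {d.get("id"): d.get("name") for d in depts.iter("dept")}
    pairs = []
    for person in staff.iter("person"):
        dept_name = name_by_id.get(person.get("dept"))
        if dept_name is not None:  # drop staff with no matching department
            pairs.append((person.text, dept_name))
    return pairs


for name, dept in join_staff_to_depts(DEPTS_XML, STAFF_XML):
    print(name, "-", dept)
```

When the two documents live on different servers, the choice of which side to ship to the other is one of the distributed optimization questions raised later in the deck.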
Why distribute? • Data Integration • Data Distribution
Classification • Data view (d) • Virtual view (v) • Query view (q) • Td,v,q where d, v, q ∈ {R, X, H, N} • R = Relational Data Model • X = XML Data Model • H = Hybrid Data Model • N = View does not exist
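The classification can be read as a triple of codes, one per view. The sketch below parses a label written in the flattened form used in this transcript (for example `TR,X,N`) and expands each code to its meaning. The `Classification` type and the `parse_label` and `describe` helpers are hypothetical names introduced for illustration only.

```python
from typing import NamedTuple

# Codes and meanings as listed on the classification slide.
MODELS = {
    "R": "Relational Data Model",
    "X": "XML Data Model",
    "H": "Hybrid Data Model",
    "N": "View does not exist",
}


class Classification(NamedTuple):
    data_view: str
    virtual_view: str
    query_view: str


def parse_label(label):
    """Parse a label such as 'TR,X,N' into a Classification triple."""
    if not label.startswith("T"):
        raise ValueError(f"label must start with 'T': {label!r}")
    parts = label[1:].split(",")
    if len(parts) != 3 or any(p not in MODELS for p in parts):
        raise ValueError(f"expected three codes from R, X, H, N: {label!r}")
    return Classification(*parts)


def describe(label):
    """Map each view of the label to the meaning of its code."""
    triple = parse_label(label)
    return {view: MODELS[code] for view, code in triple._asdict().items()}


print(describe("TR,X,N"))
```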
Products • Native XML • XML Enabled • Middleware or XML Server • Wrappers • Standalone XML XQuery Engine • Content Management System http://www.rpbourret.com/xml/XMLDatabaseProds.htm TX,X,X ? TR,X,X TR,R,X TR,X,X TR,R,R TR,X,N TX,R,R TX,N,X TX,X,X
XML-Enabled Default View
Native XML Database • XML:DB Initiative http://www.xmldb.org • Native vs. Relational: XML Document ↔ Tuple(s); Collection ↔ Relation • Physical Storage
Berkeley DB XML • XML Data Model over Physical Berkeley DB storage system • Native XML Storage • Supports transactions, recovery, indexing, replication, multiple users and concurrency, query processing, standards • C++/Java APIs http://www.sleepycat.com/products/xml.shtml
Berkeley DB XML Source: Berkeley DB XML Documentation
Current/Future Research Issues • Physical Storage • Query Optimization • Distributed Processing/Optimization • Static vs Dynamic Processing • First, Last, Partial Results • Updates
Conclusion and Future • Will XML replace existing DBs? • Document-centric applications • XML Sources • Data warehousing (Xyleme) • Data-centric applications • Business Transactions