CS 253: Topics in Database Systems: XPath, NameSpaces Dr. Alexandra I. Cristea http://www.dcs.warwick.ac.uk/~acristea/
Previously we looked at: • XML • Next: • XPath • Namespaces
XPath • XPath is a syntax for addressing parts of an XML document • XPath uses path expressions to navigate in XML documents • XPath contains a library of standard functions • XPath is a major element in XSLT • XPath is a W3C Recommendation, and thus a standard (XPath 1.0: 16 November 1999)
XPath Path Expressions • Uses path expressions to select nodes or node-sets in an XML document. • These path expressions look very much like the expressions you see when you work with a traditional computer file system.
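The file-system analogy can be tried out with a minimal Python sketch. It assumes the standard-library module xml.etree.ElementTree, which implements only a subset of XPath; the second book ("Learning XML") is a hypothetical addition to the bookstore sample used in these slides.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document, modelled on the bookstore example in these slides.
XML = """<bookstore>
  <book>
    <title lang="en">Harry Potter</title>
    <price>29.99</price>
  </book>
  <book>
    <title lang="en">Learning XML</title>
    <price>39.95</price>
  </book>
</bookstore>"""

root = ET.fromstring(XML)  # root is the <bookstore> element

# Step by step, like walking directories: bookstore -> book -> title
titles = [t.text for t in root.findall("./book/title")]

# ".//" searches at any depth below the current node (XPath's "//")
prices = [p.text for p in root.findall(".//price")]

print(titles)
print(prices)
```

ElementTree evaluates paths relative to the element they are called on, which is why the paths start with "." rather than "/".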
XPath Standard Functions • Over 100 built-in functions (as of XPath 2.0; XPath 1.0 defines a much smaller core library), covering: • string values, • numeric values, • date and time comparison, • node and QName manipulation, • sequence manipulation, • Boolean values, • and more.
XPath Terminology • Nodes • Atomic values • Items (atomic values or nodes) • Relationships of nodes • Parent • Children • Siblings • Ancestors • Descendants
XPath Nodes • 7 kinds of nodes: • element, • attribute, • text, • namespace, • processing-instruction, • comment, and • document (root) nodes. • XML documents are treated as trees of nodes. The root of the tree is called the document node (or root node).
Nodes Examples
<?xml version="1.0" encoding="ISO-8859-1"?>
<bookstore>
  <book>
    <title lang="en">Harry Potter</title>
    <author>J K. Rowling</author>
    <year>2005</year>
    <price>29.99</price>
  </book>
</bookstore>
• Document (root) node: the whole document, whose root element is <bookstore> • Element node: <author>J K. Rowling</author> • Attribute node: lang="en"
Atomic values Examples*
<?xml version="1.0" encoding="ISO-8859-1"?>
<bookstore>
  <book>
    <title lang="en">Harry Potter</title>
    <author>J K. Rowling</author>
    <year>2005</year>
    <price>29.99</price>
  </book>
</bookstore>
• Atomic values in this document: J K. Rowling, "en"
*Atomic values: nodes with no children or parent
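To make the node kinds and atomic values concrete, here is a small sketch in Python using the standard-library xml.etree.ElementTree module. This is an assumption of convenience: ElementTree does not model attributes and text as separate node objects the way the XPath data model does, so they appear here as properties of element objects.

```python
import xml.etree.ElementTree as ET

XML = """<bookstore>
  <book>
    <title lang="en">Harry Potter</title>
    <author>J K. Rowling</author>
    <year>2005</year>
    <price>29.99</price>
  </book>
</bookstore>"""

# The ElementTree object stands in for the document (root) node;
# getroot() returns the root element, here <bookstore>.
tree = ET.ElementTree(ET.fromstring(XML))
root = tree.getroot()

title = root.find("./book/title")    # an element node
author = root.find("./book/author")  # another element node

print(root.tag)      # name of the root element
print(title.attrib)  # the attributes of <title>, as a dict
print(author.text)   # text content of <author>: an atomic value
print(title.get("lang"))  # value of the lang attribute: an atomic value
```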
Predicates • Predicates are used to find a specific node or a node that contains a specific value. • Predicates are always embedded in square brackets.
Example predicates (cont.) • /bookstore/book[price>35.00] : selects all the book elements of the bookstore element that have a price element with a value greater than 35.00 • /bookstore/book[price>35.00]/title : selects all the title elements of the book elements of the bookstore element that have a price element with a value greater than 35.00
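The price predicate described above corresponds to the XPath expression /bookstore/book[price>35.00]. The sketch below emulates it in Python with the standard-library xml.etree.ElementTree module, which supports positional and attribute predicates but not numeric comparisons, so the ">" test is written as ordinary Python. The two-book document is a hypothetical sample.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample data for illustration.
XML = """<bookstore>
  <book><title lang="en">Harry Potter</title><price>29.99</price></book>
  <book><title lang="en">Learning XML</title><price>39.95</price></book>
</bookstore>"""
root = ET.fromstring(XML)

# /bookstore/book[price>35.00]
# ElementTree has no ">" in predicates, so the comparison is done in Python.
expensive = [b for b in root.findall("./book")
             if float(b.findtext("price")) > 35.00]

# /bookstore/book[price>35.00]/title
expensive_titles = [b.findtext("title") for b in expensive]

# Predicates that ElementTree does evaluate itself:
first_book = root.find("./book[1]")                  # positional predicate
english = root.findall("./book/title[@lang='en']")   # attribute-value predicate

print(expensive_titles)
```

A full XPath engine would evaluate the bracketed comparison directly; the list comprehension only mirrors its effect.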
Example: selecting several paths • //title | //price : selects all the title as well as price elements in the document • /bookstore/book/title | //price : selects all the title elements of the book element of the bookstore element as well as all the price elements in the document
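Selecting several paths uses the XPath union operator "|", as in //title | //price. The standard-library xml.etree.ElementTree module does not implement "|", so the sketch below emulates it with a hypothetical helper named union, which merges the results of several paths, drops duplicates, and returns the nodes in document order. The sample document is also hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample data for illustration.
XML = """<bookstore>
  <book><title lang="en">Harry Potter</title><price>29.99</price></book>
  <book><title lang="en">Learning XML</title><price>39.95</price></book>
</bookstore>"""
root = ET.fromstring(XML)

def union(context, *paths):
    """Emulate XPath's "|": merge node-sets, no duplicates, document order."""
    selected = set()
    for path in paths:
        selected.update(context.findall(path))
    # iter() walks the tree in document order
    return [e for e in context.iter() if e in selected]

# //title | //price
both = union(root, ".//title", ".//price")

# /bookstore/book/title | //price
mixed = union(root, "./book/title", ".//price")

print([e.tag for e in both])
```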
axisname::nodetest[predicate] • //DDD/parent::*
<AAA>
  <BBB>
    <DDD> </DDD>
  </BBB>
</AAA>
axisname::nodetest[predicate] • //BBB/child::*
<AAA>
  <BBB>
    <DDD> </DDD>
  </BBB>
</AAA>
Note: /AAA is equivalent to /child::AAA
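The parent and child axes can be checked on the AAA/BBB/DDD document with a short Python sketch. It assumes the standard-library xml.etree.ElementTree module, which does not accept the full axisname:: syntax, so the abbreviated forms are used: ".." stands for parent::node() and a bare step such as "*" is on the child axis by default.

```python
import xml.etree.ElementTree as ET

root = ET.fromstring("<AAA><BBB><DDD> </DDD></BBB></AAA>")

# //DDD/parent::*  written in abbreviated form as //DDD/..
parents = root.findall(".//DDD/..")

# //BBB/child::*   written in abbreviated form as //BBB/*
children = root.findall(".//BBB/*")

print([e.tag for e in parents])
print([e.tag for e in children])
```

The first expression selects the BBB element (the parent of DDD); the second selects DDD (the only child of BBB).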
More examples • http://www.zvon.org/xxl/XPathTutorial/General/examples.html • Check basics, //, *, predicates, attributes, functions (new ones: count, name, normalize-space, starts-with, contains, string-length, floor, ceiling), axes, operators (mod) • Note: The ancestor, descendant, following, preceding and self axes partition a document (ignoring attribute and namespace nodes): they do not overlap and together they contain all the nodes in the document. (see example)
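The partition property in the note can be verified with a minimal Python sketch. It is a simplification under stated assumptions: it uses the standard-library xml.etree.ElementTree module, considers element nodes only (the document node, text, and comments are left out), and computes the five axes by hand in a hypothetical helper named axes. The sample document is also hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document.
root = ET.fromstring("<AAA><BBB><DDD/></BBB><CCC/><EEE><FFF/></EEE></AAA>")

all_nodes = list(root.iter())                  # elements in document order
parent = {c: p for p in all_nodes for c in p}  # child -> parent map

def axes(ctx):
    """Return the self, ancestor, descendant, preceding and following sets of ctx."""
    ancestors = set()
    node = ctx
    while node in parent:
        node = parent[node]
        ancestors.add(node)
    descendants = set(ctx.iter()) - {ctx}
    pos = all_nodes.index(ctx)
    # preceding: earlier in document order, excluding ancestors
    preceding = set(all_nodes[:pos]) - ancestors
    # following: later in document order, excluding descendants
    following = set(all_nodes[pos + 1:]) - descendants
    return {"self": {ctx}, "ancestor": ancestors, "descendant": descendants,
            "preceding": preceding, "following": following}

parts = axes(root.find("CCC"))

# No overlap: the sizes add up to the number of elements ...
no_overlap = sum(len(s) for s in parts.values()) == len(all_nodes)
# ... and together the five axes cover every element.
covers_all = set().union(*parts.values()) == set(all_nodes)

print({name: sorted(e.tag for e in s) for name, s in parts.items()})
```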
XPath Conclusion • We have learned: • XPath definition • Path expressions • Standard functions • Terminology • Predicates • Location paths • Axes • Some operators
Before we go on, one more thing about XML: • XML Namespaces
The Idea to Solve it • Assign a URI (~ URL) to every sub-language: • E.g., for XHTML 1.0: http://www.w3.org/1999/xhtml • QNames: qualify element names with URIs: • {http://www.w3.org/1999/xhtml}head • See also: Web Naming and Addressing Overview (URIs, URLs, ...)
The actual solution • Namespace declarations bind URIs to prefixes • Default namespace (no prefix) declared with: xmlns="…" • Lexical scope • Attribute names can also be prefixed
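A short Python sketch shows how these declarations behave in practice. It assumes the standard-library xml.etree.ElementTree module, which happens to report element names in the {URI}localname form. The prefix ex, the URI http://example.org/ns, and the note element are hypothetical and used only for illustration.

```python
import xml.etree.ElementTree as ET

XML = """<html xmlns="http://www.w3.org/1999/xhtml"
      xmlns:ex="http://example.org/ns">
  <head><title>Demo</title></head>
  <body><ex:note ex:kind="info">hello</ex:note></body>
</html>"""
root = ET.fromstring(XML)

# The default namespace applies to <html> and, by lexical scope, to its
# unprefixed descendants.
print(root.tag)  # {http://www.w3.org/1999/xhtml}html

# Prefixes used in queries are local to the query; only the URIs must match.
ns = {"h": "http://www.w3.org/1999/xhtml", "ex": "http://example.org/ns"}
head = root.find("h:head", ns)
note = root.find(".//ex:note", ns)

# An unqualified name does not match a namespaced element.
unqualified = root.find("head")

# Prefixed attribute names are expanded in the same way.
kind = note.get("{http://example.org/ns}kind")
```

Note that the query uses the prefix h for a namespace the document declares without any prefix: the prefix is only shorthand, and the URI is what identifies the namespace.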
Next we look at how to query XML • This can be done, to some extent, as we have seen, within XSLT, • but the main language developed for this purpose is …