300 likes | 313 Views
Efficient XSLT Processing in Relational Database System. Zhen Hua Liu Anguel Novoselsky Oracle Corporation VLDB 2006. Agenda. XML Processing Languages Overview XSLT Processing - Coprocessor Vs Integrated Approach XSLT to XQuery Rewrite Translation Technique Performance Evaluation
E N D
Efficient XSLT Processing in Relational Database System Zhen Hua Liu Anguel Novoselsky Oracle Corporation VLDB 2006
Agenda • XML Processing Languages Overview • XSLT Processing - Coprocessor Vs Integrated Approach • XSLT to XQuery Rewrite Translation Technique • Performance Evaluation • Conclusion • Q & A
XML Processing Languages • XQuery/XPath XSLT • SQL/XML –integration XML and XQuery/XPath into SQL (http://www.sqlx.org) • SQL/XML (Oracle XMLDB) – • XMLTransform – Oracle XMLDB extension operator: embedding XSLT in SQL • SELECT XMLTransform(emp.resume, ‘XSLT code’) FROM emp; // resume is of XMLType column of table emp
XQuery, XSLT, SQL/XML Comparison • All are Declarative Languages ! (Central Dogma in declarative query processing) • Share common XQuery Data Model • SQL/XML XMLType is based on XQuery Data Model • XQuery & SQL/XML • More database centric • Share the same paradigm – Selection, Projection (XML Construction), Join, Order • XSLT • Database Foreign - Template Rule Matching based Execution Model • XLST is more declarative than XQuery • Impedance Mismatch with XQuery/SQL Processing Model, how to run XSLT in RDBMS ?
Multi-Coprocessors Approach • Embed off-the-shelf Xquery/XSLT processors into a SQL engine XSLT Engine XSLT(Transform) XQuery(XMLQuery) SQL Engine XQuery Data Model Instances Input XQuery Engine XQuery DM instances output
Issues & Challenges with Coprocessor Approach • Fully composable in SQL/XML: Can we optimize XSLT, XQuery, XPath as one language ? Cross Language Optimization Feasible ? • XML is stored & Indexed: Can XSLT processingleverage index on XML in RDBMS? • How to make XSLT template rule matching based execution model “fit” into RDBMS processing Model ?
XQuery/XSLT/SQL/XML Integrated Architecture XSLT to XQuery Rewrite SQL/XML XQuery XSLT Common algebraic operator tree – storage independent optimization XMLType Abstraction XML Storage/Index dependent Optimization Binary XML/ XMLIndex OR Storage Extended/Hybrid Model SQLX View/Relational Data
General XSLT to XQuery Translate Technique • Fokoue etc “Compiling XSLT 2.0 into XQuery 1.0” paper at WWW 2005 • Translate XSLT template into XQuery Function • Translate XSLT instruction into corresponding XQuery construct • Translate <xsl:apply-template> into XQuery function calls with large XQuery conditional expression matching XSLT pattern • Issues • Resultant XQuery is cumbersome and requires aggressive optimization • Where to Add Intelligence in the translation ?
XSLT to XQuery Rewrite example • XMLType view over relational data CREATE VIEW dept_emp AS SELECT XMLElement("dept", XMLElement("dname", dname), XMLElement("loc", loc), XMLElement("employees", (SELECT XMLAgg(XMLElement("emp", XMLElement("empno", empno), XMLElement("ename", ename), XMLElement("sal", sal))) FROM emp WHERE emp.deptno = dept.deptno))) as dept_content FROM dept
Example contd.. • Result of XMLType View ============================================================ <dept> dname>ACCOUNTING</dname> <loc>NEW YORK</loc> <employees> <emp> <empno>7782</empno> <ename>CLARK</ename> <sal>2450</sal> </emp> <emp> <empno>7934</empno> <ename>MILLER</ename> <sal>1300</sal> </emp> </employees> </dept>
Example- XSLT on XMLType -1 SELECT XMLTransform(dept_emp.dept_content, '<?xml version="1.0"?><xsl:stylesheet version="1.0“ xmlns:xsl="http://www.w3.org/1999/XSL/Transform"> <xsl:template match="dept"> <H1>HIGHLY PAID DEPT EMPLOYEES</H1> <xsl:apply-templates/> </xsl:template> <xsl:template match="dname"> <H2>Department name: <xsl:value-of select="."/></H2> </xsl:template> <xsl:template match="loc"> <H2>Department location: <xsl:value-of select="."/></H2> </xsl:template>
Example- XSLT on XMLType -2 <xsl:template match="employees"> <H2>Employees Table</H2> <table border="2"> <td><b>EmpNo</b></td> <td><b>Name</b></td> <td><b>Weekly Salary</b></td> <xsl:apply-templates select="emp[sal > 2000]"/> </table> </xsl:template> <xsl:template match = "emp"> <tr> <td><xsl:value-of select="empno"/></td> <td><xsl:value-of select="ename"/></td> <td><xsl:value-of select="sal"/></td> </tr> </xsl:template>
Example- XSLT on XMLType -3 <xsl:template match="text()"> <xsl:value-of select="."/> </xsl:template> </xsl:stylesheet>') FROM dept_emp;
Input XML Structural Info with XSLT template Analysis root dept dept template default template employees dname loc dname template loc template emp Employees template text text emp template text Text template
XSLT Template Invocation Graph root Default template dept dept template dname employees loc dname template loc template Employees template text Emp[sal>2000] text Text template Text template emp template text Text template
Example- XQuery from XSLT SELECT XMLQuery( 'declare variable $var000 := .; (: builtin template :) ( let $var002 := $var000/dept return (: <xsl:template match="dept"> :) ( <H1>HIGHLY PAID DEPT EMPLOYEES</H1>, ( let $var003 := $var002/dname return (: <xsl:template match="dname"> :) <H2>{fn:concat("Department name: ", fn:string($var003))}</H2>, let $var003 := $var002/loc return (: <xsl:template match="loc"> :) <H2>{fn:concat("Department location: ",fn:string($var003))}</H2>,
Example- XQuery from XSLT let $var003 := $var002/employees return (: <xsl:template match="employees"> :) ( <H2>Employees Table</H2>, <table border="2"> { <td><b>EmpNo</b></td>, <td><b>Name</b></td>, <td><b>Weekly Salary</b></td>, (
Example- XQuery from XSLT Rewrite for $var005 in ($var003/emp[sal > 2000]) return (: <xsl:template match="emp"> :) <tr> <td>{fn:string($var005/empno)}</td> <td>{fn:string($var005/ename)}</td> <td>{fn:string($var005/sal)}</td> </tr> ) } </table> ) ) ) )' PASSING dept_emp.dept_content RETURNING CONTENT) FROM DEPT FROM dept_emp
Final Optimized SQL/XML Query • SELECT XMLConcat( XMLElement( "H1",'HIGHLY PAID DEPT EMPLOYEES'), XMLElement( "H2",'Department name: ' ||"SYS_ALIAS_4"."DNAME"), XMLELement( "H2",'Department location:’ ||"SYS_ALIAS_4"."LOC"), XMLELement( "H2",'Employees Table'), XMLElement( "table",XMLAttributes('2' AS "border"), XMLElement( "td", XMLElement( "b",'EmpNo')), XMLElement( "td",XMLElement( "b",'Name')), XMLElement( "td",XMLElement( "b",'Weekly Salary')), (SELECT XMLAGG( XMLElement( "tr", XMLElement( "td","EMP"."EMPNO"), XMLElement( "td","EMP"."ENAME"), XMLElement( "td","EMP"."SAL"))) FROM EMP WHERE SAL > 2000 AND DEPTNO=DEPT.DEPTNO))) FROM DEPT
XSLT to XQuery Rewrite Key • Leverge XML structural information to generate Template Invocation Graph • XML Schema, DTD, SQL/XML construction functions • Inline Template with caller • This generates compact XQuery amendable for further optimization • Cancellation with XML view / relational data • Path/Value Index for binary XML
Partial Evaluation • Partial Evaluation to obtain template invocation graph • Application computation is described as F(X,Y), X changes less frequently than Y and significant part of F’s computation depends on X. • Optimize F statically by holding X as constant • Key observation – let F be the XSLT stylesheet, X be the input XML structural information, Y be the actual XML instance document content
Comparison with Related Work • Fokoue - “Compiling XSLT 2.0 into XQuery 1.0” WWW 2005 • Not leveraging Input XML Structure Information for optimizing XQuery generation process • Concluded context sensitive flow analysis & function specialization for static optimization • Our work – • Optimization of XSLT based on input XML structure • Leverage Partial Evaluation for obtaining template invocation graph
Comparison with Related Work • Moerkotte - “Incorporating XSL Processing Into Database Engines” VLDB 2002 • XSLT into internal algebra and integrate with RDBMS • Concluded future research in combined optimizations of XSLT with XML construction • Our work: • XSLT into XQuery • Combined optimizations of XSLT with XML input
Comparison with Related Work • Jain & Li etc - “Translating XSLT Programs to Efficient SQL Queries” WWW 2002 & “Composing XSL Transformations with XML Publishing Views” SIGMOD 2003 • Our work: • XQuery as intermediate language • Work with any XML storage/Index Model
XSLT Mark • Db-one row query – Index Probe (Table Scan Vs Index Scan) Rewrite – Integrated Approach No-Rewrite – Coprocessor Approach
XSLT Mark • Avts, metric: XML construction • Chart, total: XSLT uses xquery aggregate functions: count/sum
Conclusions • Efficient XSLT processing in RDBMS is feasible despite the template rule based XSLT language • Use XQuery as intermediate language to which XSLT is translated • Leverage XML input structural information to get efficient & compact XQuery • Index Probing, pull based execution model, parallel aggregation, sort applicable to XSLT in RDMS engine • XSLT is native to RDBMS, just as XQuery,SQL/XML