XML et al. vs. Tables
Michael Stonebraker
Adjunct Professor
Massachusetts Institute of Technology
(stonebraker@lcs.mit.edu)
Tables
• Good for structured data
  • remember Codd’s laws
• Schema first
  • have to think about your data up front
• Data integrity key
  • not loosey-goosey semantics
XML
• Good for documents
• Bad for data (Codd’s laws)
• Schema later OK
• Loosey-goosey OK
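The contrast between the Tables and XML slides can be sketched in a few lines. This is only an illustration under stated assumptions: the `emp` table, its columns, and the helper names `table_accepts` and `xml_accepts` are hypothetical, SQLite stands in for "a DBMS", and the XML side is checked for well-formedness only (no DTD or XML Schema is attached, which is the "schema later" case).

```python
import sqlite3
import xml.etree.ElementTree as ET


def table_accepts(salary):
    """Return True if a schema-first table accepts the row, False if it rejects it."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical schema: the constraints are declared before any data arrives.
    conn.execute(
        "CREATE TABLE emp ("
        " name TEXT NOT NULL,"
        " salary INTEGER NOT NULL"
        " CHECK (typeof(salary) = 'integer' AND salary >= 0))"
    )
    try:
        conn.execute(
            "INSERT INTO emp (name, salary) VALUES (?, ?)", ("alice", salary)
        )
        return True
    except sqlite3.IntegrityError:
        # The declared constraints reject the row.
        return False
    finally:
        conn.close()


def xml_accepts(salary_text):
    """Return True if the XML parser accepts the document (well-formedness only)."""
    doc = f"<emp><name>alice</name><salary>{salary_text}</salary></emp>"
    try:
        ET.fromstring(doc)
        return True
    except ET.ParseError:
        return False
```

With these definitions, `table_accepts("lots")` is rejected by the declared constraint, while `xml_accepts("lots")` succeeds because nothing says what a salary must look like until someone adds a schema.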
Applicability
[Figure: chart with axes "data volume" and "structure", each running from "little" to "lots", with regions labeled "Tables" and "XML".]
Why?
• Low end content uses HTML -> XML
• High end content uses DBMS
  • invariably structured
• High end business data processing uses DBMS
• Low end may well use XML
  • because it is easier
Which is More Important?
• Internet has generated lots of HTML content (on the left)
• As well as lots of click stream data (on the right)
• Which is more important depends on metric
  • bytes represented?
  • bytes * monetary value of a byte?
  • market value?
Nothing is New
• Document systems have never used databases
  • IR community versus DBMS community
  • XML will allow better IR
• Outer fringes have never used databases
  • in the 1970s it was airline reservation systems
  • in the 1990s it was web crawlers
But DBMS is “Good”
• Makes you think about your data up front
  • good discipline (used by business data processing folks)
  • registers metadata for downstream data sharability
• Transactions, recovery, etc. are good
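As one small illustration of why transactions are "good", the sketch below shows a failed update being rolled back as a unit. It assumes SQLite via Python's `sqlite3` module as a stand-in DBMS; the `account` table and the `transfer` and `demo` helpers are hypothetical names chosen for the example.

```python
import sqlite3


def transfer(conn, src, dst, amount):
    """Move amount between accounts atomically.

    Returns True if the transaction committed, False if it was rolled back.
    """
    try:
        # As a context manager, the connection commits on success and
        # rolls back if an exception escapes the block.
        with conn:
            conn.execute(
                "UPDATE account SET balance = balance + ? WHERE id = ?",
                (amount, dst),
            )
            conn.execute(
                "UPDATE account SET balance = balance - ? WHERE id = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        return False


def demo():
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE account ("
        " id TEXT PRIMARY KEY,"
        " balance INTEGER NOT NULL CHECK (balance >= 0))"
    )
    conn.executemany("INSERT INTO account VALUES (?, ?)", [("a", 100), ("b", 0)])
    conn.commit()

    # The credit to "b" succeeds, the debit from "a" violates the CHECK,
    # so the whole transfer is undone.
    ok = transfer(conn, "a", "b", 500)
    balances = dict(conn.execute("SELECT id, balance FROM account"))
    conn.close()
    return ok, balances
```

Calling `demo()` attempts to move 500 out of an account holding 100. The credit is applied first, then the debit fails the constraint, and the rollback leaves both balances as they were.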
Battleground Between The Two Approaches
• Corporate media asset management
• Scientific data
How Can DBMS Do Better?
• Become easier to use
• Better tools
My Main Fear
• Two fiefdoms are incompatible
  • shades of CORBA and COM
  • cross-fiefdom joins may not work well
• DBMS will export XML
  • no guarantee that client won’t update it and hand it back
  • horrible view update problem...
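The "export XML, client updates it, hands it back" worry can be made concrete with a small sketch. Everything here is an assumption for illustration: the `emp` and `dept` tables, the XML layout, and the `export_xml` and `demo` helpers are hypothetical, and SQLite plus `xml.etree.ElementTree` stand in for a real DBMS and XML toolchain.

```python
import sqlite3
import xml.etree.ElementTree as ET


def export_xml(conn):
    """Export the emp-dept join as XML, one <emp> element per joined row."""
    root = ET.Element("emps")
    rows = conn.execute(
        "SELECT e.name, d.name FROM emp e JOIN dept d ON e.dept_id = d.id"
        " ORDER BY e.name"
    )
    for emp_name, dept_name in rows:
        emp = ET.SubElement(root, "emp", name=emp_name)
        # The department name is copied into every employee element,
        # so the export is a denormalized view of two tables.
        ET.SubElement(emp, "dept").text = dept_name
    return ET.tostring(root, encoding="unicode")


def demo():
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE dept (id INTEGER PRIMARY KEY, name TEXT NOT NULL)")
    conn.execute(
        "CREATE TABLE emp (name TEXT PRIMARY KEY,"
        " dept_id INTEGER REFERENCES dept(id))"
    )
    conn.execute("INSERT INTO dept VALUES (1, 'Sales')")
    conn.executemany("INSERT INTO emp VALUES (?, 1)", [("alice",), ("bob",)])
    exported = export_xml(conn)
    conn.close()

    # The client edits the department under alice only and hands the XML back.
    edited = ET.fromstring(exported)
    edited.find("emp[@name='alice']/dept").text = "Marketing"

    # What the DBMS receives: one department id, two different names.
    return {e.get("name"): e.find("dept").text for e in edited.iter("emp")}
```

The returned document maps alice to "Marketing" and bob to "Sales", although both rows came from the same `dept` record. The DBMS cannot tell whether the client meant to rename department 1 (in which case bob's copy is stale) or to move alice to a different department, and that ambiguity is the view update problem in miniature.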