120 likes | 226 Views
An Analysis of the Publication "An Overview of Data Warehousing and OLAP Technology”. Michael Goshey University of Minnesota, Fall 2006 CSci 8701: Overview of Database Research. Introduction. Selected paper
E N D
An Analysis of the Publication "An Overview of Data Warehousing and OLAP Technology” Michael Goshey University of Minnesota, Fall 2006 CSci 8701: Overview of Database Research
Introduction • Selected paper • S. Chaudhuri and U. Dayal, An Overview of Data Warehousing and OLAP Technology, SIGMOD Record 26(1): 65-74(1997). • Motivation • Personal Interest Michael Goshey: 9/19/2006
Problem Addressed • Problem Statement • Survey: organizing the data warehousing space • Differing requirements between OLTP and OLAP • Significance • Growth area • Reference work establishing consensus on terms, architectures and issues Michael Goshey: 9/19/2006
Major Contributions • Bridging the gulf between industry and academia • OLTP vs. OLAP: clarifying the differences • Concise survey of relevant issues, architectures and tools • Concrete list of data warehouse design and build steps Michael Goshey: 9/19/2006
Key Concepts • OLTP, OLAP • Relational and dimensional data models Michael Goshey: 9/19/2006
Relational Model Example Michael Goshey: 9/19/2006
Dimensional Model Example (image from http://www.laynetworks.com) Michael Goshey: 9/19/2006
Key Concepts (con’t) • Data warehouses and data marts • ETL • Metadata • Managed query vs. ad hoc query and analysis environments Michael Goshey: 9/19/2006
Key Concepts (con’t) • ROLAP and MOLAP • Materialized views • Bitmap indices • SQL extensions Michael Goshey: 9/19/2006
Validation Methodology • Survey paper goals • Academic and industry citations • Referencing tools, vendors Michael Goshey: 9/19/2006
Assumptions • Read-only data warehouse environments • Self-service users and custom views • Ties to transactional requirements • Unanticipated data revisions Michael Goshey: 9/19/2006
2006 Rewrite • Changes in terminology, tools, vendors • Fact constellations -> conformed dimensions • Decision support -> BI • Vendors and tools in BI, ETL, OLAP • Data history difficulties • Multiple user constituencies Michael Goshey: 9/19/2006