330 likes | 547 Views
Introduction to Data Warehousing. Harley Eisenberg Tactics, Incorporated. Data Warehousing Presentation Topics. What is a Data Warehouse? Value Risks and Pitfalls Terminology I want one! - What next?. Who am I?. Data Warehousing Group Manager, Tactics Past President, Eastern Canada OUG
E N D
Introduction to Data Warehousing Harley Eisenberg Tactics, Incorporated
Data Warehousing Presentation Topics • What is a Data Warehouse? • Value • Risks and Pitfalls • Terminology • I want one! - What next?
Who am I? • Data Warehousing Group Manager, Tactics • Past President, Eastern Canada OUG • Past Member, Board of Directors, IOUG • Oracle User and Developer for 11 years
What is a Data Warehouse? • Why do I need a Data Warehouse? • What are its goals? • What are its objectives?
Why do I need a Data Warehouse? • Multiple source systems collecting similar or related information. • Difficulty in accessing this data. • Unable to perform ad-hoc reporting.
Why do I need a DW? • Do you know who your company’s: • Top/Bottom 10 customers are? • By product line? • By region? • By gross sales? • By profitability? • By quantity of orders?
Why do I need a DW? • Do you know who your company’s: • Top/Bottom 10 Salespeople are? • By year/quarter/month? • By gross sales? • By profitability? • By volume of sales?
Why do I need a DW? • Do you know your company’s: • Top/Bottom 10 products? • By gross sales? • By profitability? • By number of repeat orders? • By color/size/shape? • By season/quarter/month?
Why do I need a DW? • Do you know your company’s trends in any of the proceeding areas? • By percentage? • By gross amount? • By profitability?
Data Warehouse Defined “A data warehouse is a collection of corporate information, derived directly from operational systems and some external data sources. Its specific purpose is to support business decisions, not business operations” Oracle Data Warehousing, Corey et al.
The Value of the Data Warehouse The Data Warehouse Defined: • Subject Oriented • Integrated • Time Variant • Non-volatile • Strategic Analysis Tool
The Value of the Data Warehouse Subject Oriented • Focused on the data to meet strategic business goals • Not an enterprise-wide store of all OLTP data
The Value of the Data Warehouse Integrated • Multiple Source Systems • Common Terminology • Single Domain for a Given Data Item • Consistently Updated
The Value of the Data Warehouse Time Variant • Data in the Warehouse is associated with a point in time • Essential for time-series historical analysis and forecasting
The Value of the Data Warehouse Non-volatile • Once loaded, data does not change. • The answer to a given historical question will not change based on the time the question is posed to the warehouse.
The Value of the Data Warehouse Strategic Analysis • Historical analysis • Forecasting analysis • Evaluation of performance to corporate strategic goals • Discovery of revenue opportunities • Cost reduction
Risks and Pitfalls • Target the Wrong or an Incomplete Data Set • Data Cleansing • Design • Clear Dimensional Model • Slowly Changing Dimensions
Risks and Pitfalls Target the Wrong or an Incomplete Data Set • The Field of Dreams Methodology • Lack of End User Involvement
Risks and Pitfalls Data Cleansing Extraction, Transformation and Cleansing account for up to 80% of total project effort Source DM Review®
Risks and Pitfalls - Design Clear Dimensional Model (Star Schema) • Facts - Measures of Business Activity • Dimensions - Business Entities that Describe Facts
Risks and Pitfalls - Design A dimension is considered a Slowly Changing Dimension when its attributes remain almost constant over time, requiring relatively minor alterations to represent the evolved state.
Terminology • Fact Table • Dimension Table • Hierarchy • Data Warehouse/Data Mart
Fact Table • A table that is used to store business information (measures) that can be used in mathematical equations. • Quantities • Percentages • Prices
Dimension Table • Table used to store qualitative data about fact records • Who • What • When • Where • Why
Hierarchies • Allow for the ‘rollup’ of data to more summarized levels. • Time • day • month • quarter • year
Data Warehouse vs. Data Mart • Commonly misused/interchanged terms • Everyone has a different definition
Data Warehouse vs. Data Mart • Data Warehouse is larger (more data) • Data Warehouse has information about multiple subject areas. • Data Warehouse may be used to feed one (or more) data marts.
Data Warehouse vs. Data Mart • Data Mart is smaller • Data Mart is restricted (usually) to a single subject area. • Data Mart may feed Data Warehouse.
I Want One! • What’s the next step?
I Want One! • Plan! Plan! Plan! • Resources • Experienced DW Architect • Programmers Familiar with Legacy Systems • Users Familiar with Legacy Data • Code Generator
I Want One! • How long will it take? • Up to Six Months per source system
I Want One! • End User Presentation Tools • Oracle Discoverer • Oracle Express (OFA, OSA) • Business Objects • Cognos (Powerplay, Impromptu) • Many, many others
Summary • DW are used for • Decision support • Analysis • Not the same as your operational systems • Designed badly, they are not useful