190 likes | 199 Views
This course provides an overview of data warehousing, including its definition, principles, and implementation. Students will learn how to design and build data warehouses, extract and transform data, and use data cubes for analysis. Prerequisites include knowledge of relational databases and SQL. Assessment methods include exams, homework, quizzes, and presentations.
E N D
Data Warehousing Course (602389) Outline: What is Data Warehousing? Why study it? Course Prerequisite!! Skills you will learn!! Objectives. Assessment Methods. Main Topics in Data Warehousing. Note: All Data Warehousing Course Materials can be found at the following site: http:// May Al-Nashashibi University of Petra, Amman, Jordan
Information Data • Data Warehousing: is a process of transforming data into information and making it available to users in a timely enough manner to make a difference (through using a Data warehouse). May Al-Nashashibi University of Petra, Amman, Jordan
Why study Data Warehousing? • Operational database undergoes per day transactions which causes frequent changes to the data on daily basis. • But in future if business executive wants to analyze the previous feedback on any data such as product supplier, or the consumer data, then the analyst have no data available to analyze because the previous data is updated due to transactions. • Thus, Operational databases as well as other resources of data required cannot be used. So the need for another method to handle such Data and acquire information to support business decisions. • But How to do so?? May Al-Nashashibi University of Petra, Amman, Jordan
Which are our lowest/highest margin customers ? Who are my customers and what products are they buying? What is the most effective distribution channel? What product prom--otions have the biggest impact on revenue? Which customers are most likely to go to the competition ? What impact will new products/services have on revenue and margins? A Business Decision Maker wants to know…. May Al-Nashashibi University of Petra, Amman, Jordan
All Enterprises have is Data, Data everywhere yet ... • I can’t find the data I need • data is scattered over the network • many versions, subtle differences • I can’t get the data I need • need an expert to get the data • I can’t understand the data I found • available data poorly documented • I can’t use the data I found • results are unexpected • data needs to be transformed from one form to other May Al-Nashashibi University of Petra, Amman, Jordan
What are the Users saying... • Data should be integrated across an enterprise • Summary data has a real value to the organization • Historical data holds the key to understanding data over time • What-if capabilities are required May Al-Nashashibi University of Petra, Amman, Jordan
Prerequisites • Introduction to Data Base. • Students should be comfortable with: • Relational model basics • Relational algebra • SQL • Views and Security • Conceptual database design and ER models • Schema refinement and normal forms • Physical database design and tuning May Al-Nashashibi University of Petra, Amman, Jordan
Skills You’ll Learn • Work in a group in order to build different simple Data Warehouses. • Deploy communication skills. • Deploy proper report writing skills. May Al-Nashashibi University of Petra, Amman, Jordan
Objectives: • Introduce Data modeling. • Principles of database design and database access are revised. • Introduce the principles of data extraction, cleansing, transformation and loading. • Introduce Data cube computation and materialized view selection. • Examine OLAP query processing, star schema, fact tables and dimension tables with emphasis on multi dimensional databases. • Discuss Issues in data warehouse planning, design, implementation, and administration. • Review the role of data warehouse in supporting decision support systems. May Al-Nashashibi University of Petra, Amman, Jordan
Assessment Method %Grade First Exam 20 Second Exam 20 Home works , Quizzes & Presentations 20 Final Exam 40 ___________________________________________________ Total 100 Textbook (s): Ralph Kimball and Margy Ross, “The Data Warehouse Toolkit: The definitive guide to dimensional modeling”, 2013. John Wiley and sons Inc. References: R1: A Vaisman, E Zimányi, “Data Warehouse Systems: Design and Implementation”. Springer, 2014, ISBN: 978-3-642-54654-9. R2: R. Elmasri and S.B. Navathe, “Fundamentals of Database Systems”. 2007 (5th edition). Pearson Education. Other Resources: Students enrolled in the course utilize SQL Server and Microsoft Business Intelligence tools in the Labs May Al-Nashashibi University of Petra, Amman, Jordan
Major Topics in Data Warehousing (1) • Definition of data warehouse. Data warehouse vs. operational DBMS. Dimension and fact tables. Data marts. Surrogate keys. • Choosing a business process to model data. Choosing the grain (atomic level of data) of the business process. Choosing the dimensions that will apply to each fact table record. Choosing the measure that will populate each fact table record. • A generic data warehouse framework. Inmon’s top-down approach. Kimball’s bottom-up approach. • Star Scheme. Snow Flake Scheme. Fact Constellation Scheme. • OLAP. Data warehouse multidimensional cubes. Storage modes: MOLAP, ROLAP, and HOLAP. May Al-Nashashibi University of Petra, Amman, Jordan
Major Topics in Data Warehousing (2) • Building and Querying data warehouse cubes using: Business Intelligence for Visual Studio, SQL Server Analysis Services. • Task of ETL. Data Profiling in ETL. Change Data Capture in ETL. Quality Screens in ETL. Logging in ETL. Dimension/Fact Manager in ETL. Aggregate Builder in ETL. • Use Visual Studio Business Intelligence to Extract, Transform, and Load data into warehouse database. Dimensional Design Process – Case Studies. May Al-Nashashibi University of Petra, Amman, Jordan
Data Warehouse Applications A Data Warehouse helps business executives to organize, analyze and use their data for decision making. Data Warehouse serves as a soul part of a plan-execute-assess "closed-loop" feedback system for enterprise management. Data Warehouse is widely used in the following fields: • Financial Services • Banking Services • Consumer Goods • Retail Sectors. • Controlled Manufacturing May Al-Nashashibi University of Petra, Amman, Jordan
Suggested Videos for Data Warehousing: 1- http://www.youtube.com/watch?v=q77B-G8CA24 2- http://www.youtube.com/watch?v=Jhcz0w4PC7Y 3- http://www.youtube.com/watch?v=d8nC_IaUPG4 4- http://www.youtube.com/watch?v=6k3nwXXpnMY 5- http://www.youtube.com/watch?v=zTs5zjSXnvs 6- http://www.youtube.com/watch?v=9Akvz2x0az4 May Al-Nashashibi University of Petra, Amman, Jordan
The Three domains of educational activities or learning are: Cognitive: mental skills (knowledge) Affective: growth in feelings or emotional areas (attitude or self) Psychomotor: manual or physical skills (skills) May Al-Nashashibi University of Petra, Amman, Jordan