90 likes | 178 Views
MAIN BOOKS. DATA WAREHOUSING IN THE REAL WORLD : Sam Anshory & Dennis Murray, Pearson DATA MINING CONCEPTS AND TECHNIQUES : Jiawei Han & Micheline Kamber, Morgan Kaufmann BUILDING THE DATA WAREHOUSE, W.H. Inmon, John Wiley & Sons DATA MINING TECHNIQUES : Arun Pujari, Universities-Press.
E N D
MAIN BOOKS DATA WAREHOUSING IN THE REAL WORLD : Sam Anshory & Dennis Murray, Pearson DATA MINING CONCEPTS AND TECHNIQUES : Jiawei Han & Micheline Kamber, Morgan Kaufmann BUILDING THE DATA WAREHOUSE, W.H. Inmon, John Wiley & Sons DATA MINING TECHNIQUES : Arun Pujari, Universities-Press
ADDITIONAL READINGS • INTRODUCTION TO DATA MINING WITH CASE STUDIES, G. K. Gupta (PHI) • DATA MINING Introductory & Advanced Topics, Margaret Dunham, Prentice Hall
OLTP & OLAP • OLTP covers day-to-day operations of an organization such as purchasing, inventory, payroll etc. • OLAP caters to knowledge workers and aid in decision making
OLTP & OLAP • OLTP is customer oriented & used by clerks, clients & IT professional • OLAP is market oriented used for data analysis by knowledge workers including managers, executives and analysts
OLTP & OLAP • OLTP uses current data, usually very detailed • OLAP uses large amounts of historical data with facilities for summarization, aggregation. Information is maintained at different levels of granularity
OLTP & OLAP • OLTP uses E-R model • OLAP uses STAR or SNOWFLAKE model & a subject oriented database design
OLTP & OLAP • OLTP consist of small, atomic transactions. No of records usually in terms of tens • OLAP consist mainly of read-only operations. No of records accessed usually in terms of millions
OLTP & OLAP • In OLTP number of users in terms of thousands, DB size is 100 MB to GB, high performance, high availability Metric for measurement is transaction throughput • In OLAP number of users in terms of hundreds, 100 GB to TB, high flexibility & end-user autonomy Metric for measurement is query throughput
OLTP & OLAP • OLTP - unit of work is short simple transaction, detailed flat relational view, index/hashing on primary key for operation • OLAP – unit of work is complex query, summarized multi-dimensional view, lots of scan involved in operation