CPIT 440 Data Mining and Warehouse
Lab 1: Introduction and Overview
kholoudsaadkau@yahoo.com
• Office #: 127
• Office Hours
  • Sunday and Tuesday: (11 - 1)
  • Monday: (12 - 1)
  • Thursday: (11 - 12)
The Lab Syllabus
Software Used
• Oracle Data Miner (ODM)
• RapidMiner (for ETL)
Grade Distribution
Lab Group
• In Acadox
Lab 1: Outline
• Overview
• Important Terminology
  • Data
  • Information
  • Knowledge
  • Warehouses
• What Is Data Mining?
What Is Data Mining?
• Data mining is the process of analyzing data from different perspectives (different dimensions or angles), then categorizing and summarizing it into useful information.
• This information can be used to increase revenue, cut costs, find solutions to a specific issue, or make a development plan in a learning environment, and so on.
Terminologies
• Data: any facts, numbers, or text that can be processed by a computer. Data can be:
  • Operational or transactional data
  • Nonoperational data
  • Metadata
• Information: the patterns, associations, or relationships among all this data can provide information.
  • For example, analysis of retail point-of-sale transaction data can yield information on which products are selling and when.
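The point-of-sale example above can be sketched in a few lines of Python. This is only an illustration of turning transactional data into information: the `transactions` records, the product names, and both helper functions are hypothetical and are not part of the lab material, which uses Oracle Data Miner and RapidMiner.

```python
from collections import Counter, defaultdict

# Hypothetical point-of-sale records: (product, hour of day, quantity sold)
transactions = [
    ("milk", 9, 2),
    ("bread", 9, 1),
    ("milk", 18, 3),
    ("eggs", 18, 1),
    ("bread", 18, 2),
    ("milk", 18, 1),
]

def units_per_product(rows):
    """Sum the quantity sold for each product (which products are selling)."""
    totals = Counter()
    for product, _hour, qty in rows:
        totals[product] += qty
    return totals

def busiest_hour_per_product(rows):
    """Find the hour with the most units sold for each product (when they sell)."""
    by_hour = defaultdict(Counter)
    for product, hour, qty in rows:
        by_hour[product][hour] += qty
    return {p: hours.most_common(1)[0][0] for p, hours in by_hour.items()}

totals = units_per_product(transactions)
peak = busiest_hour_per_product(transactions)
print(totals.most_common())  # products ordered by units sold
print(peak)                  # peak selling hour per product
```

The raw rows are the data; the per-product totals and peak hours are the information derived from them.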
Terminologies
• Knowledge: information can be converted into knowledge about historical patterns and future trends.
  • For example, summary information on retail supermarket sales can be analyzed to provide knowledge of consumer buying behavior.
  • Thus, a manufacturer or retailer could determine which items are most susceptible to promotional efforts.
• Data Warehouses: data warehousing is defined as a process of centralized data management and retrieval.
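As a rough illustration of the Knowledge example (finding items that respond most to promotions), the sketch below compares average sales with and without a promotion. The `weekly_sales` figures, the item names, and the simple "lift" ratio are assumptions made for this example; the slides do not prescribe a specific measure.

```python
from collections import defaultdict

# Hypothetical weekly summary: (item, on_promotion, units_sold)
weekly_sales = [
    ("cola", False, 100), ("cola", True, 180),
    ("rice", False, 80), ("rice", True, 88),
    ("chips", False, 50), ("chips", True, 120),
]

def promotion_lift(rows):
    """Return {item: average promoted units / average regular units}."""
    # For each item keep [total units, number of weeks], split by promotion flag.
    sums = defaultdict(lambda: {True: [0, 0], False: [0, 0]})
    for item, promo, units in rows:
        sums[item][promo][0] += units
        sums[item][promo][1] += 1
    lift = {}
    for item, s in sums.items():
        promo_total, promo_n = s[True]
        base_total, base_n = s[False]
        # Skip items that lack either promoted or regular weeks.
        if promo_n and base_n and base_total:
            lift[item] = (promo_total / promo_n) / (base_total / base_n)
    return lift

lift = promotion_lift(weekly_sales)
# Items ordered from most to least responsive to promotion.
ranked = sorted(lift, key=lift.get, reverse=True)
print(ranked)
```

A higher ratio suggests an item is more susceptible to promotional efforts; a real analysis would also need to control for season, price, and other factors.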
Data Mining Overlaps with Many Disciplines
• Statistics
• Machine Learning
• Information Retrieval
• Distributed Computing
• Database Systems
Why Do We Need Data Mining?
• Student Discussion