Data Mining: An Introduction
Wing Kee Ho, Xiaohua Luan
Outline of Today's Presentation
• Definition of data mining
• Comparison of data mining vs. DBMS
• Sample data mining tasks in daily life
• Data mining development
Definition of Data Mining
The nontrivial extraction of implicit, previously unknown, and potentially useful information from data
Why do we need to mine the data?
• Too much data and too little information
• There is a need to extract useful information from the data and to interpret the data
Comparison of DBMS and Data Mining
DBMS
• Query: SQL
• Output: precise; a subset of the database
Data Mining
• Query: no precise query language
• Output: fuzzy; not a subset of the database
Data Mining or DBMS?
• Last month's sales for each product
• The profit forecast for next month
• List of customers who let their policies lapse
• The characteristics of customers who let their policies lapse
Sample Data Mining Examples
• Association Rules
• Clustering
• Time-Series Forecasting
Association Rules, cont.
• Objective: identify items that occur together
• Support of {salsa, chips} is 80%
• Support of {bread, milk} is 60%
• The results are useful for shelving, merchandising, and pricing.
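The support figures on this slide can be reproduced with a short sketch. Support of an itemset is the fraction of transactions that contain every item in it. The baskets below are hypothetical and were chosen only so that the numbers match the slide; the function name `support` is likewise an illustration, not part of any library.

```python
def support(itemset, transactions):
    """Return the fraction of transactions containing every item in `itemset`."""
    itemset = frozenset(itemset)
    if not transactions:
        return 0.0
    # A transaction counts as a hit when the itemset is a subset of it.
    hits = sum(1 for basket in transactions if itemset <= basket)
    return hits / len(transactions)


# Hypothetical market baskets (made up for illustration only).
baskets = [
    frozenset({"salsa", "chips", "bread", "milk"}),
    frozenset({"salsa", "chips", "bread", "milk"}),
    frozenset({"salsa", "chips", "bread", "milk"}),
    frozenset({"salsa", "chips"}),
    frozenset({"eggs"}),
]

salsa_chips = support({"salsa", "chips"}, baskets)  # 4 of 5 baskets
bread_milk = support({"bread", "milk"}, baskets)    # 3 of 5 baskets
print(f"{salsa_chips:.0%} {bread_milk:.0%}")
```

Itemsets with high support are the candidates from which association rules are usually derived.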
Clustering: market segmentation as an example
Each point represents the characteristics of a customer
Clustering, cont.
• Objective: group members that have similar characteristics together
• Widely applied in fraud detection, business and finance, and science
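The slides do not name a specific clustering algorithm. As one common choice, the sketch below uses k-means (Lloyd's algorithm) to split customers into two segments. The customer data (age, annual spend in thousands) and the starting centroids are hypothetical, and `kmeans` is an illustrative helper rather than a library function.

```python
import math


def kmeans(points, centroids, iterations=10):
    """Plain k-means on tuples of numbers, starting from the given centroids."""
    centroids = list(centroids)
    labels = []
    for _ in range(iterations):
        # Assignment step: give each point the index of its nearest centroid.
        labels = [
            min(range(len(centroids)), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its assigned points.
        for j in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # leave a centroid in place if no point chose it
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centroids


# Hypothetical customers: (age, annual spend in thousands).
customers = [(25, 2), (27, 3), (24, 1), (60, 12), (62, 11), (58, 13)]

labels, centers = kmeans(customers, centroids=[customers[0], customers[3]])
print(labels)
print(centers)
```

Each label identifies the market segment a customer falls into. Real data would usually need feature scaling and a more careful choice of starting centroids.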
Statistical Analysis
• Regression: housing price vs. area (sq. feet)
• Time series: sales volume vs. time
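The regression example (housing price against area) can be sketched as a simple least-squares line fit. The figures below are hypothetical and deliberately lie on a straight line so the result is easy to check by hand; `fit_line` is an illustrative helper, not a library call.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and the x-y cross-deviation sum.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data: area in sq. feet, price in thousands.
areas = [1000, 1500, 2000, 2500]
prices = [200, 275, 350, 425]

slope, intercept = fit_line(areas, prices)
predicted = slope * 1800 + intercept  # estimated price for an 1800 sq. ft. house
print(round(slope, 4), round(intercept, 2), round(predicted, 2))
```

Time-series forecasting of sales volume follows the same idea with time as the explanatory variable, although dedicated models usually also account for trend and seasonality.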
More Data Mining Techniques
• Decision trees
• Neural networks
• Combinations of several data mining techniques
Implications for different interested parties:
• Database users: new skills to learn to secure your job!
• Database developers: develop new functions and better interfaces
• General public: less privacy, or more convenience?