60 likes | 71 Views
This summary discusses the principles, methods, and functionalities of data mining systems, focusing on the role of data mining in KDD processes, data preprocessing and cleaning methods, association rules, classification methods, and clustering methods. It covers topics such as data reduction, missing values, outlier elimination, apriori search for frequent patterns, instance-based classification techniques, Bayesian classifiers, decision tree methods, decision rules methods, and clustering algorithms.
E N D
Summary „Data mining” Vietnam national university inHanoi, College of technology, Feb.2006
Main topics: • Definition, principles and functionalities of data mining systems • Data mining role in KDD processes • Data preprocessing and data cleaning methods • Association rules • Classification methods • Clustering methods
Data preprocessing and data cleaning • Discretization methods • Data reduction methods • Missing values • Outlier elimination
Association rules • Definition, possible applications • Apriori search for frequent patterns and association rules • Modifications of apriori algorithms: hash tree, Apriori-Tid, Apriori-Hybrid • FP-tree method
Classification methods • Instance-based classification techniques • Bayesian classifiers • Decision tree methods • Decision rules methods • Classifier evaluation techniques
Clustering methods • K-means and K-medoids algorithms • Hierarchical clustering • Density clustering