320 likes | 493 Views
Advanced classifiers. Decision trees Object-based classifiers Others. DECISION TREES. Non-parametric approach Data mining tool used in many applications, not just RS Classifies data by building rules based on image values
E N D
Advanced classifiers Decision trees Object-based classifiers Others
DECISION TREES • Non-parametric approach • Data mining tool used in many applications, not just RS • Classifies data by building rules based on image values • Rules form trees that are multi-branched with nodes and “leaves” or endpoints • Also referred to as classification trees • Regression trees are similar but work for predicting continuous data • Classification trees only predict thematic data (e.g. landcover class)
DECISION TREES Based on entropy and information gain Entropy represents homogeneity in the data; the less homogeneous the data, the more entropy you have in the system Try to reduce entropy through the clumping of data into groups. Reduced entropy = increase in information gain
DECISION TREES Source: Wikipedia • A tree is created by splitting the source set into subsets based on an attribute value test. • This process is repeated on each derived subset in a recursive manner called recursive partitioning. The recursion is completed when the subset at a node has all the same value of the target variable, or when splitting no longer adds value to the predictions. • This is a ‘top-down’ strategy known as top-down induction of decision trees (TDIDT).
DECISION TREES – C5/See5 • Widely used in the remote sensing community • Evolved from ID3 and then C4.5 • Developed by Russ Quinlan in Australia • Founded Rulequest Research company, specializing in data mining tools • Now available via open source products like R and Weka • Other decision trees are available, notably Random Forests which is available in R
Dection tree vs. MLC Change Detection in Sumatra
Knowledge-based classifiers Rules-based approach that uses conditional statements to test a series of hypotheses describing a set of informational classes; these conditions are usually supported by ancillary GIS data (image or vector-based)
WORKFLOW IN ERDAS KNOWLEDGE ENGINEER Knowledge-Based Classifier
Object-Based Image Analysis Traditional image classification methods only consider “color” (spectral signature) when classifying imagery. Object-oriented classification also considers shape, size, texture, pattern, shadow, and association
Object-Based Image Analysis • Image pixels are treated as groups, or objects, not individually • Statistics and relationships are generated and assigned for the entire object • Especially useful for high resolution imagery, which contain a large amount of within-class spectral variability • Consider example of forest area where trees and shadows are visible. Some pixels will be much darker because of the shadow, but still belong to the forest class.
Segmentation • Segmentation is necessary to create the groups • Based on similar image characteristics, as well as location. Neighboring pixels are joined. • Segments can be created at multiple scales • Multiple datasets can be used to create the segments
Segmentation Jensen, 2005, Plate 9-4
Segmentation Level 25 Level 3
Image Objects • Are attributed with spectral and spatial metrics: spectral mean, spectral variance, mean difference to neighbor, length to width ratio, area, shape index (smoothness), etc.
Object-based classification • Image objects can be classified using statistical cluster analysis algorithms.
Advantages of object-oriented classification • Most accurate method with high-resolution imagery • Can be applied to both multi-spectral and single band data • More consistent, efficient, and replicable than visual interpretation/digitizing • Established methods, though expensive and “black box”
Object-based Classification Accuracy Assessment • Traditional methods may not be appropriate when mapping a single feature; often used in combination with a “quantity difference” comparison (overestimation/underestimation of class compared with digitized data)
Feature Extraction • Feature extraction is an object-oriented classification technique traditionally used to extract the key features (or objects) of a given image
Feature Extraction • Feature extraction software typically responds to training and feedback by the analyst in a process known as learning
Train Learner Identify Correct, Incorrect, and Missed Run Process Examine Results Accept Results
Train Learner learning Identify Correct, Incorrect, and Missed Run Process Examine Results Accept Results
Feature Extraction • Utilizes spatial context information as well as spectral information to determine results
Basic steps of feature extraction Define the targeted features by digitizing training examples Use spectral, spatial, and mathematical image information to generate results from the training set Improve results using “learning” to select correct and incorrect examples from returned features Examine final results and post-process to remove “clutter”
Other advanced classifiers: machine learning options Support vector machines Neural networks Highly sophisticated – generally used by more advanced image analysts