Pattern Recognition Final Task

Pattern RecognitionFinal Task Ibrahim Arief – 185099 Timo Eckhard – 185126 University of Joensuu December 17th, 2009

Contents • M-Fold-Cross Training • Color Data • Preprocessing • Bayesian Classifier • Multilayer Perceptron • K-Means Clustering • Speech Data • Preprocessing • Bayesian Classifier • Multilayer Perceptron • K-Means Clustering • Summary

M-Fold-Cross Training • Partition into M subsets • One subset is assigned as test subset, the rest is training subset • We use the training subset for testing against test subset • Assign other subset as new test subset, the rest is training subset for that particular one • Repeat until all partition took their turn being tested

Spectral Color Data - Preprocessing

Spectral Color Data – Bayesian Classifier (1) • Raw spectral input – all classified to class 3

Spectral Color Data – Bayesian Classifier (2) • Preprocessing : Tristimulus • Nice clumping, linearly separable

Spectral Color Data – Bayesian Classifier (3) • Very high accuracy : 99.97%

Spectral Color Data – Multi Layer Perceptron • Raw spectral data as input : ~5% • Tristimulus as input : ~30% • Question : parameters? • Answer : exhaustive search?

Spectral Color Data – K-Means Clustering

Speech Data – Preprocessing (1) • MFCC – Timeseries? • Plot of coefficients within a class

Speech Data – Preprocessing (2) • Plot of variance for each coefficient

Speech Data – Preprocessing (3) • Plot of bayesian accuracy for n-least-varied

Speech Data – Preprocessing (4) • Delta-coefficients • Source: http://cslu.cse.ogi.edu/fsj/issues/issue5/sparse-ann/PhoneProbEst.html • Formula • Dimensionality reduction

Speech Data – Bayesian Classifier • Frequency matters • No risk matrix • Raw accuracy : 18.13% • Delta-coefficient preprocessing : 96.06%

Speech Data – Multi Layer Perceptron • Hidden Neuron : 22 • Normalized Raw Data : 20.25% • Reduced dimension, delta coefficient : 29.52% • Delta coefficient without reduced dimension : 27.84%

Summary – Spectral Color Data • Bayesian Classifier • Raw Data : 3.92% • Preprocessed : 99.97% (tristimulus) • Multi Layer Perceptron • Raw Data : ~5% • Preprocessed : 58.1% (tristimulus) 99.7% (tristimulus + CIELAB + sRGB) • K-Means Clustering • Raw data : 92% • Preprocessed : 95%

Summary – Speech Data • Bayesian Classifier • Raw Data : 18.19% • Preprocessed : 96.09% (delta-derivative, high variance elimination) • Multi Layer Perceptron • Raw Data : 20.25% • Preprocessed : 29.52% (delta-derivative, high variance elimination) • K-Means Clustering • Raw data : 24% • Preprocessed : 62% (normalized, delta-derivative)

Pattern Recognition Final Task

Pattern Recognition Final Task

Presentation Transcript

Pattern Recognition

Pattern recognition

Pattern Recognition

Pattern Recognition

Pattern Recognition

Pattern Recognition

Pattern recognition

Pattern Recognition

Pattern Recognition

Pattern Recognition

Pattern Recognition

Pattern Recognition

Pattern Recognition

Pattern Recognition

Pattern Recognition

Pattern Recognition

Pattern Recognition

Pattern recognition

Pattern Recognition

Pattern Recognition