THE LEARNING AND USE OF GRAPHICAL MODELS FOR IMAGE INTERPRETATION
Thesis for the degree of Master of Science
By Leonid Karlinsky
Under the supervision of Professor Shimon Ullman
Best = Maximal MI
Classification Goal: classify C on new examples with minimum error, using a subset of "trained" features F
Training tasks: • Best feature subset F • Best parameters θ • Efficient model More…
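For reference, the minimum-error rule behind this goal is the standard MAP decision (expanded in the "Classification Specifics" backup slide):

$$ c^{*} \;=\; \arg\max_{c}\; P\!\left(c \mid f_{1}, \dots, f_{n}\right) $$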
MaxMI Training - The Past
• Model: simple "flat" structure over NCC-thresholded fragment features
• Training:
• Features and thresholds selected one by one (sketched below)
• Conditional independence given C increased the MI upper bound More…
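A minimal runnable sketch of that one-by-one loop, assuming a matrix of NCC scores (examples × candidate fragments) and binary class labels: binarize each candidate by a threshold and keep the (fragment, threshold) pair with maximal MI. The prior work's exact selection criterion may differ, and every name here is illustrative, not from the thesis.

```python
import numpy as np

def mi_binary(f, c):
    # Empirical MI (in nats) between a binary feature f and a binary class c.
    joint = np.zeros((2, 2))
    np.add.at(joint, (f, c), 1)
    joint /= joint.sum()
    pf, pc = joint.sum(1, keepdims=True), joint.sum(0, keepdims=True)
    nz = joint > 0
    return float((joint[nz] * np.log(joint[nz] / (pf @ pc)[nz])).sum())

def select_fragments(ncc, labels, thresholds, k=3):
    # Greedily pick k fragments, each with the NCC threshold maximizing MI.
    chosen = []
    for _ in range(k):
        mi, j, t = max((mi_binary((ncc[:, j] > t).astype(int), labels), j, t)
                       for j in range(ncc.shape[1])
                       if all(j != cj for cj, _ in chosen)
                       for t in thresholds)
        chosen.append((j, t))
    return chosen  # (fragment index, threshold) pairs, in selection order

rng = np.random.default_rng(0)
ncc = rng.random((100, 6))               # toy NCC score matrix
labels = rng.integers(0, 2, size=100)    # toy binary class labels
print(select_fragments(ncc, labels, thresholds=[0.3, 0.5, 0.7]))
```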
MaxMI Training – Our Approach
Learn the model structure and parameters θ altogether, maximizing the mutual information I(C; F).
MaxMI Training – Learning
Maximize I(C; F) over structure and parameters θ together
MaxMI: decompose the MI, then efficiently learn the parameters using GDL (the Generalized Distributive Law) More…
MaxMI Training – Assumptions
• TAN model structure – Tree Augmented Naïve Bayes [Friedman, 97]
• Feature Tree (FT) – removing C preserves the tree over the features
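For reference, the TAN factorization from [Friedman, 97], where pa(i) denotes F_i's single feature parent (empty at the tree root), together with the MI decomposition the FT assumption licenses; this is a sketch of the identity the "Decompose MI" step can rest on, and the thesis's exact derivation may differ:

$$ P(C, F_1, \dots, F_n) \;=\; P(C) \prod_{i=1}^{n} P\!\left(F_i \mid C, F_{\mathrm{pa}(i)}\right), \qquad I(C; F) \;=\; \sum_{i=1}^{n} I\!\left(C;\, F_i \mid F_{\mathrm{pa}(i)}\right) $$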
MaxMI Training – TAN and θ
• The TAN structure is unknown
• Learn θ and the TAN structure s.t.:
• I(C; F) is maximized
• Asymptotic correctness
• FT holds
• Efficiency
MaxMI Training – MaxMI hybrid
• [Chow & Liu, 68]: best feature tree
• [Friedman, 97]: best TAN
• MaxMI: maximize I(C; F) directly, hybridizing both steps (the Chow & Liu step is sketched below) More…
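A minimal sketch of the Chow & Liu step referenced above: a maximum-weight spanning tree over the features, with empirical pairwise MI as edge weights. It assumes discrete feature columns; the function names are illustrative, not from the thesis.

```python
import numpy as np

def pairwise_mi(xi, xj):
    # Empirical mutual information (nats) between two discrete feature columns.
    joint = np.zeros((xi.max() + 1, xj.max() + 1))
    for a, b in zip(xi, xj):
        joint[a, b] += 1
    joint /= joint.sum()
    pi, pj = joint.sum(1, keepdims=True), joint.sum(0, keepdims=True)
    nz = joint > 0
    return float((joint[nz] * np.log(joint[nz] / (pi @ pj)[nz])).sum())

def chow_liu_tree(X):
    # Maximum-weight spanning tree (Kruskal + union-find), weights = pairwise MI.
    n = X.shape[1]
    edges = sorted(((pairwise_mi(X[:, i], X[:, j]), i, j)
                    for i in range(n) for j in range(i + 1, n)), reverse=True)
    parent = list(range(n))
    def find(a):
        while parent[a] != a:
            parent[a] = parent[parent[a]]
            a = parent[a]
        return a
    tree = []
    for w, i, j in edges:
        ri, rj = find(i), find(j)
        if ri != rj:                 # edge joins two components: keep it
            parent[ri] = rj
            tree.append((i, j))
    return tree

# Toy usage: 5 binary features, 200 samples.
X = np.random.default_rng(0).integers(0, 2, size=(200, 5))
print(chow_liu_tree(X))
```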
MaxMI Training – MaxMI hybrid
• Convergent algorithm: alternate the θ-update and TAN-restructuring steps until MI stops improving More…
MaxMI Training – Generalizations
• Train any parameters
• Any low-treewidth structure
• Extends even without the assumptions above
Loopy network example
• Want to solve MAP: x* = argmax_x P(x_1, …, x_n)
• NP-hard in general! [Cooper 90, Shimony 94]
Our approach – opening loops
• Duplicate a loop-closing variable z into copies z1, z2: the loopy graph becomes a tree
• Now, we can maximize the resulting loop-free function
• The assignment is legal for the loopy problem if the copies agree: z1 = z2
Our approach – opening loops
• Legally maximize: the best assignment with z1 = z2
• Can maximize unrestricted: the best assignment ignoring the agreement constraint
• Usually the unrestricted maximum exceeds the legal one (an upper bound only)
• Our solution – slow connections
Our approach – slow connections
• Fix z = Z
• Maximize (loop-free, use GDL)
• Now legalize (update Z) and return to step one
• Iterate until convergence. This is the Maximize-and-Legalize algorithm, sketched below.
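A minimal runnable sketch of the Maximize-and-Legalize loop on the smallest loopy case, a triangle of binary variables. The tree maximization is brute-forced here for brevity where the thesis uses GDL, and all names (psi_xy, legal_score, …) are illustrative, not from the thesis.

```python
import itertools
import numpy as np

rng = np.random.default_rng(0)
# Log-domain pairwise potentials on a triangle x - y - z - x (binary variables).
psi_xy, psi_yz, psi_zx = (rng.normal(size=(2, 2)) for _ in range(3))

def legal_score(x, y, z):
    # Score of a legal assignment of the original loopy (triangle) problem.
    return psi_xy[x, y] + psi_yz[y, z] + psi_zx[z, x]

# Open the loop: duplicate z into z1 (tied to y) and z2 (tied to x), giving
# the loop-free chain z2 - x - y - z1. Slow connection: hold z2 at a constant Z.
Z, best = 0, (0, 0, 0)
for _ in range(10):
    # Maximize step: maximize the chain over (x, y, z1) with z2 = Z fixed.
    # (Brute force over 8 assignments here; GDL/dynamic programming in general.)
    x, y, z1 = max(itertools.product((0, 1), repeat=3),
                   key=lambda a: psi_xy[a[0], a[1]] + psi_yz[a[1], a[2]]
                                 + psi_zx[Z, a[0]])
    if legal_score(x, y, z1) > legal_score(*best):
        best = (x, y, z1)            # track the best legal assignment so far
    if z1 == Z:                      # copies agree: the assignment is legal
        break
    Z = z1                           # Legalize step: make the slow copy agree
print("assignment:", best, "score:", legal_score(*best))
```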
Our approach – slow connections
When will this work? The intuition: the z-minor
• Strong z-minor: global maximum in a single step
• Weak z-minor: local optimum over several steps
Making the assumptions true – Selecting z-variables
• The intuition: recursive z-selection
• Recursive strong z-minor: single step, global maximum!
• Recursive weak z-minor: iterations, local maximum
• Different / same speed connections
• Remove – Contract – Split algorithm More…
Making the assumptions true – Approximating the function
• The intuition: recursively "chip away" small parts of the function More…
Existing approximation algorithms • Clustering: triangulation [Pearl, 88] • Loopy Belief Revision [McEliece, 98] • Bethe-Kikuchi Free-Energy: CCCP [Yuille, 02] • Tree Re-Parametrization (TRP) [Wainwright, 03]
Experimental Results More…
Maximum MI vs. Minimum PE (probability of error) More…
Thank you for your time
Classification Specifics
• How do we classify a new example? MAP: c* = argmax_c P(c | f_1, …, f_n)
• What are "the best" features and parameters? Maximize MI: the F and θ maximizing I(C; F)
• Why maximize MI? Tightly related to PE, the probability of error (see the bound below)
• More reasons – if time permits Back…
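One standard way to make the "tightly related to PE" claim precise (the thesis may use a different or tighter bound) is Fano's inequality, where H_b is the binary entropy and |𝒞| the number of classes:

$$ H(C \mid F) \;\le\; H_b(P_e) + P_e \log\bigl(|\mathcal{C}| - 1\bigr), \qquad I(C; F) \;=\; H(C) - H(C \mid F) $$

Larger I(C; F) forces smaller H(C | F), and with it a smaller feasible error probability P_e.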
MaxMI Training - The Past - Reasons
• Why did it work?
• Conditional independence given C
• Increased the MI upper bound
• What was missing?
• Conditional independence given C was assumed!
• Maximizing the "whole" MI
• Learning the model structure Back…
MaxMI Training – JT
• Junction tree (JT) structure = TAN structure
• GDL cost is exponential in the treewidth
• For TAN, treewidth = 2 Back…
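To make the treewidth point concrete, here is a minimal GDL instance: max-product (Viterbi-style) dynamic programming on a chain, whose per-step cost is k², exponential only in the treewidth (1 for a chain, 2 for TAN). This is a generic illustration, not code from the thesis.

```python
import numpy as np

def chain_map(unary, pairwise):
    # MAP on a chain by max-product message passing (a GDL special case).
    # unary: list of length-k log-potential vectors, one per variable.
    # pairwise: list of k-by-k log-potential matrices, one per chain edge.
    alpha, back = [np.asarray(unary[0], float)], []
    for u, psi in zip(unary[1:], pairwise):
        scores = alpha[-1][:, None] + psi      # k x k: predecessor x current
        back.append(scores.argmax(axis=0))     # best predecessor per state
        alpha.append(scores.max(axis=0) + u)   # max-message into next variable
    states = [int(alpha[-1].argmax())]
    for bp in reversed(back):                  # backtrack the argmax path
        states.append(int(bp[states[-1]]))
    return states[::-1], float(alpha[-1].max())

rng = np.random.default_rng(1)
unary = [rng.normal(size=3) for _ in range(4)]          # 4 ternary variables
pairwise = [rng.normal(size=(3, 3)) for _ in range(3)]  # 3 chain edges
print(chain_map(unary, pairwise))
```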
MaxMI Training – EM
• [Redner, Walker, 84] EM algorithm: training the CPTs with EM
Why not EM?
• EM assumes static training data!
• Not true in our scenario! Back…
MaxMI Training – MaxMI hybrid solution
• [Chow, Liu 68]: "best" feature tree
• [Friedman, et al. 97]: "best" TAN
• [This work, 2004]: maximal MI Back…
MaxMI Training – MaxMI hybrid solution
• MI-increasing step: ICR
• MI non-decreasing step: TAN restructuring
• Together: asymptotic correctness Back…
MaxMI Training – empirical results (plots: MI before vs. after training) Back…
Making the assumptions true – Approximating the function
• Strong z-minor
• Challenge: selecting proper Z constants
• Benefit: single-step convergence
• Weak z-minor
• Drawback: exponential in the number of "chips"
• Benefit: less restrictive Back…
The clique tree Back…
Experimental Results Back…
MaxMI Training – extensions
• Observed and unobserved (O&U) variables in the model:
• MaxMI augmented to support O&U
• Training on observed features only + an EM heuristic
• Complete training
• Constrained and greedy TAN restructuring
• MaxMI vs. MinPE in the ideal scenario – characterization and comparison
• Future research directions