Learning Low-Level Vision
William T. Freeman, Egon C. Pasztor, Owen T. Carmichael
Model image and scene patches as nodes in a Markov network
[Figure: a grid of image patches (observations) connected to underlying scene patches; Φ(x_i, y_i) links each scene node to its image patch, Ψ(x_i, x_j) links neighboring scene nodes.]
Network joint probability
P(x, y) = (1/Z) ∏_(i,j) Ψ(x_i, x_j) ∏_i Φ(x_i, y_i)
where x is the scene and y is the image. Ψ(x_i, x_j) is the scene-scene compatibility function between neighboring scene nodes, Φ(x_i, y_i) is the image-scene compatibility function between a scene node and its local observation, and Z is a normalization constant.
Super-resolution
• Image: low-resolution image
• Scene: high-resolution image (the ultimate goal...)
[Figure: a low-resolution image paired with the high-resolution scene underlying it.]
Representation
To minimize the complexity of the relationships we have to learn, we remove the lowest frequencies from the input image and normalize the local contrast level.
[Figure panels: zoomed low-freq.; full-freq. original; true high freqs; low-band input (contrast normalized, PCA fitted).]
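A minimal sketch of this preprocessing, assuming SciPy's Gaussian filter as the low-pass; the filter width, the contrast estimate, and the epsilon are illustrative choices, not the authors' exact pipeline:

    import numpy as np
    from scipy.ndimage import gaussian_filter

    def preprocess(image, sigma_low=3.0, eps=0.01):
        """Remove the lowest frequencies and normalize local contrast.
        Returns the remaining band divided by a local contrast estimate,
        so training patches become comparable across images.
        (sigma_low and eps are assumed values for illustration.)"""
        low = gaussian_filter(image, sigma_low)            # lowest frequencies
        band = image - low                                 # remove them
        contrast = gaussian_filter(np.abs(band), sigma_low) + eps
        return band / contrast                             # contrast-normalized input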
Training images: ~100,000 image/scene patch pairs. Images from two Corel database categories: "giraffes" and "urban skyline".
Gather ~100,000 patches
[Figure: training data samples (magnified), paired high-freq. and low-freq. patches.]
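A sketch of gathering the paired training patches, assuming the preprocess() helper above has produced the low-band input and that the high frequencies are the scene; patch size and stride are illustrative:

    def gather_patches(low_band, high_freqs, size=7, stride=3):
        """Collect co-located (low-frequency input, high-frequency scene)
        patch pairs from one training image. Inputs are 2-D numpy arrays;
        size and stride are assumed values."""
        pairs = []
        h, w = low_band.shape
        for r in range(0, h - size, stride):
            for c in range(0, w - size, stride):
                x = high_freqs[r:r+size, c:c+size].ravel()  # scene patch
                y = low_band[r:r+size, c:c+size].ravel()    # image patch
                pairs.append((y, x))
        return pairs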
Nearest neighbor estimate
[Figure: input low freqs.; estimated high freqs., found by matching each input patch against the training data samples (magnified), paired high freqs. and low freqs.]
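A minimal nearest-neighbor sketch: for each input patch, look up the training pair whose low-frequency patch is closest and paste in its high frequencies. This brute-force L2 search stands in for whatever candidate search the full system uses:

    import numpy as np

    def nearest_neighbor_estimate(y_patch, pairs):
        """Return the high-frequency patch whose paired low-frequency
        training patch is closest (L2) to the input patch."""
        ys = np.stack([y for y, _ in pairs])
        dists = np.sum((ys - y_patch) ** 2, axis=1)
        _, x_best = pairs[int(np.argmin(dists))]
        return x_best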
Image-scene compatibility function, Φ(x_i, y_i)
Assume Gaussian noise takes you from the observed image patch to the synthetic sample:
Φ(x_i, y_i) ∝ exp( −|y_i − y(x_i)|² / 2σ² )
where y(x_i) is the image patch associated with candidate scene sample x_i in the training data.
[Figure: observed image patch y and candidate scene sample x.]
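A sketch of this compatibility, assuming each candidate scene sample carries the low-frequency patch it was paired with in training; sigma is an assumed noise level:

    import numpy as np

    def phi(y_obs, y_candidate, sigma=1.0):
        """Image-scene compatibility: Gaussian likelihood of the observed
        patch given the candidate's associated training patch."""
        d2 = np.sum((y_obs - y_candidate) ** 2)
        return np.exp(-d2 / (2.0 * sigma ** 2))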
Scene-scene compatibility function, Ψ(x_i, x_j)
Assume the overlapped regions, d, of the hi-res. patches differ by Gaussian observation noise:
Ψ(x_i, x_j) ∝ exp( −|d_ij(x_i) − d_ji(x_j)|² / 2σ² )
where d_ij(x_i) denotes the pixels of patch x_i in its overlap with patch x_j. This is a uniqueness constraint, not a smoothness constraint: neighboring patches must agree where they overlap, not be smooth.
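A matching sketch, with the overlap bookkeeping left abstract (the overlap index arrays depend on the patch layout and are assumptions here):

    import numpy as np

    def psi(x_i, x_j, overlap_i, overlap_j, sigma=1.0):
        """Scene-scene compatibility: candidates at neighboring nodes are
        compatible when their overlapping pixels agree (Gaussian penalty).
        overlap_i / overlap_j index each patch's shared region."""
        d2 = np.sum((x_i[overlap_i] - x_j[overlap_j]) ** 2)
        return np.exp(-d2 / (2.0 * sigma ** 2))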
Form linking matrices between nodes
Linking matrix: Ψ(x_k, x_j) evaluated at the scene samples (rows: samples at node x_k; columns: samples at node x_j):

0.16 0.14 0.23 0.40 0.38
0.72 0.61 0.58 0.13 0.05
0.60 0.55 0.52 0.11 0.07
0.48 0.32 0.29 0.03 0.00
0.09 0.04 0.03 0.01 0.00

Local likelihoods are all 1 for the scene samples.
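A sketch of building one linking matrix, reusing the psi() sketch above over the candidate lists at two neighboring nodes:

    import numpy as np

    def linking_matrix(cands_k, cands_j, overlap_k, overlap_j, sigma=1.0):
        """Evaluate psi at every pair of candidate scene samples,
        giving an (n_k x n_j) matrix reused in every message update."""
        M = np.empty((len(cands_k), len(cands_j)))
        for a, xk in enumerate(cands_k):
            for b, xj in enumerate(cands_j):
                M[a, b] = psi(xk, xj, overlap_k, overlap_j, sigma)
        return M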
Markov network
[Figure: image patches linked to scene patches; Φ(x_i, y_i) and Ψ(x_i, x_j) as before.]
Derivation of belief propagation
[Figure: a three-node chain, scene nodes x1, x2, x3 with observations y1, y2, y3.]
The posterior factorizes
[Figure: the same three-node chain, with the joint probability split into its pairwise factors.]
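Spelling out what the figure shows, using the joint probability defined earlier. For the three-node chain:

x1_MAP = argmax_{x1} max_{x2} max_{x3} P(x1, x2, x3, y1, y2, y3)
P(x1, x2, x3, y1, y2, y3) ∝ Φ(x1, y1) Φ(x2, y2) Φ(x3, y3) Ψ(x1, x2) Ψ(x2, x3)

Because each factor touches at most two neighboring nodes, the maximizations can be pushed inward:

x1_MAP = argmax_{x1} Φ(x1, y1) max_{x2} Ψ(x1, x2) Φ(x2, y2) max_{x3} Ψ(x2, x3) Φ(x3, y3)

Each inner max is a message passed toward x1; this is the propagation rule on the next slide.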
Propagation rules
[Figure: messages passed along the chain, from x3 to x2 to x1.]
Belief, and message updates
b_i(x_i) ∝ Φ(x_i, y_i) ∏_{k∈N(i)} m_{k→i}(x_i)
m_{j→i}(x_i) = max_{x_j} Ψ(x_i, x_j) Φ(x_j, y_j) ∏_{k∈N(j)\i} m_{k→j}(x_j)
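A compact max-product sketch on a chain, where these updates are exact; function and variable names are my own, and the linking matrices are assumed to come from the sketch above:

    import numpy as np

    def max_product_chain(phis, links):
        """Max-product belief propagation on a chain.
        phis:  list of length n; phis[i][a] = Phi(candidate a at node i, y_i)
        links: list of length n-1; links[i][a, b] = Psi between candidate a
               at node i and candidate b at node i+1.
        Returns the MAP candidate index at each node."""
        n = len(phis)
        # Backward messages: m_bwd[i][a] maximizes over the right subchain.
        m_bwd = [np.ones_like(p) for p in phis]
        for i in range(n - 2, -1, -1):
            m_bwd[i] = np.max(links[i] * (phis[i + 1] * m_bwd[i + 1]), axis=1)
        # Forward messages: same update, run left to right.
        m_fwd = [np.ones_like(p) for p in phis]
        for i in range(1, n):
            m_fwd[i] = np.max(links[i - 1].T * (phis[i - 1] * m_fwd[i - 1]), axis=1)
        # Belief at each node: local evidence times incoming messages.
        return [int(np.argmax(phis[i] * m_fwd[i] * m_bwd[i])) for i in range(n)]

On the full 2-D patch grid the same message update is simply iterated until it settles (loopy belief propagation), as discussed below.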
Optimal solution in a chain or tree: Belief Propagation
• "Do the right thing" Bayesian algorithm.
• For Gaussian random variables over time: Kalman filter.
• For hidden Markov models: forward/backward algorithm (and the MAP variant is Viterbi).
No factorization with loops!
[Figure: the three-node network with an added edge Ψ(x1, x3), closing a loop.]
Justification for running belief propagation in networks with loops
• Experimental results:
• Error-correcting codes (Kschischang and Frey, 1998; McEliece et al., 1998)
• Vision applications (Freeman and Pasztor, 1999; Frey, 2000)
• Theoretical results:
• For Gaussian processes, means are correct (Weiss and Freeman, 1999).
• Large neighborhood local maximum for MAP (Weiss and Freeman, 2000).
• Equivalent to the Bethe approximation in statistical physics (Yedidia, Freeman, and Weiss, 2000).
VISTA: Vision by Image-Scene TrAining
[Figure: image patches linked to scene patches; Φ(x_i, y_i) and Ψ(x_i, x_j) as before.]
Super-resolution application
[Figure: the Markov network instantiated for super-resolution; image patches with Φ(x_i, y_i), scene patches with Ψ(x_i, x_j).]
Belief Propagation
After a few iterations of belief propagation, the algorithm selects spatially consistent high-resolution interpretations for each low-resolution patch of the input image.
[Figure panels: Input; Iter. 0; Iter. 1; Iter. 3.]
Zooming 2 octaves
We apply the super-resolution algorithm recursively, zooming up 2 powers of 2, or a factor of 4 in each dimension.
[Figure panels: 85 x 51 input; cubic spline zoom to 340x204; max. likelihood zoom to 340x204.]
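A sketch of the recursion, assuming a hypothetical super_resolve_one_octave() that wraps the full pipeline (preprocessing, candidate search, belief propagation) for a single 2x step:

    def zoom_two_octaves(image, super_resolve_one_octave):
        """Apply a one-octave (2x per dimension) super-resolution step
        twice, yielding a 4x zoom in each dimension.
        super_resolve_one_octave is a placeholder for the full pipeline."""
        once = super_resolve_one_octave(image)
        return super_resolve_one_octave(once)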
Generic training images
Next, train on a generic set of training images: photographs taken with the same camera as the test image, but of a random collection of subjects.
[Figure panels: original 70x70; cubic spline; Markov net, training: generic; true 280x280.]