320 likes | 433 Views
Multiscale Analysis of Images. Gilad Lerman Math 5467 (stealing slides from Gonzalez & Woods, and Efros). The Multiscale Nature of Images. Recall: Gaussian pre-filtering. G 1/8. G 1/4. Gaussian 1/2. Solution: filter the image, then subsample
E N D
Multiscale Analysis of Images Gilad Lerman Math 5467 (stealing slides from Gonzalez & Woods, and Efros)
Recall: Gaussian pre-filtering G 1/8 G 1/4 Gaussian 1/2 • Solution: filter the image, then subsample • Filter size should double for each ½ size reduction.
Subsampling with Gaussian pre-filtering Gaussian 1/2 G 1/4 G 1/8 • Solution: filter the image, then subsample • Filter size should double for each ½ size reduction.
Image Pyramids • Known as a Gaussian Pyramid [Burt and Adelson, 1983] • In computer graphics, a mip map [Williams, 1983] • A precursor to wavelet transform
A bar in the big images is a hair on the zebra’s nose; in smaller images, a stripe; in the smallest, the animal’s nose Figure from David Forsyth
Gaussian pyramid construction filter mask • Repeat • Filter • Subsample • Until minimum resolution reached • can specify desired number of levels (e.g., 3-level pyramid) • The whole pyramid is only 4/3 the size of the original image!
What does blurring take away? (recall) smoothed (5x5 Gaussian)
High-Pass filter smoothed – original
Band-pass filtering • Laplacian Pyramid (subband images) • Created from Gaussian pyramid by subtraction Gaussian Pyramid (low-pass images)
Laplacian Pyramid • How can we reconstruct (collapse) this pyramid into the original image? Need this! Original image
Image resampling (interpolation) • So far, we considered only power-of-two subsampling • What about arbitrary scale reduction? • How can we increase the size of the image? d = 1 in this example 1 2 3 4 5 • Recall how a digital image is formed • It is a discrete point-sampling of a continuous function • If we could somehow reconstruct the original function, any new image could be generated, at any resolution and scale
Image resampling • So far, we considered only power-of-two subsampling • What about arbitrary scale reduction? • How can we increase the size of the image? d = 1 in this example 1 2 3 4 5 • Recall how a digital image is formed • It is a discrete point-sampling of a continuous function • If we could somehow reconstruct the original function, any new image could be generated, at any resolution and scale
Answer: guess an approximation • Can be done in a principled way: filtering 1 d = 1 in this example 1 2 2.5 3 4 5 • Image reconstruction • Convert to a continuous function • Reconstruct by cross-correlation: Image resampling • So what to do if we don’t know
Resampling filters • What does the 2D version of this hat function look like? performs linear interpolation (tent function) performs bilinear interpolation • Better filters give better resampled images • Bicubic is common choice • Why not use a Gaussian? • What if we don’t want whole f, but just one sample?
Bilinear interpolation • Smapling at f(x,y):
Applications (more efficient with wavelets) • Coding • High frequency coefficients need fewer bits, so one can • collapse a compressed Laplacian pyramid • Denoising/Restoration • Setting “most” coefficients in Laplacian pyramid to zero • Image Blending
0 1 0 1 0 1 Application: Pyramid Blending Left pyramid blend Right pyramid
+ = 1 0 1 0 Feathering Ileft Iright right left Encoding transparency I(x,y) = (aR, aG, aB, a) Iblend = left Ileft + right Iright
0 1 0 1 Affect of Window Size left right
0 1 0 1 Affect of Window Size
0 1 Good Window Size “Optimal” Window: smooth but not ghosted Burt and Adelson (83): Choose by pyramids…
Laplacian Pyramid: Blending • General Approach: • Build Laplacian pyramids LA and LB from images A and B • Build a Gaussian pyramid GR from selected region R (black white corresponding images) • Form a combined pyramid LS from LA and LB using nodes of GR as weights: • LS(i,j) = GR(I,j,)*LA(I,j) + (1-GR(I,j))*LB(I,j) • Collapse the LS pyramid to get the final blended image
laplacian level 4 laplacian level 2 laplacian level 0 left pyramid right pyramid blended pyramid
Simplification: Two-band Blending • Brown & Lowe, 2003 • Only use two bands: high freq. and low freq. • Blends low freq. smoothly • Blend high freq. with no smoothing: use binary mask Can be explored in a project…