1.22k likes | 1.33k Views
Multiscale Geometric Image Processing. Besov Bayes Chomsky Plato. Richard Baraniuk Rice University dsp.rice.edu Joint work with Hyeokho Choi Justin Romberg Mike Wakin. Image Processing. Analyzing, modeling, processing images special class of 2-d functions ex: digital photos
E N D
Multiscale Geometric Image Processing Besov BayesChomsky Plato Richard Baraniuk Rice University dsp.rice.edu Joint work with Hyeokho ChoiJustin RombergMike Wakin
Image Processing • Analyzing, modeling, processingimages • special class of 2-d functions • ex: digital photos • Problems: • representation, approximation, compression, estimation, restoration, deconvolution, detection, classification, segmentation, …
Low-Level Image Structures • Prevalent image structures: • smooth regions - grayscale regularity • smooth edge contours - geometric regularity • Must exploit both for maximum performance
Lossy Image Compression • Given image approximate using bits • Error incurred = distortionExample: squared error
Rate-Distortion Analysis as • Asymptotic analysis; maximize rate achievable region
Transform Coding • Quantize coefficients of an image expansion coefficients basis, frame Ex: Fourier sinusoids DCT (JPEG) wavelets curvelets …
Transform Coding • Quantize coefficients of an image expansion coefficients basis, frame quantize to total bits
JPEG Compressed (DCT) PSNR = 26.32dB @ 0.0103bpp
Multiscale Image Analysis • Analyze an image a multiple scales • How? Zoom out and record information lost in wavelet coefficients … info 1 info 2 info 3
Wavelet-based Image Processing • Standard 2-D tensor product wavelet transform
Transform-domain Modeling • Transform-domain modeling and processing transform coefficient model
Transform-domain Modeling • Transform-domain modeling and processing transformvocabulary coefficient model
Transform-domain Modeling • Transform-domain modeling and processing • Vocabulary + grammar capture image structure • Challenging co-design problem transformvocabulary coefficient grammar model
Nonlinear Image Modeling • Natural images do not form a linear space! • Form of the “set of natural images”? + =
Set of Natural Images Small: NxN sampled images comprise an extremely small subset of RN2
Set of Natural Images Small: NxN sampled images comprise an extremely small subset of RN2
Set of Natural Images Small: NxN sampled images comprise an extremely small subset of RN2 Complicated: “Manifold” structure RN2
Wedge Manifold 3 Haar wavelets
Wedge Manifold 3 Haar wavelets
Set of Natural Images Small: NxN sampled images comprise an extremely small subset of RN2 Complicated: “Manifold” structure RN2
Stochastic Projections • Projection = “shadow”
Higher-D Stochastic Projections • Model (partial) wavelet coefficient joint statistics • Shadow more faithful to manifold
Wavelet-based Image Processing • Standard 2-D tensor product wavelet transform
Wavelet Coefficient Sparsity • Most
Wavelet Quadtree Structure • Wavelet analysis is self-similar • Image parent square divides into 4 children squares at next finer scale
Wavelet Quadtree Correlations • Smooth regionsmallwc’s tumble down tree • Edge/ridge regionlarge wc’s tumble down tree
Zero Tree Compression • Idea: Prune wavelet subtrees in smooth regions Z • tree-structured thresholding • spend bits on (large) “significant” wc’s • spend no bits on (small) wc’s… • zerotree symbolZ = {wc and all decendants = 0}
Zero Tree Compression • Label pruned wavelet quadtree with 2 stateszero-tree- smooth region (prune)significant- edge/texture region (keep) S smooth smooth Z Z S not smooth Z smooth S S S S
Zero Tree Compression • Label pruned wavelet quadtree with 2 stateszero-tree- smooth region (prune)significant- edge/texture region (keep) • Can optimize placement of Z and S with dynamic programming (SFQ) [Xiong, Ramchandran, Orchard] S smooth smooth Z Z S not smooth Z smooth S S S S
JPEG Compressed (DCT) PSNR = 26.32dB @ 0.0103bpp
JPEG2000 Compressed (Wavelets) PSNR = 29.77dB @ 0.0103bpp
SFQ Compressed (Wavelets) PSNR = 29.77dB @ 0.0103bpp
SFQ WC Code Map green = significantblue = zero tree
The JPEG2000 Disappointment • JPEG2000: replace DCT with wavelet transform • Does not improve on decay rate of JPEG! • WHY?neither DCT norwavelets are theright transform JPEG (DCT) JPEG2000 (wavelets)
Wavelet Challenges • Geometrical info not explicit • Inefficient-large number of large wc’s cluster around edge contours, no matter how smooth
Wavelets and Cartoons 13 26 52 • Even for a smooth C2 contour, which straightens at fine scales…
2-D Wavelets: Poor Approximation 13 26 52 • Even for a smooth C2 contour, which straightens at fine scales… • Too many wavelets required! -term wavelet approximation not
2-D Wavelets: Poor Approximation 13 26 52 • Even for a smooth C2 contour, which straightens at fine scales… • Too many wavelets required! -bit wavelet approximation not
Solution 1: Upgrade the Transform 13 26 52 • Introduce anisotropic transform • curvelets, ridgelets, contourlets, … • Optimal error decay rates for some simple images
Solution 2: Upgrade the Processing 13 26 52 • Replace coefficient thresholding by a new wc modelthat captures anisotropicspatial correlations
Our Goal: New Image Models • Wavelet coefficient models that capture both geometric and textural aspects of natural images • Optimal error decay rates for some image class
Geometry Model for Cartoons C2 boundary • Toy model: flat regions separated by smooth contours • Goal: representation that is • sparse • simply modeled • efficiently computed • extensible to texture regions
2-D Dyadic Partition • Multiscale analysis • Partition; not a basis/frame • Zoom in by factor of 2 each scale