1 / 67

Recap from Friday

Recap from a CS195g Computational Photography lecture discussing Fourier Transform in image processing, filtering techniques, convolution, and boundary conditions. Learn about frequency domain, Fourier series, filtering types, Laplacian Pyramid, image gradient, and DCT in JPEG compression.

selmac
Download Presentation

Recap from Friday

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Recap from Friday linear Filtering convolution differential filters filter types boundary conditions.

  2. The Frequency Domain Somewhere in Cinque Terre, May 2005 CS195g: Computational Photography James Hays, Brown, Spring 2010 Slides from Steve Seitzand Alexei Efros

  3. Salvador Dali “Gala Contemplating the Mediterranean Sea, which at 30 meters becomes the portrait of Abraham Lincoln”, 1976 Salvador Dali, “Gala Contemplating the Mediterranean Sea, which at 30 meters becomes the portrait of Abraham Lincoln”, 1976 Salvador Dali, “Gala Contemplating the Mediterranean Sea, which at 30 meters becomes the portrait of Abraham Lincoln”, 1976

  4. A nice set of basis Teases away fast vs. slow changes in the image. This change of basis has a special name…

  5. Jean Baptiste Joseph Fourier (1768-1830) • had crazy idea (1807): • Any periodic function can be rewritten as a weighted sum of sines and cosines of different frequencies. • Don’t believe it? • Neither did Lagrange, Laplace, Poisson and other big wigs • Not translated into English until 1878! • But it’s true! • called Fourier Series

  6. A sum of sines • Our building block: • Add enough of them to get any signal f(x) you want! • How many degrees of freedom? • What does each control? • Which one encodes the coarse vs. fine structure of the signal?

  7. Inverse Fourier Transform Fourier Transform F(w) f(x) F(w) f(x) Fourier Transform • We want to understand the frequency w of our signal. So, let’s reparametrize the signal by w instead of x: • For every w from 0 to inf, F(w) holds the amplitude A and phase f of the corresponding sine • How can F hold both? Complex number trick! We can always go back:

  8. Time and Frequency • example : g(t) = sin(2pf t) + (1/3)sin(2p(3f) t)

  9. Time and Frequency • example : g(t) = sin(2pf t) + (1/3)sin(2p(3f) t) = +

  10. Frequency Spectra • example : g(t) = sin(2pf t) + (1/3)sin(2p(3f) t) = +

  11. Frequency Spectra • Usually, frequency is more interesting than the phase

  12. Frequency Spectra = + =

  13. Frequency Spectra = + =

  14. Frequency Spectra = + =

  15. Frequency Spectra = + =

  16. Frequency Spectra = + =

  17. Frequency Spectra =

  18. Frequency Spectra

  19. Extension to 2D in Matlab, check out: imagesc(log(abs(fftshift(fft2(im)))));

  20. Man-made Scene

  21. Can change spectrum, then reconstruct

  22. Low and High Pass filtering

  23. The greatest thing since sliced (banana) bread! The Fourier transform of the convolution of two functions is the product of their Fourier transforms The inverse Fourier transform of the product of two Fourier transforms is the convolution of the two inverse Fourier transforms Convolution in spatial domain is equivalent to multiplication in frequency domain! The Convolution Theorem

  24. 2D convolution theorem example |F(sx,sy)| f(x,y) * h(x,y) |H(sx,sy)| g(x,y) |G(sx,sy)|

  25. Fourier Transform pairs

  26. Low-pass, Band-pass, High-pass filters low-pass: High-pass / band-pass:

  27. Edges in images

  28. What does blurring take away? original

  29. What does blurring take away? smoothed (5x5 Gaussian)

  30. High-Pass filter smoothed – original

  31. Band-pass filtering • Laplacian Pyramid (subband images) • Created from Gaussian pyramid by subtraction Gaussian Pyramid (low-pass images)

  32. Laplacian Pyramid • How can we reconstruct (collapse) this pyramid into the original image? Need this! Original image

  33. Why Laplacian? Gaussian Laplacian of Gaussian delta function

  34. - = + a = Unsharp Masking

  35. The gradient direction is given by: • how does this relate to the direction of the edge? • The edge strength is given by the gradient magnitude Image gradient • The gradient of an image: • The gradient points in the direction of most rapid change in intensity

  36. Effects of noise • Consider a single row or column of the image • Plotting intensity as a function of position gives a signal How to compute a derivative? Where is the edge?

  37. Look for peaks in Solution: smooth first Where is the edge?

  38. Derivative theorem of convolution • This saves us one operation:

  39. Laplacian of Gaussian • Consider Laplacian of Gaussian operator Where is the edge? Zero-crossings of bottom graph

  40. Laplacian of Gaussian is the Laplacian operator: 2D edge detection filters Gaussian derivative of Gaussian

  41. Try this in MATLAB • g = fspecial('gaussian',15,2); • imagesc(g); colormap(gray); • surfl(g) • gclown = conv2(clown,g,'same'); • imagesc(conv2(clown,[-1 1],'same')); • imagesc(conv2(gclown,[-1 1],'same')); • dx = conv2(g,[-1 1],'same'); • imagesc(conv2(clown,dx,'same')); • lg = fspecial('log',15,2); • lclown = conv2(clown,lg,'same'); • imagesc(lclown) • imagesc(clown + .2*lclown)

  42. Campbell-Robson contrast sensitivity curve

  43. Depends on Color R G B

  44. Lossy Image Compression (JPEG) Block-based Discrete Cosine Transform (DCT)

  45. Using DCT in JPEG • The first coefficient B(0,0) is the DC component, the average intensity • The top-left coeffs represent low frequencies, the bottom right – high frequencies

  46. Image compression using DCT • DCT enables image compression by concentrating most image information in the low frequencies • Lose unimportant image info (high frequencies) by cutting B(u,v) at bottom right • The decoder computes the inverse DCT – IDCT • Quantization Table • 3 5 7 9 11 13 15 17 • 5 7 9 11 13 15 17 19 • 7 9 11 13 15 17 19 21 • 9 11 13 15 17 19 21 23 • 11 13 15 17 19 21 23 25 • 13 15 17 19 21 23 25 27 • 15 17 19 21 23 25 27 29 • 17 19 21 23 25 27 29 31

  47. Block size in JPEG • Block size • small block • faster • correlation exists between neighboring pixels • large block • better compression in smooth regions • It’s 8x8 in standard JPEG

  48. JPEG compression comparison 89k 12k

More Related