CS 636 Computer Vision Dense Motion Estimation Nathan Jacobs
overview • any issues or questions? • review from last time • dense motion estimation • relationship to feature-based • today: simple motion models • next time: more complex motion models
Motion and perceptual organization • Sometimes, motion is the only cue Lazebnik
Motion and perceptual organization • Even “impoverished” motion data can evoke a strong percept G. Johansson, “Visual Perception of Biological Motion and a Model For Its Analysis", Perception and Psychophysics 14, 201-211, 1973. Lazebnik
Motion Estimation • Today’s Readings • Szeliski Chapters 7.1, 7.2, 7.4 • Human motion estimation • http://www.sandlotscience.com/Ambiguous/Barberpole_Illusion.htm • http://www.sandlotscience.com/Distortions/Breathing_Square.htm Most slides by Steve Seitz
Why estimate motion? • Lots of uses • Track object behavior • Correct for camera jitter (stabilization) • Align images (mosaics) • 3D shape reconstruction • Special effects 1998 Oscar for Visual Effects http://www.youtube.com/watch?v=au0ibcWUxPI
review from last time… • Discrete/Feature-based Motion Estimation • Least-squares • Robust Estimators • RANSAC, Hough Transform • Calibration
What is the difference? feature-based vs. dense (simple translational motion)
feature-based vs. dense • both pipelines share the same steps: extract features → minimize residual → optimal motion estimate
Motion estimation • Input: sequence of images • Output: point correspondence • Feature correspondence: “Feature Tracking” • we’ve seen this already (e.g., SIFT) • can modify this to be more accurate/efficient if the images are in sequence (e.g., video) • Pixel (dense) correspondence: “Optical Flow” • today’s lecture
Motion field and parallax • X(t) is a moving 3D point • Velocity of scene point: V = dX/dt • x(t) = (x(t), y(t)) is the projection of X in the image • Apparent velocity v in the image is given by components vx = dx/dt and vy = dy/dt • These components are known as the motion field of the image Lazebnik
Motion field and parallax • To find image velocity v, differentiate x = (x, y) with respect to t (using the quotient rule) • Image motion is a function of both the 3D motion (V) and the depth of the 3D point (Z) Lazebnik
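The equation this slide refers to was an image lost in the export; the following is a reconstruction from the standard perspective-projection derivation (f is the focal length, V_z the component of V along the optical axis), not the original graphic:

```latex
% Perspective projection of the 3D point X = (X, Y, Z):
\mathbf{x} = f\,\frac{\mathbf{X}}{Z}
% Differentiating with the quotient rule gives the motion field:
\mathbf{v} = \frac{d\mathbf{x}}{dt}
           = f\,\frac{Z\,\mathbf{V} - V_z\,\mathbf{X}}{Z^{2}}
           = \frac{f\,\mathbf{V} - V_z\,\mathbf{x}}{Z}
```

The last form makes the slide's point explicit: image velocity depends on both the 3D velocity V and the depth Z.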
Motion field and parallax • Pure translation: V is constant everywhere • The length of the motion vectors is inversely proportional to the depth Z • If Vz is nonzero: every motion vector points toward (or away from) the vanishing point of the translation direction • If Vz is zero: motion is parallel to the image plane, and all the motion vectors are parallel Lazebnik
Optical flow • Definition: optical flow is the apparent motion of brightness patterns in the image • Ideally, optical flow would be the same as the motion field • Have to be careful: apparent motion can be caused by lighting changes without any actual motion • Think of a uniform rotating sphere under fixed lighting vs. a stationary sphere under moving illumination Lazebnik
Problem definition: optical flow • How to estimate pixel motion from image H to image I? • Solve pixel correspondence problem • given a pixel in H, look for nearby pixels of the same color in I • Key assumptions • color constancy: a point in H looks the same in I • for grayscale images, this is brightness constancy • small motion: points do not move very far • This is called the optical flow problem
Optical flow constraints (grayscale images) • Let's look at these constraints more closely • brightness constancy: H(x, y) = I(x + u, y + v) • small motion: u and v are less than 1 pixel • suppose we take the Taylor series expansion of I: I(x + u, y + v) ≈ I(x, y) + Ix u + Iy v + higher-order terms
Optical flow equation • Combining these two equations: 0 = I(x + u, y + v) − H(x, y) ≈ I(x, y) + Ix u + Iy v − H(x, y) = It + Ix u + Iy v = It + ∇I · [u v]T • In the limit as u and v go to zero, this becomes exact
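The linearized constraint can be sanity-checked numerically on a synthetic translating pattern. This is a sketch: the image function and the sub-pixel shift (0.3, 0.2) are made up for illustration.

```python
import numpy as np

# Check that Ix*u + Iy*v + It ~ 0 holds for a smooth pattern translated
# by a sub-pixel amount (the small-motion regime of the derivation).
def make_image(shift_x=0.0, shift_y=0.0, size=64):
    ys, xs = np.mgrid[0:size, 0:size].astype(float)
    return np.sin(0.2 * (xs - shift_x)) + np.cos(0.15 * (ys - shift_y))

u, v = 0.3, 0.2
H = make_image()                      # frame at time t
I = make_image(shift_x=u, shift_y=v)  # frame at t+1, content moved by (u, v)

Ix = np.gradient(H, axis=1)           # spatial gradients (central differences)
Iy = np.gradient(H, axis=0)
It = I - H                            # temporal difference

# Away from the image borders the residual is tiny, as the equation predicts.
residual = Ix * u + Iy * v + It
```

For larger motions the higher-order Taylor terms return and the residual grows, which is what motivates the iterative and coarse-to-fine schemes later in the lecture.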
The brightness constancy constraint • How many equations and unknowns per pixel? • One equation, two unknowns (u, v) • Intuitively, what does this constraint mean? • The component of the flow perpendicular to the gradient (i.e., parallel to the edge) is unknown • If (u, v) satisfies the equation, so does (u + u', v + v') whenever (u', v') is perpendicular to the gradient, i.e., ∇I · [u' v']T = 0
The barber pole illusion http://en.wikipedia.org/wiki/Barberpole_illusion
Solving the aperture problem • Basic idea: assume motion field is smooth • Horn & Schunck: add smoothness term • Lucas & Kanade: assume locally constant motion • pretend the pixel's neighbors have the same (u, v) • Many other methods exist. Here's an overview: • S. Baker, M. Black, J. P. Lewis, S. Roth, D. Scharstein, and R. Szeliski. A database and evaluation methodology for optical flow. In Proc. ICCV, 2007 • http://vision.middlebury.edu/flow/
Lucas-Kanade flow • How to get more equations for a pixel? • Basic idea: impose additional constraints • most common is to assume that the flow field is smooth locally • one method: pretend the pixel’s neighbors have the same (u,v) • If we use a 5x5 window, that gives us 25 equations per pixel!
RGB version • How to get more equations for a pixel? • Basic idea: impose additional constraints • most common is to assume that the flow field is smooth locally • one method: pretend the pixel’s neighbors have the same (u,v) • If we use a 5x5 window, that gives us 25*3 equations per pixel!
Lucas-Kanade flow • Problem: we have more equations than unknowns • Solution: solve a least squares problem • minimum least squares solution given by the solution (in d = [u v]T) of the normal equations (ATA) d = ATb, where each row of A is [Ix Iy] and b stacks −It • The summations in ATA and ATb are over all pixels in the K x K window • This technique was first proposed by Lucas & Kanade (1981)
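A minimal single-window version of this solve can be sketched as follows, assuming the gradient images are already computed; the function name, window location, and synthetic test pattern are illustrative, not from the lecture.

```python
import numpy as np

# 25 linearized constraints from a 5x5 window, solved in least squares.
def lucas_kanade_window(Ix, Iy, It, r, c, half=2):
    win = np.s_[r - half:r + half + 1, c - half:c + half + 1]
    A = np.stack([Ix[win].ravel(), Iy[win].ravel()], axis=1)  # 25 x 2
    b = -It[win].ravel()                                      # 25 values
    # lstsq solves the normal equations (A^T A) d = A^T b internally
    d, *_ = np.linalg.lstsq(A, b, rcond=None)
    return d                                                  # d = (u, v)

# Synthetic image pair with true motion (0.3, 0.2) pixels.
ys, xs = np.mgrid[0:32, 0:32].astype(float)
H = np.sin(0.3 * xs) * np.cos(0.25 * ys)
I = np.sin(0.3 * (xs - 0.3)) * np.cos(0.25 * (ys - 0.2))

Ix = np.gradient(H, axis=1)
Iy = np.gradient(H, axis=0)
It = I - H

u, v = lucas_kanade_window(Ix, Iy, It, 16, 16)
```

Running this over every pixel (one window per pixel) yields a dense flow field; the per-window 2x2 system is what makes the method cheap.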
Conditions for solvability • Optimal (u, v) satisfies the Lucas-Kanade equation (ATA) [u v]T = ATb • When is this solvable? • ATA should be invertible • ATA should not be too small due to noise • eigenvalues λ1 and λ2 of ATA should not be too small • ATA should be well-conditioned • λ1/λ2 should not be too large (λ1 = larger eigenvalue) • Does this look familiar? • ATA is the Harris matrix
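The eigenvalue test above can be written out directly: build the 2x2 matrix ATA (the Harris / structure-tensor matrix) for a window and check its eigenvalues. The threshold values here are illustrative placeholders, not numbers from the lecture.

```python
import numpy as np

def structure_tensor(Ix_win, Iy_win):
    """The 2x2 matrix A^T A accumulated over one window."""
    return np.array([[np.sum(Ix_win ** 2),     np.sum(Ix_win * Iy_win)],
                     [np.sum(Ix_win * Iy_win), np.sum(Iy_win ** 2)]])

def trackable(M, min_eig=1e-2, max_cond=1e3):
    """Both eigenvalues large enough, and their ratio not too large."""
    lam_small, lam_large = np.linalg.eigvalsh(M)  # ascending order
    return lam_small > min_eig and lam_large / max(lam_small, 1e-12) < max_cond

flat = np.zeros((5, 5))                           # textureless window
ys, xs = np.mgrid[0:5, 0:5].astype(float)
textured = np.sin(1.3 * xs) * np.sin(1.1 * ys)    # 2D texture

M_flat = structure_tensor(np.gradient(flat, axis=1), np.gradient(flat, axis=0))
M_tex = structure_tensor(np.gradient(textured, axis=1),
                         np.gradient(textured, axis=0))
```

This is the same computation a corner detector performs, which is the "does this look familiar?" point: good features to track are exactly the windows where Lucas-Kanade is well-posed.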
Observation • This is a two image problem BUT • Can measure sensitivity by just looking at one of the images! • This tells us which pixels are easy to track, which are hard • very useful for feature tracking... Szeliski
Errors in Lucas-Kanade What are the potential causes of errors in this procedure? • Suppose ATA is easily invertible • Suppose there is not much noise in the image • When our assumptions are violated • Brightness constancy is not satisfied • The motion is not small • A point does not move like its neighbors • window size is too large • what is the ideal window size?
Improving accuracy • Recall our small motion assumption: 0 ≈ It + Ix u + Iy v • This is not exact • To do better, we need to add the higher-order terms of the Taylor expansion back in: 0 = It + Ix u + Iy v + higher-order terms • This is a polynomial root finding problem • Can solve using Newton's method • The Lucas-Kanade method does one iteration of Newton's method • Better results are obtained via more iterations
Iterative Refinement • Iterative Lucas-Kanade Algorithm • Estimate velocity at each pixel by solving Lucas-Kanade equations • Warp H towards I using the estimated flow field • - use image warping techniques • Repeat until convergence
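The loop above can be sketched in a deliberately simplified setting: a single global translation instead of per-pixel flow, and frames generated from an analytic pattern so that "warp H toward I" can be re-evaluated exactly rather than interpolated. The pattern and motion values are made up; real implementations use image warping here.

```python
import numpy as np

def frame(dx=0.0, dy=0.0, size=48):
    # Analytic pattern; evaluating at shifted coordinates acts as the warp.
    ys, xs = np.mgrid[0:size, 0:size].astype(float)
    return np.sin(0.4 * (xs - dx)) * np.cos(0.35 * (ys - dy))

true_u, true_v = 1.2, 0.8                 # motion larger than one pixel
H, I = frame(), frame(true_u, true_v)

u = v = 0.0
for _ in range(10):
    Hw = frame(u, v)                      # warp H toward I by current estimate
    Ix = np.gradient(Hw, axis=1)
    Iy = np.gradient(Hw, axis=0)
    It = I - Hw
    # One Lucas-Kanade (Newton) step on the re-linearized constraint.
    A = np.stack([Ix.ravel(), Iy.ravel()], axis=1)
    du, dv = np.linalg.lstsq(A, -It.ravel(), rcond=None)[0]
    u, v = u + du, v + dv                 # accumulate the increment
```

Each pass warps by the current estimate and re-linearizes, so the residual motion the solver sees shrinks toward the small-motion regime where the linearization is valid.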
Revisiting the small motion assumption • Is this motion small enough? • Probably not—it’s much larger than one pixel (2nd order terms dominate) • How might we solve this problem?
Coarse-to-fine optical flow estimation • build Gaussian pyramids of image H and image I • apparent motion shrinks at coarser levels: u = 10 pixels at full resolution becomes u = 5, 2.5, and 1.25 pixels up the pyramid
Coarse-to-fine optical flow estimation • run iterative L-K at the coarsest pyramid level • warp & upsample the estimated flow to the next finer level and run iterative L-K again • repeat down the Gaussian pyramids of H and I
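A two-level version of this scheme, again simplified to one global translation, might look as follows. The pattern, pyramid depth, border trim, and iteration count are all illustrative choices, not the lecture's.

```python
import numpy as np

def bilinear_shift(img, u, v):
    """Sample img at (x - u, y - v) with bilinear interpolation (clamped)."""
    h, w = img.shape
    ys, xs = np.mgrid[0:h, 0:w].astype(float)
    x = np.clip(xs - u, 0, w - 1.001)
    y = np.clip(ys - v, 0, h - 1.001)
    x0, y0 = np.floor(x).astype(int), np.floor(y).astype(int)
    fx, fy = x - x0, y - y0
    return ((1 - fx) * (1 - fy) * img[y0, x0] + fx * (1 - fy) * img[y0, x0 + 1]
            + (1 - fx) * fy * img[y0 + 1, x0] + fx * fy * img[y0 + 1, x0 + 1])

def refine(H, I, u, v, iters=8):
    """Iterative L-K for one global (u, v), ignoring the warped border."""
    s = np.s_[4:-4, 4:-4]
    for _ in range(iters):
        Hw = bilinear_shift(H, u, v)
        Ix = np.gradient(Hw, axis=1)
        Iy = np.gradient(Hw, axis=0)
        It = I - Hw
        A = np.stack([Ix[s].ravel(), Iy[s].ravel()], axis=1)
        du, dv = np.linalg.lstsq(A, -It[s].ravel(), rcond=None)[0]
        u, v = u + du, v + dv
    return u, v

def downsample(img):
    """One pyramid level: 2x2 block average (a crude Gaussian stand-in)."""
    return 0.25 * (img[0::2, 0::2] + img[1::2, 0::2]
                   + img[0::2, 1::2] + img[1::2, 1::2])

ys, xs = np.mgrid[0:64, 0:64].astype(float)
H = np.sin(0.3 * xs) * np.cos(0.25 * ys)
I = np.sin(0.3 * (xs - 3.0)) * np.cos(0.25 * (ys - 2.0))  # (3, 2) pixel shift

u, v = refine(downsample(H), downsample(I), 0.0, 0.0)  # coarse: motion halved
u, v = refine(H, I, 2 * u, 2 * v)          # upsample estimate, refine at full res
```

The key step is the hand-off: the flow estimated at the coarse level is doubled (because the pixel grid is twice as fine) and used to initialize the next level, so each level only has to recover a small residual motion.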
Robust methods • L-K minimizes a sum-of-squares error metric • least squares techniques are overly sensitive to outliers • robust error metrics: quadratic, truncated quadratic, Lorentzian
Robust optical flow • Robust Horn & Schunck • Robust Lucas-Kanade • example results: first image, quadratic flow, Lorentzian flow, detected outliers • Reference: Black, M. J. and Anandan, P., "A framework for the robust estimation of optical flow," Fourth International Conf. on Computer Vision (ICCV), 1993, pp. 231-236 http://www.cs.washington.edu/education/courses/576/03sp/readings/black93.pdf
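The three error metrics named above can be written out as penalty functions; the scale parameters below are illustrative. The point of the comparison: a quadratic penalty lets a single outlier dominate the total cost, while the truncated quadratic and the Lorentzian level off and bound an outlier's influence.

```python
import numpy as np

def quadratic(r):
    return r ** 2

def truncated_quadratic(r, t=1.0):
    # Quadratic up to threshold t, constant beyond it.
    return np.minimum(r ** 2, t ** 2)

def lorentzian(r, sigma=1.0):
    # Black & Anandan's Lorentzian: log(1 + (r/sigma)^2 / 2).
    return np.log1p(0.5 * (r / sigma) ** 2)

residuals = np.array([0.1, -0.2, 0.05, 8.0])   # last entry is an outlier
costs = {f.__name__: float(f(residuals).sum())
         for f in (quadratic, truncated_quadratic, lorentzian)}
```

Under the quadratic penalty the outlier contributes 64 of the total cost; under the other two it contributes at most t² (here 1) or a slowly growing log term, which is why the robust flow fields in the slide's example are cleaner near motion boundaries.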
Benchmarking optical flow algorithms • Middlebury flow page • http://vision.middlebury.edu/flow/
Discussion: features vs. flow? • Features are better for: • Flow is better for:
Advanced topics • Particles: combining features and flow • Peter Sand et al. • http://rvsn.csail.mit.edu/pv/ • State-of-the-art feature tracking/SLAM • Georg Klein et al. • http://www.robots.ox.ac.uk/~gk/
State-of-the-art Optical Flow in Matlab • Median filtering with spatially varying weights • Edge consistency • Robust error http://www.cs.brown.edu/~black/code.html http://people.csail.mit.edu/celiu/OpticalFlow/
for next time • parametric motion models • layered motion models