340 likes | 401 Views
Activity and Motion Detection in Videos. Longin Jan Latecki and Roland Miezianko, Temple University Dragoljub Pokrajac, Delaware State University. Dover , August 2005. Definition of Motion Detection. Action of sensing physical movement in a give area
E N D
Activity and Motion Detection in Videos Longin Jan Latecki and Roland Miezianko, Temple University Dragoljub Pokrajac, Delaware State University Dover, August 2005
Definition of Motion Detection • Action of sensing physical movement in a give area • Motion can be detected by measuring change in speed or vector of an object
Motion Detection Goals of motion detection • Identify moving objects • Detection of unusual activity patterns • Computing trajectories of moving objects Applications of motion detection • Indoor/outdoor security • Real time crime detection • Traffic monitoring Many intelligent video analysis systems are based on motion detection.
Two Approaches to Motion Detection • Optical Flow • Compute motion within region or the frame as a whole • Change detection • Detect objects within a scene • Track object across a number of frames
Background Subtraction • Uses a reference background image for comparison purposes. • Current image (containing target object) is compared to reference image pixel by pixel. • Places where there are differences are detected and classified as moving objects. Motivation: simple difference of two images shows moving objects
b. Same scene later a. Original scene Subtraction of scene a from scene b Subtracted image with threshold of 100
Model the background and subtract to obtain object mask Filter to remove noise Group adjacent pixels to obtain objects Track objects between frames to develop trajectories Static Scene Object Detection and Tracking
Approaches to Background Modeling • Background Subtraction • Statistical Methods(e.g., Gaussian Mixture Model,Stauffer and Grimson 2000) Background Subtraction: • Construct a background image B as average of few images • For each actual frame I, classify individual pixels as foreground if |B-I| > T (threshold) • Clean noisy pixels
Background Subtraction Background Image Current Image
Statistical Methods • Pixel statistics: average and standard deviation of color and gray level values (e.g., W4 by Haritaoglu, Harwood, and Davis 2000) • Gaussian Mixture Model(e.g., Stauffer and Grimson 2000)
Gaussian Mixture Model • Model the color values of a particular pixel as a mixture of Gaussians • Multiple adaptive Gaussians are necessary to cope with acquisition noise, lighting changes, etc. • Pixel values that do not fit the background distributions (Mahalanobis distance) are considered foreground
Gaussian Mixture Model Block 44x42 Pixel 172x165 R-G Distribution R-G-B Distribution
Proposed ApproachMeasuring Texture Change • Classical approaches to motion detection are based on background subtraction, i.e., a model of background image is computed, e.g., Stauffer and Grimson (2000) • Our approach does not model any background image. • We estimate the speed of texture change.
In our system we divide video plane in disjoint blocks (4x4 pixels), and compute motion measure for each block. mm(x,y,t) for a given block location (x,y) is a function of t
Block size relative to image size Block 24x28 1728 blocks per frame Image Size: 36x48 blocks
Motion Measure Computation • We use spatial-temporal blocks to represent videos • Each block consists of NBLOCK x NBLOCK pixels from 3 consecutive frames • Those pixel values are reduced to K principal components using PCA (Kahrunen-Loeve trans.) • In our applications, NBLOCK=4, K=10 • Thus, we project 48 gray level values to a texture vector with 10 PCA components
t+1 tt-1 4*4*3 spatial-temporal block Location I=24, J=28, time t-1, t, t+1 48-component block vector (4*4*3) 10 principal components -0.5221 -0.0624 -0.1734 -0.2221 -0.2621 -0.4739 -0.4201 -0.4224 -0.0734 -0.1386 Motion Measure Computation 3D Block Projection with PCA (Kahrunen-Loeve trans.)
Texture of spatiotemporal blocks works better than color pixel values • More robust • Faster We illustrate this with texture trajectories.
499 624 863 1477
Trajectory of block (24,8) (Campus 1 video) Moving blocks corresponds to regions of high local variance, i.e., higher spread Space of spatiotemporal block vectors
Comparison to the trajectory of a pixel inside block (24,8) Campus 1 video block I=24, J=28 Standardized PCA components of RGB pixel values at pixel location (185,217) that is inside of block (24,28).
Detection of Moving Objects Based on Local Variation For each block location (x,y) in the video plane • Consider texture vectors in a symmetric window [t-W, t+W] at time t • Compute the covariance matrix • Motion measureis defined as the largest eigenvalue of the covariance matrix
Feature Vectors in Space Feature vectors 4.2000 3.5000 2.6000 4.1000 3.7000 2.8000 3.9000 3.9000 2.9000 4.0000 4.0000 3.0000 4.1000 3.9000 2.8000 4.2000 3.8000 2.7000 4.3000 3.7000 2.6500 Covariance matrix Current time 0.0089 -0.0120 -0.0096 -0.0120 0.0299 0.0201 -0.0096 0.0201 0.0157 Motion Measure Eigenvalues 0.0499 0.00350.0011 0.0499
Feature Vectors in Space Feature vectors 4.3000 3.7000 2.6500 4.4191 3.5944 2.4329 4.1798 3.8415 2.6441 4.2980 3.6195 2.5489 4.2843 3.7529 2.7114 4.1396 3.7219 2.7008 4.3257 3.6078 2.8192 Covariance matrix 0.0087 -0.0063 -0.0051 -0.0063 0.0081 0.0031 -0.0051 0.0031 0.0154 Current time Motion Measure Eigenvalues 0.02090.00930.0020 0.0209
Graph of motion measuremm(40,66) of Sub_IR_2 video Motion Measure Detected Motion
(1) (2) (3) (4) (5) Detect Outlier Switch to a nominal state Update the estimates of mean and standard deviation only when the outliers are not detected Dynamic Distribution Learning and Outlier Detection