Xiuwen Liu Department of Computer Science Florida State University

Research Activities at Center for Applied Vision and Imaging Sciences andFlorida State Vision GroupFlorida State University Xiuwen Liu Department of Computer Science Florida State University http://cavis.fsu.edu & http://fsvision.fsu.edu

Research Statement • My research goal is to create machines that can “see” with similar human performance • This seems a trivial problem as each of us can do this without any effort • Computer + Camera = “A See Machine” ?

Visual Pathway

Visual Illusion

Outline • Motivations • Some applications of computer vision and pattern recognition techniques • Some of the research projects • Related Courses • Contact information

Computer Vision Applications • No hands across America • Sponsored by Delco Electronics, AssistWare Technology, and Carnegie Mellon University • Navlab 5 drove from Pittsburgh, PA to San Diego, CA, using the RALPHcomputer program. • The trip was 2849 miles of which 2797 miles were driven automatically with no hands • Which is 98.2%

Computer Vision Applications– continued

Human-Computer Interactions

Sign Language Recognition

CyberKnife

CyberKnife – Cont.

Image-Guided Neurosurgery

Intelligent Transportation Systems http://dfwtraffic.dot.state.tx.us/dal-cam-nf.asp

Computer Vision Applications – cont. • Military applications • Automated target recognition

Computer Vision Applications– continued

Biometrics – cont. Iris code can achieve zero false acceptance

Computer Vision in Sports • How was the yellow created?

How can we characterize all these images perceptually? Generic Image Modeling

Spectral Histogram Representation • Spectral histogram • Given a bank of filters F(a), a = 1, …, K, a spectral histogram is defined as the marginal distribution of filter responses

LoG filter Gabor filter Spectral Histogram Representation - continued • Choice of filters • Laplacian of Gaussian filters • Gabor filters • Gradient filters • Intensity filter

Spectral Histogram Representation - continued

Texture Synthesis Examples - continued • An image with periodic structures Observed image Synthesized image

Object Synthesis Examples - continued

Performance Comparison

Face Detection Based On Spectral Representations • Face detection is to detect all instances of faces in a given image • Each image window is represented by its spectral histogram • A support vector machine is trained on training faces • Then the trained support vector machine is used to classify each image window in an input image • More results athttp://fsvision.fsu.edu/face-detection

Face detection - continued

Rotation Invariant Face Detection

Rotation Invariant Face Detection - continued

Linear Representations • Linear representations are widely used in appearance-based object recognition and other applications • Simple to implement and analyze • Efficient to compute • Effective for many applications

Standard Linear Representations • Principal Component Analysis • Designed to minimize the reconstruction error on the training set • Obtained by calculating eigenvectors of the co-variance matrix • Fisher Discriminant Analysis • Designed to maximize the separation between means of each class • Obtained by solving a generalized eigen problem • Independent Component Analysis • Designed to maximize the statistical independence among coefficients along different directions • Obtained by solving an optimization problem with some object function such as mutual information, negentropy, ....

Standard Linear Representations - continued • Standard linear representations are sub optimal for recognition applications • Evidence in the literature • A toy example • Standard representations give the worst recognition performance • Optimal component analysis

Performance Measure - continued • Suppose there are C classes to be recognized • Each class has ktrain training images • It has kcross cross validation images • We used h(x) = 1/(1+exp(-2bx)

Performance Measure - continued • F(U) depends on the span of U but is invariant to change of basis • In other words, F(U)=F(UO) for any orthonormal matrix O • The search space of F(U) is the set of all the subspaces, which is known as the Grassmann manifold • It is not a flat vector space and gradient flow must take the underlying geometry of the manifold into account

Deterministic Gradient Flow - continued • Gradient at [J] (first d columns of n x n identity matrix)

Deterministic Gradient Flow - continued • Gradient at U: Compute Q such that QU=J • Deterministic gradient flow on Grassmann manifold

Stochastic Gradient and Updating Rules • Stochastic gradient is obtained by adding a stochastic component • Discrete updating rules

MCMC Simulated Annealing Optimization Algorithm • Let X(0) be any initial condition and t=0 • Calculate the gradient matrix A(Xt) • Generate d(n-d) independent realizations of wij’s • Compute Y (Xt+1) according to the updating rules • Compute F(Y) and F(Xt) and set dF=F(Y)- F(Xt) • Set Xt+1 = Y with probability min{exp(dF/Dt),1} • Set Dt+1 = Dt / g and set t=t+1 • Go to step 1

ORL Face Dataset

Performance Comparison

Performance Comparison – cont.

Brain Curve Classification

Brain Curve Classification – cont.

Real-time Scene Interpretation • Object detection and recognition problem • Given a set of images, find regions in these images which contain instances of relevant objects • Here the number of relevant objects is assumed to be large • For example, the system should be able to handle 30,000 different kinds of objects, an estimate of the human brain’s capacity for basic level visual categorization [I. Biederman, Psychological Review, vol. 94, pp. 115-147, 1987]

Global Monitoring Through High-resolution Satellite Images

Problem Statement for Scene Interpretation • Object detection and recognition problem • Given a set of images, find regions in these images which contain instances of relevant objects • Here the number of relevant objects is assumed to be large • For example, the system should be able to handle 30,000 different kinds of objects, an estimate of the human’s capacity for basic level visual categorization [I. Biederman, Psychological Review, vol. 94, pp. 115-147, 1987] • Goal • Develop a system that can achieve real-time detection and recognition for images of size 640 x 480 with high accuracy • Say, at a frame rate of 15 frames per second

Xiuwen Liu Department of Computer Science Florida State University

Xiuwen Liu Department of Computer Science Florida State University

Presentation Transcript

Florida State University Department of Computer Science

Ken D. Nguyen Department of Computer Science Georgia State University

FLORIDA DEPARTMENT OF STATE

Florida Department of State

FLORIDA DEPARTMENT OF STATE

Computer Science Department Sonoma State University

FLORIDA DEPARTMENT OF STATE

Florida State University Department of Computer Science

Florida State University Department of Computer Science

Yonggang Liu University of Florida

Department of Computer Science, Wayne State University

Alex Zelikovsky Department of Computer Science Georgia State University

Serghei Mangul Department of Computer Science Georgia State University

University Of Central Florida Department Of Computer Science The CRAY-1 Computer System

Florida Department of State

Department of Computer Science, Princeton University

RANI NALAMARU DEPARTMENT OF COMPUTER SCIENCE BALL STATE UNIVERSITY

FLORIDA DEPARTMENT OF STATE

Columbia University Department of Computer Science

Concordia University Department of Computer Science

Columbia University Department of Computer Science