370 likes | 767 Views
One-Shot Learning Gesture Recognition. Students: Itay Hubara Amit Nishry Supervisor: Maayan Harel Gal-On. Outline. Background. Gesture recognition is a strong upcoming field in computer vision
E N D
One-Shot LearningGesture Recognition Students: ItayHubara AmitNishry Supervisor: MaayanHarel Gal-On
Background Gesture recognition is a strong upcoming field in computer vision Gesture recognition can be seen as a way for computers to begin to understand human body language
Goals Learn and understand existing Gesture recognition algorithms. Compare different approaches Design Gesture recognition algorithm which reduces training time
Data • The Data is compose from several set each contains: • Gesture vocabulary (learning set) which contain only one sample per gesture. • Test set which contain one or more gestures. • Each of the sets has different vocabulary features such as large/small gesture hand/legs movement etc.
Data – Train Base gesture
Data – Test Multiple base gestures Large movements
Data - Test Multiple base gestures Small movements
Challenges • One shot learning - only one learning sample (unlike the common approach of multi class classification) • Tests videos segmentation • Same gesture can have different number of frames • Each set has different features (small/big gestures)
Reduced Problem • Assume that each of the test movies has only one gesture • Goal:finding features space and distance function which have good separation of the features space
Features • Motion Energy • subtracting consecutive frames • Space Quantization
Features • Harris Corner Detector • Find interest point in the difference image based on corner detection • Space Time Interest Points • Extend Harris to the time domain
Features Harris STIP
Features • Head Relative Interest Points
Features Interest points Head Histogram
Distance Functions • Good features space is defined not only by the features but also by the distance (similarity) function • Different features need different distance functions
Principal Motion Using PCA • Using principal component analysis (PCA), to find the main motion vectors. • For test set - project feature onto each of train principals and evaluate similarity
Earth Moving Distance • Given two sets of distribution, EMD will measure the minimum cost to shift “dirt” from one distribution to the other.
Perturbed Variations • Given two sets of distribution and predefined value of permitted variations optimally perturbs the distribution to best fit each other. Transportation problem under permitted variations constrain
LevenshteinDistance • Measure the difference between two sequences. • Consider lengths and classification.
Results Top 20 Top 10
Complete Problem Separate problems • Basic Segmentation (equal/movement) Whole problem solving approach • Moving Window • Dynamic Time Warping (DTW)
Moving Window • Move a window along the test video. • Assume each window frames has only one gesture • Preform basic analysis as did before to and build the distance matrix
Dynamic Time Warping • Create a state machine from train data: • Module standing position • Form standing position can move to start of base gestures • Assume we can move forward, or stay in the same sate. • For a given gesture – find the best path along the sate machine
Results Top 20 Top 10
Conclusions Each approach receive better results in different feature and similarity function Different algorithms has different strengths (segmentation\recognition) Segmentation require standing position model.
Conclusions • Pre-processing unsupervised algorithms help better representing the data. • There is still allot left to do on the field
Future Work Try different models for the standing position to improve segmentation results Try combing DTW for segmentation and PCA for recognition. Use different unsupervised algorithms to better represent the data.
References Ivan Laptev, "On Space-Time Interest Points”, 2005 Hugo JairEscalantea and Isabelle Guyonb, "Principal motion: PCA-based reconstruction of motion histograms” M.Harel, S.Manor, "The Perturbed Variation”, NIPS 2012 ElizavetaLevina, Peter Bickel Department of Statistics, “The EarthMover’s Distance is the Mallows Distance: Some Insights from Statistics”. OfirPele,MichaelWerman, “Fast and Robust Earth Mover’s Distances”.2008