190 likes | 484 Views
Discovering Important People and Objects for Egocentric Video Summarization. Yong Jae Lee, Joydeep Ghosh , and Kristen Grauman University of Texas at Austin. Outline. Introduction Approach Results Conclusion. Introduction. Introduction.
E N D
Discovering Important People and Objects for Egocentric Video Summarization Yong Jae Lee, JoydeepGhosh, and Kristen Grauman University of Texas at Austin
Outline Introduction Approach Results Conclusion
Introduction Focus on the most important objects and people with which the camera wearer interacts. Develop region cues indicative of high-level saliency feature in egocentric video Learn a regressor to predict the relative importance of any new region based on the cues.
Approach Train a regression model to predict region importance Segment the video into temporal events Scoring each region ' s importance using the regressor Generate a storyboard summary of important people /objects important things are those with which the camera wearer has significant interaction. four main steps:
Egocentric video data collection We use the Looxcie wearable camera, which captures video at 15 fps at 320 x 480 resolution. We collected 10 videos, each of 3-5 hours in length.
Learning region importance in egocentric video • Egocentric features • Interaction • Gaze • Frequency
Learning region importance in egocentric video [19] D. Lowe. Distinctive Image Features from Scale-Invariant Keypoints.IJCV, 60(2), 2004. • Frequency feature • Matching regions • Matching points(DoG+SIFT)[19]
Learning region importance in egocentric video [3] Constrained Parametric Min-Cutsfor Automatic Object Segmentation. In CVPR, 2010. [16]Key-Segments for Video ObjectSegmentation. In ICCV, 2011. [27]Rapid Object Detection using a Boosted Cascadeof Simple Features. In CVPR, 2001. • Object features • object-like appearance[3] • object-like motion[16] • likelihood of a person's face[27] • Region features • size、centroid • bounding box centroid、width、height
Regressor to predict region importance learned parameters i’thfeature value For training: ; for testing: predict given Training a linear regression model with pair-wise interaction terms to predict a region r'simportance score:
Results Evaluate on videos from all 4 users, total 17 hours.Train using data from 3 users and test on 1 video from remaining user.
Important region prediction accuracy [3] Constrained Parametric Min-Cuts for Automatic Object Segmentation. In CVPR, 2010. [6]Category Independent Object Proposals. In ECCV, 2010. [28] Modeling Attention to Salient Proto-Objects. Neural Networks, 19:1395–1407, 2006.
Conclusion A novel approach to perform summarization for egocentric video. Focus on the most important objects and people that generate the " story " of vedio. Novel egocentric features to train a regressor that predicts important regions.