340 likes | 418 Views
Retrieving Actions in Group Contexts. Tian Lan , Yang Wang, Greg Mori, Stephen Robinovitch Simon Fraser University Sept. 11, 2010. Outline. Action Retrieval as Ranking. Contextual Representation of Actions. Results and Future Work. Nursing Home.
E N D
Retrieving Actions in Group Contexts TianLan, Yang Wang, Greg Mori, Stephen Robinovitch Simon Fraser University Sept. 11, 2010
Outline • Action Retrieval as Ranking • Contextual Representation of Actions • Results and Future Work
Nursing Home • Fall analysis in nursing home surveillance videos • a system automatically rank the videos according to the relevance to fall action is expected
Action-Action Context What other people are doing ? Context
Actions in Group Context • Motivation • human actions are rarely performed in isolation, the actions of individuals in a group can serve as context for each other. • Goal • explore the benefit of contextual information in action retrieval in challenging real-world applications
Action Context Descriptor τ τ z + action Focal person Context action
Action Context Descriptor Feature Descriptor Multi-class SVM e.g. HOG by Dalal & Triggs score score score score max action class action class action class action class …
Outline • Action Retrieval as Ranking • Contextual Representation of Actions • Results and Future Work
Classification or Retrieval • Previous Work • Most work in human action understanding focuses on action classification.
Classification or Retrieval • Most surveillance tasks are typical retrieval tasks • retrieve a small video segment contains a particular action from thousands of hours of videos. • The “action of interest” is rare event • Extremely imbalanced classes
Query : fall Action Retrieval Rank according to the relevance to falls
Learning • Input: document-rank pair (xi,yi) • Optimization Joachims, KDD 06
Ranking SVM • Ranking function h(x) h(x) Rank r1 Rank r2 Rank r3
Action Retrieval - training irrelevant relevant very relevant
Outline • Action Retrieval as Ranking • Contextual Representation of Actions • Results and Future Work
Dataset • Nursing Home Dataset • 5 action categories: walking, standing, sitting, bending and falling. (per person) • 18 video clips. • Query: fall • Collective Activity Dataset (Choi et al. VS. 09) • 5 action categories: crossing, waiting, queuing, walking, talking. (per person) • 44 video clips. • Query: each of the five actions
Dataset • Nursing Home Dataset
Dataset • Collective Activity Dataset
System Overview u Person Detector Rank SVM Person Descriptor Video v • Pedestrian Detection • by Felzenszwalb et al. • Background Subtraction • HOG by Dalal & Triggs • LST by Loy et al. • at cvpr 09
Baselines • Context vs No Context • Action Context Descriptor • Original feature descriptors, e.g. HOG (Dalal & Triggs at CVPR 05), LST (Loy et al. at CVPR 09) • RankSVMvs SVM • Methods • Context + RankSVM (our method) • Context + SVM • No Context + RankSVM • No Context + SVM
Retrieval Results Nursing Home Dataset
Retrieval Results Collective Activity Dataset
Retrieval Results Collective Activity Dataset
Retrieval Results Collective Activity Dataset
1 2 3 4
5 6 7 8
Action Classification [10] Choi et al. in VS. 09 Collective Activity Dataset
Conclusion • A new contextual feature descriptor to represent actions • action context (AC) descriptor • Formulate our problem as a retrieval task.
Future Work • Contextual Feature Descriptors • How to only encode useful context? • Rank-SVM loss, optimize the NDCG score
5 6 7 8