1 / 21

Making Action Recognition Robust to Occlusions and Viewpoint Changes

Making Action Recognition Robust to Occlusions and Viewpoint Changes. Daniel Weinland Mustafa Özuysal Pascal Fua. European Conference on Computer Vision, 2010. Outline. Introduction Block Embedding of 3DHOG Classifier Combination Experiments Conclusion. Outline. Introduction

vivi
Download Presentation

Making Action Recognition Robust to Occlusions and Viewpoint Changes

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Making Action Recognition Robust to Occlusions and Viewpoint Changes Daniel Weinland Mustafa Özuysal Pascal Fua European Conference on Computer Vision, 2010

  2. Outline • Introduction • Block Embedding of 3DHOG • Classifier Combination • Experiments • Conclusion

  3. Outline • Introduction • Block Embedding of 3DHOG • Classifier Combination • Experiments • Conclusion

  4. Introduction • Most state-of-the-art approaches achieve nearly perfect results on KTH and Weizmann datasets • Subjects are seen from similar viewpoints • Uniform backgrounds • The motions in the test and training set look very similar • A novel approach is proposed to provide robustness to both occlusions and viewpoint changes

  5. Introduction • System Overview

  6. Outline • Introduction • Block Embedding of 3DHOG • Classifier Combination • Experiments • Conclusion

  7. Block Embedding of 3DHOG • Histograms of Oriented Gradient(HOG) [2] [2] Dalal N., Triggs, B.: Histograms of Oriented Gradients for Human Detection. In: CVPR (2005)

  8. Block Embedding of 3DHOG • Densely distributed locations within a ROI • 3DHOG [11] as feature descriptor [11] Kläser A., Marszałek, M, Schmid, C.: A Spatio-Temporal Descriptor Based on 3D-Gradients. In: BMVC (2008)

  9. Block Embedding of 3DHOG • Block descriptor • A normalized histogram with clipping applied • Sequence of blocks • : number of overlapping blocks that fit within the duration of the sequence • : number of blocks that fit within the ROI

  10. Block Embedding of 3DHOG • Block Embedding • Create a set of prototype block descriptors • Randomly sampling from training subsequencies • Create a -dimensional vector for each • The dimensionis the -distance from to the closest block in

  11. Outline • Introduction • Block Embedding of 3DHOG • Classifier Combination • Experiments • Conclusion

  12. Classifier Combination • Use L2-regularized logistic regression as local classifier • Estimate for each class • : learned logistic repression weights at position • action classes with represents the probability of region being occluded • Four combination schemes • Product Rule • Sum Rule • Weighted Rule • Hierarchical

  13. Classifier Combination • Product Rule • Sum Rule • Weighted Rule • Hierarchical • Combine the outputs of all local classifiers into a single feature vector and learned a SVM on top of this representation

  14. Outline • Introduction • Block Embedding of 3DHOG • Classifier Combination • Experiments • Conclusion

  15. Experiments • Datasets • Weizmann, KTH, UCF, and IXMAS • Baseline methods • Global SVM • BoW SVM • Parameters • ROI is scaled and concatenated to produce a 48×64×t cube • 6 oriented bins for 3DHOG • 500 prototypes for embedding

  16. Experiments • Weizmann, KTH, and UCF

  17. Experiments • IXMAS

  18. Experiments • IXMAS testing training

  19. Experiments • IXMAS

  20. Outline • Introduction • Block Embedding of 3DHOG • Classifier Combination • Experiments • Conclusion

  21. Conclusion • The authors proposed a new approach based on a local 3DHOG descriptor • The authors demonstrated that the proposed descriptor can perform action recognition from multiple viewpoints and can against partial occlusions • Compared to our work

More Related