3D v ideo understanding using a topology dictionary

3D video understanding using a topology dictionary Tony Tung & Takashi Matsuyama Kyoto University, Japan Dagstuhlseminar Oct.14th, 2010

3D video - Markerless motion/surface capture - Image-based system [Matsuyama et al., CVIU'04]

3D video framework - Reconstruction space: 3m x 3m x3m - 16 video cameras UXGA 30 fps - Synchronization by external trigger - Geometrically calibrated

3D video framework 3D surface reconstruction by MVS technique One or several subjects per frame Volumetric graph-cuts (5cm resolution) 1 frame ~ 1.5 MB (30,000 triangles) 5 min ~ 11.25 GB No 3D video tapestry! [Tung et al., CVPR’08] [Tung et al., ICCV’09]

3D video modeling using a topology dictionary • Encoding of 3D video sequences • Description of content for human behavior understanding

3D video modeling using a topology dictionary • Structure of the model 1 - Pattern detection using a topology descriptor (Reeb graph) 2 - Encoding of topology clusters 3 - Probabilistic motion graph • Video skimming • Pose/Action recognition • 3D performance segmentation [Tung et al., CVPR’09] [Tung et al., ICIP’10]

3D video sequence Independent frame reconstruction

3D video sequence Inconsistent topology between frames

I – Enhanced Reeb graphs

Topology description • Morse theory  : Swith : real continuous function S : manifold surface (mesh surface) Reeb graph = quotient space of the graph of  in S defined by the equivalence relation ~ . (X) = (Y) (X ,Y) S2, X ~ Y . X and Y same connected component as -1((X)) [Reeb, 1946]

Shape description • Multiresolution Reeb graphs - Automatic extraction of graphs • R, t, scale invariant • Homotopic • Multiresolution coarse-to-fine matching [Hilaga et al., SIGGRAPH’01] [Tung et al.,CVPR’07]

Topology description

Topology matching - Invariance to rotation, translation and scale - Coarse-to-fine multiresolution strategy - Matching using topological and geometrical attributes (valency, relative area) - The similarity of two models M,N is obtained by evaluation of the “similarity” of topology consistent pairs {(mi, nj)} at every level of resolution SIM(M,N) =  sim(mi, nj) R r=0 {ij} [Hilaga et al., SIGGRAPH’01] [Tung et al.,CVPR’07]

Performance evaluation Pose retrieval in 3D video sequences [Huang et al., 3DPVT'10]

II - Topology dictionary

Topology clusters Encoding of (repetitive) poses

Topology clusters • Encoding of (repetitive) poses

Topology clusters • Encoding of (repetitive) poses i i

Topology clusters • Motion graph structure SIGGRAPH’02: [Arikan&Forsyth] [Kovar et al.] [Lee et al.] i

Topology clusters • Motion graph structure SIGGRAPH’02: [Arikan&Forsyth] [Kovar et al.] [Lee et al.] i SUMMARIZATION

Topology clusters • Motion graph structure SIGGRAPH’02: [Arikan&Forsyth] [Kovar et al.] [Lee et al.] i 3D VIDEO SKIMMING SKIMIN

Topology clusters Dataset clustering using SSM • Similarity function SIM

Topology clusters • Dataset clustering using SSM Repetitive poses Long poses Short poses Transitions • Similarity function SIM

Topology clusters flashkick head free

Topology clusters lock pop kickup

Directed motion graph • Motion graphs allow users to design new sequences by building walks on the graph

Directed motion graph • 3D video sequences contain noises and redundancies

Probabilistic motion graph • Graph G = (C=U{Ci},E) • Node weight depends on topology cluster size if P(Ci) >> 0, then Ci corresponds to a long pose of a repetitive pose

Probabilistic motion graph • Graph G = (C=U{Ci},E) • Node weight depends on topology cluster size if P(Ci) >> 0, then Ci corresponds to a long pose of a repetitive pose • Transition determines how relevant is a motion if P(Ci|Cj) <1, then Cj corresponds to cycle junction node

Probabilistic motion graph …P(C’i) P(C’k)… P(C’j) Selection of the most probable paths p = {C'i, e'ij} argmax P(p) = ∏ P(C'i | C‘j) P(C'i) {e'ij}

Probabilistic motion graph • 3D video skimming by cycle trimming small cycles (noise, short action) Cycles are identified by cluster index in the sequence

Probabilistic motion graph • 3D video skimming by cycle trimming small cycles (noise, short action) Cycles are evaluated by Size and Relevance: Small S(L) and P(L) are first candidates for skimming

3D video skimming 1 - Evaluate cycle size and relevancy • . Small cycles with low probability are first candidates 2 - Compute path probability regarding cycles • . The most probable paths indicate which cycles to remove

3D video skimming

Training dataset with annotation

3D video description

Synthetic dataset

3D video description

Segmented training dataset

Performance segmentation

Summary 3D video is a markerless motion capture technique which allows to a capture subject as is Topology dictionary model to represent 3D videos Sequence encoding Topology matching using Reeb graphs Probabilistic motion graph structure 3D video skimming/summarization 3D video description and segmentation

Future work • How accurate is the matching? • What can we not recognize? • Action -> behavior? • Sequence reconstruction artifacts • Smoothness parameters

3D Shape Reconstructionfrom Multi-viewpoint images and silhouettes Thank you for your attention.

3D v ideo understanding using a topology dictionary

3D v ideo understanding using a topology dictionary

Presentation Transcript

Understanding SharePoint 2010 Topology

ideo

Understanding Topology for Soil Survey

Dome v ideo production

Review Your V ideo

IDEO

Using a Dictionary Entry

Using A Dictionary

IDEO and the Palm V

A stronomical D igital V ideo S ystem (ADVS)

V IDEO SURVEILLANCE ( public /private areas )

Dictionary structures and dictionary using skills

Postsecondary V ideo Remot e Interpreting

Using a Medical Dictionary