Learning Compact Visual SLAM Representation with CodeSLAM

Vision Reading Group, 2nd of May 2018 Martin Rünz CodeSLAM — Learning a Compact, Optimisable Representation for Dense Visual SLAM

In a nutshell An alternative representation to depth maps is presented

Background – Common Map Representations Surfels Mesh Pose-Graph + Depth-Maps TSDF

Ideas • Depth-maps are notrandom

Ideas • Depth-maps are notrandom(especially in man-made environments)

Ideas • Depth-maps are notrandom • A compact representation – code – can be learned by encoder • Decoders are differentiable → code can be optimised (w.r.t. photometric loss...) • This is useful in SfM or SLAM scenarios • Reconstruct high-frequency details from color

Depth Reconstruction • Depth-from-mono • Depth-from-mono + code • Given this differentiable function, warping constraints can be used to optimise c and pose

Architecture Color feature extractor + uncertainty predictor Laplace distribution Only for training, ground-truth depth • Variational autoencoder, to increase smoothness of mapping between code and depth • Small changes in code → small changes in depth Decoder: CODE

Video

Inverse depth parametrization average • Error behaves moreGaussian original depth

Warping Transform → B 3D Photometric error: • Both functions differentiable to inputs Expensive(convolutions) Pre-computed if decoder is linear!

Application • SfM • Initialise poses and code with zero-vector • Use residuals + Jacobians in Gauss-Newton style optimisation • Cost function: • Functionality of : • Mask invalid correspondences • Relative weighting • Huber weighting • Down-weight slanted surfaces • Down-weight occluded pixels

Experiments • SfM

Experiments • SLAM

Experiments • Setups

Experiments • Influence code entries

Video

Thanks for listening!

Learning Compact Visual SLAM Representation with CodeSLAM

Learning Compact Visual SLAM Representation with CodeSLAM

Presentation Transcript

BONES

SLAM Summer School 2004

Visual/Auditory Representation of Poetry

Let’s SLAM our Extend Response

3D SLAM for Omni-directional Camera

A Compact Random-Access Representation for Urban Modeling and Rendering

Visual/Spatial Giftedness

Learning outcomes

Key Concepts: Representation

Winds and convection

Heuristic learning

Visual Representation

LEARNING STYLES

7.1 Visual Representation of Data

7.1 Visual Representation of Data

SLAM/FastSLAM

Vision and SLAM

The Representation of Visual Salience in the Superior Colliculus

A visual representation

A Visual Query Language for Business Processes

Visual Representation

Motion