Explore the fusion of Visual SLAM, Carrier-Phase GPS, and Inertial Measurements for globally-referenced position and attitude. This research delves into motivating applications such as Augmented Reality and provides simulation results showcasing high-precision camera motion estimation and environment mapping. With a focus on accuracy in diverse applications like Construction, Real-Estate, Gaming, and Social Media, the study investigates the integration of sensors like camera, GPS antennas, and IMU, offering insights into real-time operation challenges, computational trade-offs, and sensor fusion strategies. Discover how Bundle Adjustment techniques, robust cost minimization, and novel initialization methods enhance the accuracy and robustness of positioning estimates.
High-Precision Globally-Referenced Position and Attitude via a Fusion of Visual SLAM, Carrier-Phase-Based GPS, and Inertial Measurements Daniel Shepard and Todd Humphreys 2014 IEEE/ION PLANS Conference, Monterey, CA | May 8, 2014
Overview • Globally-Referenced Visual SLAM • Motivating Application: Augmented Reality • Estimation Architecture • Bundle Adjustment (BA) • Simulation Results for BA
Stand-Alone Visual SLAM • Produces high-precision estimates of • Camera motion (with ambiguous scale for monocular SLAM) • A map of the environment • Limited in application due to lack of a global reference [1] G. Klein and D. Murray, “Parallel tracking and mapping for small AR workspaces,” in 6th IEEE and ACM International Symposium on Mixed and Augmented Reality. IEEE, 2007, pp. 225–234.
Visual SLAM with Fiducial Markers • Globally-referenced solution if fiducial markers are globally-referenced • Requires substantial infrastructure and/or mapping effort • Microsoft’s augmented reality maps (TED2010[2]) [2] B. A. y Arcas, “Blaise Aguera y Arcas demos augmented-reality maps,” TED, Feb. 2010, http://www.ted.com/talks/blaise aguera.html.
Can globally-referenced position and attitude (pose) be recovered from combining visual SLAM and GPS?
Observability of Visual SLAM + GPS [Table: observability of translation, rotation, and scale with no GPS positions, 1 GPS position, 2 GPS positions, and 3 GPS positions]
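A stand-alone monocular SLAM solution is defined only up to a similarity transform (3 translation, 3 rotation, and 1 scale degree of freedom), and each GPS position supplies 3 constraints. A minimal sketch of why two positions are not quite enough: a rotation about the line joining the two GPS positions leaves both of them unchanged, so that one rotational degree of freedom stays unobserved until a third, non-collinear position is added. The function names and sample coordinates below are hypothetical and chosen only for illustration.

```python
import math

def rotate_about_axis(p, a, b, angle):
    """Rotate point p by `angle` radians about the line through a and b
    (Rodrigues' rotation formula)."""
    axis = [b[i] - a[i] for i in range(3)]
    norm = math.sqrt(sum(c * c for c in axis))
    k = [c / norm for c in axis]                      # unit axis direction
    v = [p[i] - a[i] for i in range(3)]               # p relative to the axis
    k_dot_v = sum(k[i] * v[i] for i in range(3))
    k_cross_v = [k[1] * v[2] - k[2] * v[1],
                 k[2] * v[0] - k[0] * v[2],
                 k[0] * v[1] - k[1] * v[0]]
    c, s = math.cos(angle), math.sin(angle)
    rotated = [v[i] * c + k_cross_v[i] * s + k[i] * k_dot_v * (1.0 - c)
               for i in range(3)]
    return [rotated[i] + a[i] for i in range(3)]

def max_shift(points, a, b, angle):
    """Largest displacement of any point under the rotation."""
    return max(math.dist(p, rotate_about_axis(p, a, b, angle)) for p in points)

# Hypothetical GPS positions (metres); gps_c is not collinear with gps_a, gps_b.
gps_a = [0.0, 0.0, 0.0]
gps_b = [10.0, 0.0, 0.0]
gps_c = [5.0, 3.0, 0.0]

shift_two = max_shift([gps_a, gps_b], gps_a, gps_b, 0.3)    # rotation is invisible
shift_third = max_shift([gps_c], gps_a, gps_b, 0.3)         # rotation is exposed
```

Here `shift_two` is zero up to rounding while `shift_third` is clearly nonzero, which is the sense in which a third non-collinear position is needed to pin down the full rotation.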
Combined Visual SLAM and CDGPS • CDGPS anchors visual SLAM to a global reference frame • Can add an IMU to improve dynamic performance (not required!) • Can be made inexpensive • Requires little infrastructure Very Accurate!
Motivating Application: Augmented Reality • Augmenting a live view of the world with computer-generated sensory input to enhance one’s current perception of reality[3] • Current applications are limited by lack of accurate global pose • Potential uses in • Construction • Real-Estate • Gaming • Social Media [3] Graham, M., Zook, M., and Boulton, A. "Augmented reality in urban places: contested content and the duplicity of code." Transactions of the Institute of British Geographers.
Estimation Architecture Motivation • Sensors: • Camera • Two GPS antennas (reference and mobile) • IMU • How can the information from these sensors best be combined to estimate the camera pose and a map of the environment? • Real-time operation • Computational burden vs. precision
Sensor Fusion Approach • Tighter coupling = higher precision, but increased computational burden [Figure: four candidate architectures, each combining IMU, visual SLAM, and CDGPS blocks]
Hybrid Batch/Sequential Estimator • Only geographically diverse frames (keyframes) in batch estimator
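One simple way to pick geographically diverse frames is a distance threshold: a frame becomes a keyframe once the camera has moved far enough from the previous keyframe. The sketch below assumes that criterion; the slides do not state the actual selection rule, and `select_keyframes` and `min_spacing` are hypothetical names.

```python
import math

def select_keyframes(positions, min_spacing):
    """Return indices of frames whose position is at least `min_spacing`
    away from the most recently selected keyframe.
    Assumption: spacing is the only selection criterion."""
    keyframes = []
    last = None
    for i, p in enumerate(positions):
        if last is None or math.dist(p, last) >= min_spacing:
            keyframes.append(i)
            last = p
    return keyframes

# Camera moving along a straight line in 0.125 m steps.
track = [(0.125 * i, 0.0, 0.0) for i in range(9)]
selected = select_keyframes(track, min_spacing=0.25)
```

Frames that are not selected would then be handled by the sequential part of the estimator rather than the batch part.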
Bundle Adjustment State and Measurements • State Vector: • Measurement Models: • CDGPS Positions: • Image Feature Measurements:
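The equations on this slide are not preserved here. As a rough illustration only, the sketch below uses common textbook forms: a CDGPS position modeled as the camera position plus a rotated camera-to-antenna lever arm, and an image feature modeled as a pinhole projection. These forms, the function names, and the parameters `lever_arm_cam` and `focal_px` are assumptions, not the authors' models.

```python
def mat_vec(R, v):
    """Multiply a 3x3 matrix (nested lists) by a 3-vector."""
    return [sum(R[i][j] * v[j] for j in range(3)) for i in range(3)]

def transpose(R):
    return [[R[j][i] for j in range(3)] for i in range(3)]

def predict_antenna_position(cam_pos, R_cam_to_world, lever_arm_cam):
    """Assumed CDGPS model: antenna = camera position + rotated lever arm."""
    offset = mat_vec(R_cam_to_world, lever_arm_cam)
    return [cam_pos[i] + offset[i] for i in range(3)]

def predict_feature(point_world, cam_pos, R_cam_to_world, focal_px):
    """Assumed image model: ideal pinhole projection, no lens distortion,
    principal point at the image origin."""
    rel = [point_world[i] - cam_pos[i] for i in range(3)]
    x, y, z = mat_vec(transpose(R_cam_to_world), rel)   # world -> camera frame
    if z <= 0:
        raise ValueError("point is behind the camera")
    return [focal_px * x / z, focal_px * y / z]

IDENTITY = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
pixel = predict_feature([1.0, 2.0, 4.0], [0.0, 0.0, 0.0], IDENTITY, 500.0)
antenna = predict_antenna_position([1.0, 1.0, 1.0], IDENTITY, [0.0, 0.1, 0.0])
```

In a bundle adjustment, the residuals would be the differences between measured values and these predictions for every keyframe and every visible point feature.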
Bundle Adjustment Cost Minimization • Weighted least-squares cost function • Employs robust weight functions to handle outliers • Sparse Levenberg-Marquardt algorithm • Computational complexity linear in number of point features, but cubic in number of keyframes
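To make the combination of robust weights and Levenberg-Marquardt damping concrete, here is a toy sketch that fits a line to data containing one gross outlier. It is a dense two-parameter problem, not a sparse bundle adjustment, and the Huber weight function is an assumption: the slides do not say which robust weight is used.

```python
def huber_weight(residual, delta):
    """Weight 1 for small residuals, delta/|r| for large ones (Huber)."""
    a = abs(residual)
    return 1.0 if a <= delta else delta / a

def robust_cost(m, c, xs, ys, delta):
    """Huber cost of the line y = m*x + c."""
    cost = 0.0
    for x, y in zip(xs, ys):
        r = abs(y - (m * x + c))
        cost += 0.5 * r * r if r <= delta else delta * (r - 0.5 * delta)
    return cost

def fit_line_robust_lm(xs, ys, delta=1.0, iters=50):
    m, c, lam = 0.0, 0.0, 1e-3
    for _ in range(iters):
        # Weighted normal equations (J^T W J) step = J^T W r, Jacobian rows [x, 1].
        a11 = a12 = a22 = g1 = g2 = 0.0
        for x, y in zip(xs, ys):
            r = y - (m * x + c)
            w = huber_weight(r, delta)
            a11 += w * x * x
            a12 += w * x
            a22 += w
            g1 += w * x * r
            g2 += w * r
        # Levenberg-Marquardt damping applied to the diagonal.
        d11, d22 = a11 * (1.0 + lam), a22 * (1.0 + lam)
        det = d11 * d22 - a12 * a12
        dm = (d22 * g1 - a12 * g2) / det
        dc = (d11 * g2 - a12 * g1) / det
        if abs(dm) + abs(dc) < 1e-12:
            break
        if robust_cost(m + dm, c + dc, xs, ys, delta) < robust_cost(m, c, xs, ys, delta):
            m, c, lam = m + dm, c + dc, lam / 10.0   # accept step, relax damping
        else:
            lam *= 10.0                              # reject step, increase damping
    return m, c

xs = [float(x) for x in range(10)]
ys = [2.0 * x + 1.0 for x in xs]
ys[9] += 30.0                                        # one gross outlier
m, c = fit_line_robust_lm(xs, ys)
```

The recovered slope stays close to 2 despite the outlier, whereas an unweighted least-squares fit to the same data would be pulled noticeably toward it.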
Bundle Adjustment Initialization • Initialize BA based on stand-alone visual SLAM solution and CDGPS positions • Determine similarity transform relating coordinate systems • Generalized form of Horn’s transform[4] • Rotation: Rotation that best aligns deviations from mean camera position • Scale: A ratio of metrics describing spread of camera positions • Translation: Difference in mean antenna position [4] B. K. Horn, “Closed-form solution of absolute orientation using unit quaternions,” JOSA A, vol. 4, no. 4, pp. 629–642, 1987.
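The three ingredients listed above (rotation from mean-removed positions, scale from a ratio of spreads, translation from the means) can be sketched in a planar simplification. This is a 2D illustration using a closed-form angle, not the quaternion-based 3D solution of Horn [4] or the authors' generalized form, and it ignores any camera-to-antenna offset. All names are hypothetical.

```python
import math

def align_similarity_2d(src, dst):
    """Estimate scale, rotation angle, and translation such that
    dst ~= scale * R(theta) * src + t, for paired 2D points."""
    n = len(src)
    mean_s = [sum(p[i] for p in src) / n for i in range(2)]
    mean_d = [sum(p[i] for p in dst) / n for i in range(2)]
    ds = [(p[0] - mean_s[0], p[1] - mean_s[1]) for p in src]
    dd = [(p[0] - mean_d[0], p[1] - mean_d[1]) for p in dst]

    # Rotation: angle that best aligns the deviations from the mean.
    dot = sum(a[0] * b[0] + a[1] * b[1] for a, b in zip(ds, dd))
    cross = sum(a[0] * b[1] - a[1] * b[0] for a, b in zip(ds, dd))
    theta = math.atan2(cross, dot)

    # Scale: ratio of the spreads of the two point sets.
    spread_s = sum(a[0] ** 2 + a[1] ** 2 for a in ds)
    spread_d = sum(b[0] ** 2 + b[1] ** 2 for b in dd)
    scale = math.sqrt(spread_d / spread_s)

    # Translation: difference between the means after scaling and rotating.
    co, si = math.cos(theta), math.sin(theta)
    t = [mean_d[0] - scale * (co * mean_s[0] - si * mean_s[1]),
         mean_d[1] - scale * (si * mean_s[0] + co * mean_s[1])]
    return scale, theta, t

# Hypothetical SLAM-frame camera positions and their globally-referenced
# counterparts, generated with scale 2, rotation 0.5 rad, translation (10, -3).
slam_xy = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 2.0)]
co, si = math.cos(0.5), math.sin(0.5)
gps_xy = [(2.0 * (co * x - si * y) + 10.0, 2.0 * (si * x + co * y) - 3.0)
          for x, y in slam_xy]
scale, theta, t = align_similarity_2d(slam_xy, gps_xy)
```

With noise-free inputs the sketch recovers the generating transform; with noisy CDGPS positions it would give a starting point for the bundle adjustment rather than a final answer.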
Simulation Scenario for BA • Simulations investigating estimability included in paper • Hallway Simulation: • Measurement errors: 2 cm std for CDGPS, 1 pixel std for vision • Keyframes every 0.25 m • 242 keyframes • 1310 point features • Three scenarios: • GPS available • GPS lost when hallway entered • GPS reacquired when hallway exited [Figure: hallway layout with points labeled A, B, C, D]
Summary • Hybrid batch/sequential estimator for loosely-coupled visual SLAM and CDGPS with IMU for state propagation • Compared to optimal estimator • Outlined algorithm for BA (batch) • Presented a novel technique for initialization of BA • BA simulations • Demonstrated positioning accuracy of cm and attitude accuracy of in areas of GPS availability • Attained slow drift during GPS unavailability (0.4% drift over 50 m)
Navigation Filter • State Vector: • Propagation Step: • Standard EKF propagation step using accelerometer and gyro measurements • Accelerometer and gyro biases modeled as first-order Gauss-Markov processes • More information in paper …
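A first-order Gauss-Markov process is exponentially correlated noise. A minimal discrete-time sketch of such a bias model follows, assuming a correlation time `tau` and steady-state standard deviation `sigma`; these parameter names and the sample values are hypothetical, and the actual values used by the authors are in the paper.

```python
import math
import random

def gauss_markov_coeffs(dt, tau, sigma):
    """Discrete-time model b[k+1] = phi * b[k] + w[k], w ~ N(0, q_std^2),
    chosen so the steady-state standard deviation of b is sigma."""
    phi = math.exp(-dt / tau)
    q_std = sigma * math.sqrt(1.0 - phi * phi)
    return phi, q_std

def propagate_bias_variance(var, dt, tau, sigma):
    """Variance propagation for the bias state over one step of length dt."""
    phi, q_std = gauss_markov_coeffs(dt, tau, sigma)
    return phi * phi * var + q_std * q_std

def simulate_bias(n, dt, tau, sigma, seed=0):
    """Draw one sample path of the bias for n steps."""
    rng = random.Random(seed)
    phi, q_std = gauss_markov_coeffs(dt, tau, sigma)
    b = rng.gauss(0.0, sigma)          # start from the steady-state distribution
    path = []
    for _ in range(n):
        path.append(b)
        b = phi * b + rng.gauss(0.0, q_std)
    return path

phi, q_std = gauss_markov_coeffs(dt=0.01, tau=100.0, sigma=0.05)
steady = propagate_bias_variance(0.05 ** 2, dt=0.01, tau=100.0, sigma=0.05)
path = simulate_bias(1000, dt=0.01, tau=100.0, sigma=0.05)
```

In an EKF propagation step, `phi` would enter the state transition matrix for the bias states and `q_std ** 2` the corresponding process noise covariance.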
Navigation Filter (cont.) • Measurement Update Step: • Image feature measurements from all non-keyframes • Temporarily augment the state with point feature positions • Prior from map produced by BA • Must ignore cross-covariances, which leads to filter inconsistency • Similar block diagonal structure in the normal equations as BA