Abstract

We present a new parametrization for point features within monocular simultaneous localization and mapping (SLAM) that permits efficient and accurate representation of uncertainty during undelayed initialization and beyond, all within the standard extended Kalman filter (EKF). The key concept is direct parametrization of the inverse depth of features relative to the camera locations from which they were first viewed, which produces measurement equations with a high degree of linearity. Importantly, our parametrization can cope with features over a huge range of depths, even those that are so far from the camera that they present little parallax during motion, maintaining sufficient representative uncertainty that these points retain the opportunity to "come in" smoothly from infinity if the camera makes larger movements. Feature initialization is undelayed in the sense that even distant features are immediately used to improve camera motion estimates, acting initially as bearing references but not permanently labeled as such. The inverse depth parametrization remains well behaved for features at all stages of SLAM processing, but has the drawback in computational terms that each point is represented by a 6-D state vector as opposed to the standard three of a Euclidean XYZ representation. We show that once the depth estimate of a feature is sufficiently accurate, its representation can safely be converted to the Euclidean XYZ form, and propose a linearity index that allows automatic detection and conversion to maintain maximum efficiency; only low parallax features need be maintained in inverse depth form for long periods.
We present a real-time implementation at 30 Hz, where the parametrization is validated in a fully automatic 3-D SLAM system featuring a handheld single camera with no additional sensing. Experiments show robust operation in challenging indoor and outdoor environments with a very large range of scene depth, varied motion, and also real-time 360° loop closing.
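
The conversion from the 6-D inverse depth form to Euclidean XYZ mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not spell out the state layout, so the following are assumptions made for illustration: the six components are the camera position at first observation (x0, y0, z0), an azimuth theta, an elevation phi, and the inverse depth rho along the ray; the axis convention of the ray direction and the function name `inverse_depth_to_xyz` are likewise hypothetical. The linearity index that decides when to convert is not sketched here, since the abstract does not define it.

```python
import math


def inverse_depth_to_xyz(x0, y0, z0, theta, phi, rho):
    """Convert an (assumed) 6-D inverse depth feature to a Euclidean XYZ point.

    (x0, y0, z0): camera position when the feature was first observed.
    theta, phi:   azimuth and elevation of the viewing ray, in radians.
    rho:          inverse depth (1 / distance) along that ray.
    """
    if rho <= 0.0:
        # A point at infinity (rho == 0) has no finite XYZ representation.
        raise ValueError("rho must be positive to convert to XYZ")

    # Unit direction of the ray; this axis convention is an assumption.
    m = (
        math.cos(phi) * math.sin(theta),
        -math.sin(phi),
        math.cos(phi) * math.cos(theta),
    )

    depth = 1.0 / rho
    return (x0 + depth * m[0], y0 + depth * m[1], z0 + depth * m[2])


if __name__ == "__main__":
    # A feature first seen from the origin, straight ahead, at inverse depth 0.5.
    print(inverse_depth_to_xyz(0.0, 0.0, 0.0, 0.0, 0.0, 0.5))
```

Under these assumptions, a feature with rho close to zero stays in inverse depth form, because its XYZ coordinates would grow without bound; this is why the sketch rejects non-positive rho rather than returning a value.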

Keywords

Parametrization (atmospheric modeling); Initialization; Simultaneous localization and mapping; Parallax; Artificial intelligence; Feature (linguistics); Computer vision; Representation (politics); Computer science; Standard illuminant; Inverse; Extended Kalman filter; Algorithm; Mathematics; Kalman filter; Robot; Geometry; Physics; Mobile robot

Publication Info

Year: 2008
Type: article
Volume: 24
Issue: 5
Pages: 932-945
Citations: 760
Access: Closed

Cite This

Javier Civera, Andrew J. Davison, J. M. M. Montiel (2008). Inverse Depth Parametrization for Monocular SLAM. IEEE Transactions on Robotics, 24(5), 932-945. https://doi.org/10.1109/tro.2008.2003276

Identifiers

DOI: 10.1109/tro.2008.2003276