Learning Depth for Scene Reconstruction Using an Encoder-Decoder Model

For accurate scene reconstruction based on monocular depth estimation, what should we do?

Abstract:

Depth estimation has received considerable attention and is often applied to visual simultaneous localization and mapping (SLAM) for scene reconstruction. To the best of our knowledge, however, previous monocular depth estimation methods do not provide sufficiently reliable depth for SLAM, because they rarely re-exploit new image features effectively, easily lose local features, and tend to ignore the relative depth relationships among pixels. With inaccurate monocular depth estimates, SLAM still suffers from scale ambiguity. To achieve accurate scene reconstruction based on monocular depth estimation, this paper makes three contributions. (1) We design a depth estimation model (DEM) consisting of a precise encoder that re-exploits new features and a decoder that learns local features effectively. (2) We propose a loss function that uses the depth relationships among pixels to guide the training of the DEM. (3) We design a modular SLAM system comprising the DEM, feature detection, descriptor computation, feature matching, pose prediction, keyframe extraction, loop closure detection, and pose-graph optimization for pixel-level scene reconstruction. Extensive experiments demonstrate that the DEM and the DEM-based SLAM system are effective. (1) On public datasets, our DEM predicts more reliable depth than state-of-the-art methods when the input is RGB images, sparse depth, or a fusion of both. (2) The DEM-based SLAM system achieves accuracy comparable to that of well-known modular SLAM systems.

Published in: IEEE Access (Volume: 8)

Page(s): 89300 - 89317

Date of Publication: 11 May 2020

Electronic ISSN: 2169-3536

DOI: 10.1109/ACCESS.2020.2993494

IEEE Keywords
Index Terms
- Encoder-decoder Model
- Scene Reconstruction
- Loss Function
- Image Features
- Local Features
- State Of The Art
- Public Datasets
- RGB Images
- Feature Matching
- Depth Estimation
- Simultaneous Localization And Mapping
- Loop Closure
- Calculation Of Descriptors
- Pose Prediction
- Pixel Depth
- Monocular Depth Estimation
- Scale Ambiguity
- Root Mean Square Error
- Training Dataset
- Convolutional Neural Network
- Depth Images
- Model Inference
- RGB Depth
- Depth Map
- Input RGB
- Depth Prediction
- KITTI Dataset
- Sparse Sampling
- Monocular Camera
- Error Metrics
