Conferences >2017 IEEE Global Conference o...

RGB-D camera pose estimation using deep neural network

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

This paper presents a study for RGB-D camera pose estimation using deep learning techniques. The proposed network architecture is composed of two components: the convolut...Show More

Metadata

Abstract:

This paper presents a study for RGB-D camera pose estimation using deep learning techniques. The proposed network architecture is composed of two components: the convolution neural network (CNN) for exploiting the vision information, and the Long Short-Term Memory (LSTM) block for incorporating the temporal information. The CNN, more precisely a RGB-D variant of GoogLeNet, functionalizes as a feature-oriented camera pose estimator, while the LSTM works as a temporal filter to model the pose transition. A modified loss function is also proposed to help regulate the convergence of the pose parameters. Experimental results show that the combination of CNN and LSTM can achieve a higher pose estimation accuracy, while the pipeline structure defined in the network can also provide flexibility for handling different scenarios.

Published in: 2017 IEEE Global Conference on Signal and Information Processing (GlobalSIP)

Date of Conference: 14-16 November 2017

Date Added to IEEE Xplore: 08 March 2018

ISBN Information:

DOI: 10.1109/GlobalSIP.2017.8308674

Conference Location: Montreal, QC, Canada

Contents

References is not available for this document.

RGB-D camera pose estimation using deep neural network

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

RGB-D camera pose estimation using deep neural network

Alerts

Abstract:

Metadata

Abstract:

Authors

Figures

References

Citations

Keywords

Metrics

References

IEEE Account

Purchase Details

Profile Information

Need Help?