Abstract:
Deep learning methods have proven useful for head pose estimation, but the effect of their depth, type and input resolution based on infrared (IR) images still need to be...Show MoreMetadata
Abstract:
Deep learning methods have proven useful for head pose estimation, but the effect of their depth, type and input resolution based on infrared (IR) images still need to be explored. In this paper, we present a study on in-car head pose estimation on the IR images of the AutoPOSE dataset, where we extract 64 x 64 and 128 x 128 pixel cropped head images. We propose the novel networks Head Orientation Network (HON) and ResNetHG and compare them with state-of-the-art methods like the HPN model from DriveAHead on different input resolutions. In addition, we evaluate multiple depths within our HON and ResNetHG networks and their effect on the accuracy. Our experiments show that higher resolution images lead to lower estimation errors. Furthermore, we show that deep learning methods with fewer layers perform better on head orientation regression based on IR images. Our HON and ResNetHG18 architectures outperform the state-of-the-art on IR images on four different metrics, where we achieve a reduction of the residual error of up to 74%.
Published in: 2020 IEEE Intelligent Vehicles Symposium (IV)
Date of Conference: 19 October 2020 - 13 November 2020
Date Added to IEEE Xplore: 08 January 2021
ISBN Information:
ISSN Information:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Infrared Imaging ,
- Pose Estimation ,
- Head Orientation ,
- Input Resolution ,
- Head Pose ,
- Root Mean Square Error ,
- Deep Network ,
- Deep Neural Network ,
- Validation Set ,
- Computer Vision ,
- Adam Optimizer ,
- Raw Images ,
- Bounding Box ,
- Grayscale Images ,
- RGB Images ,
- Level Of Resolution ,
- Skip Connections ,
- Position Estimation ,
- 3D Information ,
- Cropped Images ,
- Feature-based Approaches ,
- 2D Information ,
- High Level Of Resolution ,
- Center Of Head ,
- Amount Of Layers ,
- Information In The Form ,
- Natural Movement ,
- Image Pixels ,
- Output Of Block
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Infrared Imaging ,
- Pose Estimation ,
- Head Orientation ,
- Input Resolution ,
- Head Pose ,
- Root Mean Square Error ,
- Deep Network ,
- Deep Neural Network ,
- Validation Set ,
- Computer Vision ,
- Adam Optimizer ,
- Raw Images ,
- Bounding Box ,
- Grayscale Images ,
- RGB Images ,
- Level Of Resolution ,
- Skip Connections ,
- Position Estimation ,
- 3D Information ,
- Cropped Images ,
- Feature-based Approaches ,
- 2D Information ,
- High Level Of Resolution ,
- Center Of Head ,
- Amount Of Layers ,
- Information In The Form ,
- Natural Movement ,
- Image Pixels ,
- Output Of Block