Abstract:
Recent advances in deep-learning-based accurate optical flow (OF) estimation have sparked interest in building hardware dedicated to it, making OF a real-time commodity f...Show MoreMetadata
Abstract:
Recent advances in deep-learning-based accurate optical flow (OF) estimation have sparked interest in building hardware dedicated to it, making OF a real-time commodity for downstream computer vision tasks. Relative camera pose, and depth estimations (DE) are closely related to OF, but obtaining one from the others is not a trivial task. While previous works have attempted simultaneous learning of OF and DE, or utilized OF-nets only for camera pose prediction and sparse depth map supervision, we propose OF-aided self-supervised DE, that is, pre-computed OF is used as a network input. One of the main challenges in OF-aided DE is preventing the DE network from learning an incorrect OF magnitude prior, which is valid for rigid regions but breaks for scenes with dynamic objects or static camera sequences. We achieve OF-aided DE by instead transforming the input OF into a 3D surface normals space, which provides a well-informed geometrical input while being invariant to the OF magnitude. In combination with a new training strategy, we show that our models with OF-aided DE achieve state-of-the-art (SOTA) results on the KITTI dataset.
Published in: 2023 IEEE International Conference on Visual Communications and Image Processing (VCIP)
Date of Conference: 04-07 December 2023
Date Added to IEEE Xplore: 29 January 2024
ISBN Information: