I. Introduction and Related Work
Visual-inertial odometry (VIO) that typically combines the inertial data from inertial measurement unit (IMU) and the visual observations from camera to compute the orientation and position of the sensing platform has been becoming popular for GPS-denied navigation applications, ranging from the augmented/virtual reality (AR/VR), autonomous driving to even the unmanned planet exploration. Especially, in order for computational efficiency, a VIO is usually realized by either extended Kalman filter (EKF) or fixed-lag smoothing (FLS) which optimizes over a bounded-size sliding window of recent states by marginalizing the past states periodically, for which a complete review of the recent efforts can be found in [2].