Can we integrate color and depth information for richer captions? | IEEE Conference Publication | IEEE Xplore