Integrate multi-modal cues for category-independent object detection and localization | IEEE Conference Publication | IEEE Xplore