Extracting key frames from first-person videos in the common space of multiple sensors | IEEE Conference Publication | IEEE Xplore