Improving Audiovisual Active Speaker Detection in Egocentric Recordings with the Data-Efficient Image Transformer | IEEE Conference Publication | IEEE Xplore