Bridging the Gap: A Fusion of CNN and Transformer Models for Real-Time Object Detection | IEEE Conference Publication | IEEE Xplore