X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning | IEEE Conference Publication | IEEE Xplore