Human-Intention Prediction with Visual-Language Model | IEEE Conference Publication | IEEE Xplore