Integrating Computer Vision and language model for interactive AI - Robot | IEEE Conference Publication | IEEE Xplore