Prediction of actions and places by the time series recognition from images with Multimodal LLM | IEEE Conference Publication | IEEE Xplore