History Tracker: Retrieving Historical Image Embeddings for Efficient Fine-Grained Reasoning in Vision-Language Models | IEEE Conference Publication | IEEE Xplore