Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding | IEEE Conference Publication