Cascade Attention Fusion for Fine-Grained Image Captioning Based on Multi-Layer LSTM | IEEE Conference Publication | IEEE Xplore