RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words | IEEE Conference Publication | IEEE Xplore