Models of Visually Grounded Speech Signal Pay Attention to Nouns: A Bilingual Experiment on English and Japanese | IEEE Conference Publication | IEEE Xplore