Image Caption Generation for Dai Ethnic Clothing Based on ViT-B and BertLMHeadModel | IEEE Conference Publication | IEEE Xplore