ST-BERT: Cross-Modal Language Model Pre-Training for End-to-End Spoken Language Understanding | IEEE Conference Publication | IEEE Xplore