Multi-modal Transformer for Indoor Human Action Recognition | IEEE Conference Publication | IEEE Xplore