Multimodal Multi-View Spectral-Spatial-Temporal Masked Autoencoder for Self-Supervised Emotion Recognition | IEEE Conference Publication | IEEE Xplore