Indoor Scene Classification Using RGB-D Data: A Vision Transformer and Conditional Random Field Approach | IEEE Conference Publication | IEEE Xplore