Perspective Makes Perfect: Prompt-tuning Vision-Language Models for Action Recognition with Diversified Multi-Modal Observation | IEEE Conference Publication | IEEE Xplore