AToM: Aligning Text-to-Motion Model at Event-Level with GPT-4Vision Reward | IEEE Conference Publication | IEEE Xplore