Joint Resource Allocation for Time-Varying Underwater Acoustic Communication System: A Self-Reflection Adversarial Bandit Approach | IEEE Journals & Magazine | IEEE Xplore