APTMoE: Affinity-Aware Pipeline Tuning for MoE Models on Bandwidth-Constrained GPU Nodes | IEEE Conference Publication | IEEE Xplore