Parallel Gradient Computation and Synchronization: Enhancing the Efficiency of Distributed Training for LLMs | IEEE Journals & Magazine | IEEE Xplore