HSDP: Accelerating Large-scale Model Training via Efficient Sharded Data Parallelism | IEEE Conference Publication | IEEE Xplore