PredTOP: Latency Predictor Utilizing DAG Transformers for Distributed Deep Learning Training with Operator Parallelism | IEEE Conference Publication | IEEE Xplore