Scaling Distributed Deep Learning Workloads beyond the Memory Capacity with KARMA | IEEE Conference Publication | IEEE Xplore