XLA-NDP: Efficient Scheduling and Code Generation for Deep Learning Model Training on Near-Data Processing Memory | IEEE Journals & Magazine | IEEE Xplore