Dynamic Graph Network with Spatial-Temporal Gated Attention: A Deep Reinforcement Learning Approach for Urban Logistics Optimization | IEEE Conference Publication | IEEE Xplore