Off-policy reinforcement learning for distributed output synchronization of linear multi-agent systems | IEEE Conference Publication | IEEE Xplore