Time-extended policies in multi-agent reinforcement learning | IEEE Conference Publication | IEEE Xplore