Learning for multi-robot cooperation in partially observable stochastic environments with macro-actions | IEEE Conference Publication | IEEE Xplore