Decentralized reinforcement social learning based on cooperative policy exploration in multi-agent systems | IEEE Conference Publication | IEEE Xplore