Rapid behavior learning in multi-agent environment based on state value estimation of others | IEEE Conference Publication | IEEE Xplore