Multi-agent Q-learning of channel selection in multi-user cognitive radio systems: A two by two case | IEEE Conference Publication | IEEE Xplore