Multi-objective reinforcement learning based routing in cognitive radio networks: Walking in a random maze | IEEE Conference Publication | IEEE Xplore