Labeling Q-learning embedded with knowledge update in partially observable mdp environments | IEEE Conference Publication | IEEE Xplore