Handling different level of unstable reward environment through an estimation of reward distribution in XCS | IEEE Conference Publication | IEEE Xplore