Deriving a near-optimal power management policy using model-free reinforcement learning and Bayesian classification | IEEE Conference Publication | IEEE Xplore