I. Introduction
Learning classifier systems (LCSs), a promising technology for evolutionary machine learning, seek to solve a learning problem by evolving a group of IF-THEN rules [31], [32]. These rules are not only easily understandable (e.g., as opposed to the neural network based approach) but also general enough for a wide range of learning tasks, such as classification [4], pattern recognition [35], and clustering [49]. Recently, the successful application of LCSs on many real-world problems has triggered increasing research attention [3], [51], [52]. Michigan LCSs in particular, which are the main focus of this paper, have been extensively studied for their remarkable ability of solving various multistep reinforcement learning problems [11], [23], [56], for example robotic control problems [2], [54], chemical reaction control problems [9], and traffic control problems [15].