Skip to Main Content
In discrete event systems, the supervisor controls events to satisfy the control specifications given by formal languages. However a precise description of the specifications and the discrete event systems is required for constructing the supervisor. So, this paper proposes a method to construct a supervisor based on a reinforcement learning for partially observed discrete event systems. In the proposed method, specifications are given by rewards, and an optimal supervisor is derived by considering rewards for the occurrence of events and disabling events. Moreover learning speed is accelerated by updating plural Q values. It is done by utilizing characteristics of a supervisory control. An efficiency of the proposed method is examined by computer simulation. The proposed method shows a new approach for applying a supervisory control in the case of implicit specifications and uncertain environment.
Systems, Man and Cybernetics, 2003. IEEE International Conference on (Volume:3 )
Date of Conference: 5-8 Oct. 2003