An MDP Model-Based Reinforcement Learning Approach for Production Station Ramp-Up Optimization: Q-Learning Analysis | IEEE Journals & Magazine | IEEE Xplore