LB-MDP-CL: A Reinforcement Learning and Co-Evolutionary Approach for Optimizing Responses in Multi-Step Cyberattacks | IEEE Conference Publication