Q-learning based control algorithm with dynamic combination of peak shaving and self-consumption optimization for industrial battery storage systems | VDE Conference Publication | IEEE Xplore