Doubly Pessimistic Algorithms for Strictly Safe Off-Policy Optimization | IEEE Conference Publication | IEEE Xplore