Optimizing Warfarin Dosing Using Contextual Bandit: An Offline Policy Learning and Evaluation Method | IEEE Conference Publication | IEEE Xplore