Learning with binary-valued utility using derivative adaptive critic methods | IEEE Conference Publication | IEEE Xplore