Exploring embodiment and dueling bandit learning for preference adaptation in human-robot interaction | IEEE Conference Publication | IEEE Xplore