Temporal-difference Q-learning in active fault diagnosis | IEEE Conference Publication | IEEE Xplore