Joint audio-video object localization using a recursive multi-state multi-sensor estimator | IEEE Conference Publication | IEEE Xplore