A Joint Deep Boltzmann Machine (jDBM) Model for Person Identification Using Mobile Phone Data


Abstract:

We propose an audio-visual person identification approach based on a joint deep Boltzmann machine (jDBM) model. The proposed jDBM model is trained in three steps: 1) learning the unimodal DBM models for the speech and facial image modalities, 2) learning the shared layer parameters using a joint restricted Boltzmann machine (jRBM) model, and 3) fine-tuning the jDBM model after initializing it with the parameters of the unimodal DBMs and the shared layer. The activation probabilities of the shared-layer units are used as the joint features, and a logistic regression classifier is used for the combined speech and facial image recognition. We show that learning the shared layer parameters using a jRBM achieves higher accuracy than greedy layer-wise initialization. The performance of our proposed model is also compared with state-of-the-art support vector machine (SVM), deep belief network (DBN), and deep auto-encoder (DAE) models. In addition, our experimental results show that the joint representations obtained from the proposed jDBM model are robust to noise and missing information. Experiments were carried out on the challenging MOBIO database, which includes audio-visual data captured using mobile phones.
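
As a rough illustration of how shared-layer activation probabilities could serve as joint features, the following minimal sketch computes them from the top hidden layers of two unimodal models. It is not the authors' implementation: the function name joint_features, the layer sizes, and the randomly initialized weights are all hypothetical placeholders, and the sigmoid form assumes a standard binary RBM hidden layer conditioned on both modalities.

```python
import numpy as np


def sigmoid(x):
    """Element-wise logistic function."""
    return 1.0 / (1.0 + np.exp(-x))


def joint_features(h_speech, h_face, W_speech, W_face, b_joint):
    """Activation probabilities of a shared layer (hypothetical helper).

    Assumes a binary RBM-style shared layer whose input is the pair of
    top-layer hidden activations from the speech and face models.
    """
    return sigmoid(h_speech @ W_speech + h_face @ W_face + b_joint)


# Placeholder sizes and random values; a trained model would supply these.
rng = np.random.default_rng(0)
n_samples, n_speech, n_face, n_joint = 4, 6, 5, 3
h_speech = rng.random((n_samples, n_speech))  # speech top-layer activations
h_face = rng.random((n_samples, n_face))      # face top-layer activations
W_speech = 0.01 * rng.standard_normal((n_speech, n_joint))
W_face = 0.01 * rng.standard_normal((n_face, n_joint))
b_joint = np.zeros(n_joint)

# One joint feature vector per sample, each entry a probability in (0, 1).
features = joint_features(h_speech, h_face, W_speech, W_face, b_joint)
print(features.shape)
```

In a pipeline like the one described, a matrix such as features would then be passed to a logistic regression classifier with one class per identity.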
Published in: IEEE Transactions on Multimedia (Volume: 19, Issue: 2, February 2017)
Page(s): 317 - 326
Date of Publication: 05 October 2016
