Learning to synthesize faces using voice clips for Cross-Modal biometric matching | IEEE Conference Publication | IEEE Xplore