Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation | IEEE Conference Publication | IEEE Xplore