Vision and Natural Language for Metadata Extraction from Scientific PDF Documents: A Multimodal Approach | IEEE Conference Publication | IEEE Xplore