Understanding Contexts Inside Robot and Human Manipulation Tasks through Vision-Language Model and Ontology System in Video Streams | IEEE Conference Publication | IEEE Xplore