Scalable, Training-Free Visual Language Robotics: a modular multi-model framework for consumer-grade GPUs | IEEE Conference Publication | IEEE Xplore