Enhancing Multi-Agent Systems via Reinforcement Learning with LLM-Based Planner and Graph-Based Policy | IEEE Conference Publication | IEEE Xplore