Multi-label Instance-level Generalised Visual Grounding in Agriculture
This paper introduces gRef-CW, the first benchmark dataset for generalised visual grounding in agriculture that includes negative expressions, and proposes Weed-VG, a modular framework designed to overcome the domain gap and effectively localise crop and weed instances under challenging field conditions.