Seeing as Experts Do: A Knowledge-Augmented Agent for Open-Set Fine-Grained Visual Understanding
The paper introduces KFRA, a knowledge-augmented agent that emulates expert analysis through a three-stage, closed-loop reasoning process. The authors report that it improves open-set fine-grained visual understanding and produces interpretable, evidence-driven reasoning, as validated on the newly constructed FGExpertBench.