Specificity-aware reinforcement learning for fine-grained open-world classification
This paper proposes SpeciaRL, a specificity-aware reinforcement learning framework that fine-tunes reasoning Large Multimodal Models to achieve an optimal balance between correctness and specificity in open-world fine-grained image classification by employing a dynamic, verifier-based reward signal.