Better Eyes, Better Thoughts: Why Vision Chain-of-Thought Fails in Medicine
This paper reveals that Chain-of-Thought prompting often underperforms direct answering in medical visual question answering due to a "medical perception bottleneck," and proposes training-free grounding interventions to restore visual accuracy and improve model reasoning.