Physically Ground Commonsense Knowledge for Articulated Object Manipulation with Analytic Concepts
This paper proposes a framework that bridges Multi-modal Large Language Models and physical robot control by introducing "analytic concepts"—procedurally defined mathematical representations—to ground commonsense knowledge for generalized articulated object manipulation.