SSR: Pushing the Limit of Spatial Intelligence with Structured Scene Reasoning
The paper introduces SSR, a 7B-parameter framework for spatial intelligence. SSR integrates 2D and 3D representations through lightweight alignment and a novel scene graph generation pipeline, which the paper reports yields state-of-the-art results and precise geometric reasoning without costly large-scale pre-training.
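
To make the idea of a scene graph concrete, the sketch below shows one simple way such a structure could support geometric queries. It is an illustration only and not the paper's pipeline: the names `SceneObject`, `build_scene_graph`, and `nearest_to` are hypothetical, and the assumption that each object is reduced to a 3D centroid with pairwise Euclidean distances as edges is ours, not the summary's.

```python
import math
from dataclasses import dataclass
from itertools import combinations
from typing import List, Tuple


@dataclass(frozen=True)
class SceneObject:
    """Hypothetical node: an object label plus an assumed 3D centroid in metres."""
    name: str
    center: Tuple[float, float, float]


Edge = Tuple[str, str, float]  # (object A, object B, Euclidean distance)


def build_scene_graph(objects: List[SceneObject]) -> List[Edge]:
    """Create one distance-labelled edge for every unordered pair of objects."""
    return [
        (a.name, b.name, math.dist(a.center, b.center))
        for a, b in combinations(objects, 2)
    ]


def nearest_to(name: str, edges: List[Edge]) -> str:
    """Answer 'which object is closest to `name`?' using only the graph edges."""
    candidates = [
        (dist, b if a == name else a)
        for a, b, dist in edges
        if name in (a, b)
    ]
    return min(candidates)[1]


scene = [
    SceneObject("chair", (0.0, 0.0, 0.0)),
    SceneObject("table", (3.0, 4.0, 0.0)),
    SceneObject("lamp", (1.0, 0.0, 0.0)),
]
graph = build_scene_graph(scene)
closest = nearest_to("chair", graph)  # the lamp is 1 m away, the table 5 m
```

The point of the sketch is that once explicit geometry is stored in a structured graph, a spatial question becomes a lookup over edges rather than something inferred from pixels alone.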