ReasonXL: Shifting LLM Reasoning Language Without Sacrificing Performance
This paper introduces ReasonXL, a large-scale multilingual reasoning corpus, and demonstrates that a two-stage fine-tuning and reinforcement learning pipeline can successfully adapt LLMs to reason in non-English languages without sacrificing performance, while revealing that language identity is determined by early model layers and adaptation is concentrated in upper layers.