: Unlocking LLM Reasoning via Reinforcement Learning with Re-solving
The paper introduces Re, a reinforcement learning framework that enables large language models to dynamically abandon unproductive reasoning paths and restart their solution process, thereby significantly improving reasoning efficiency and performance without requiring preliminary supervised fine-tuning.