Mem-T: Densifying Rewards for Long-Horizon Memory Agents
Mem-T introduces an autonomous memory agent trained via the MoT-GRPO framework, which densifies sparse long-horizon rewards through tree-guided backpropagation to achieve superior performance and efficiency compared to existing memory management systems.