MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning
MemOCR is a multimodal memory agent that enhances long-horizon reasoning under tight context budgets by converting structured rich-text history into a visually compressed image, allowing the agent to prioritize crucial evidence through layout-aware information density while aggressively reducing low-value details.