Welcome to Gist.Science

Research papers,
explained for humans.

We read the latest papers from arXiv, bioRxiv, and medRxiv and produce easy-to-understand explanations, key takeaways, and technical summaries — in ten languages.

91,124 papers explained across 10 languages·Last paper added 8h ago
📄We read the full paperNot just the abstract — every word
🧠We simplify itAnalogies, metaphors, plain language
🌎In 10 languagesNatively generated, not machine translated

For researchers: keep up with your field

Too many papers, too little time. Get daily digests of the most novel papers matching your exact research interests — with technical summaries in your language.

Try Digest →

EVE: A Generator-Verifier System for Generative Policies

The paper introduces EVE, a modular, training-free framework that enhances the test-time performance of frozen generative robotic policies by employing zero-shot VLM-based verifiers to propose action refinements and a classifier-guided incorporator to fuse this feedback, thereby improving success rates across diverse tasks without additional finetuning.

Yusuf Ali, Gryphon Patlin, Karthik Kothuri, Jeremiah Coholich, Muhammad Zubair Irshad, Wuwei Liang, Zsolt Kira2026-06-05💻 cs

On the complexity of computing Strahler numbers

This paper establishes that computing the Strahler number of a binary tree is complete for uniform NC1\mathsf{NC}^1 when the tree is given as a term, while determining variants for pointer structures, DAGs, and tree straight-line programs, and proving that deciding if a context-free grammar produces a derivation tree with a Strahler number of at least kk is P\mathsf{P}-complete (or PSPACE\mathsf{PSPACE}-complete for acyclic trees).

Moses Ganardi, Markus Lohrey2026-06-05💻 cs

Training One Model to Master Cross-Level Agentic Actions via Reinforcement Learning

The paper introduces CrossHA, a unified agentic model trained via a novel pipeline combining supervised fine-tuning and Multi-Turn Group Relative Policy Optimization to autonomously select and switch between heterogeneous action spaces, thereby achieving state-of-the-art performance and adaptability in dynamic, long-horizon tasks within the Minecraft environment.

Kaichen He, Zihao Wang, Muyao Li, Anji Liu, Yitao Liang2026-06-05💻 cs

Facial-R1: Aligning Reasoning and Recognition for Facial Emotion Analysis

The paper introduces Facial-R1, a three-stage alignment framework that combines instruction fine-tuning, reinforcement learning, and data synthesis to overcome hallucination and misalignment in Vision-Language Models, achieving state-of-the-art performance in Facial Emotion Analysis through explainable, fine-grained reasoning.

Jiulong Wu, Yucheng Shen, Lingyong Yan, Haixin Sun, Deguo Xia, Jizhou Huang, Min Cao2026-06-05💻 cs

VOLD: Reasoning Transfer from LLMs to Vision-Language Models via On-Policy Distillation

The paper proposes VOLD, a framework that enhances vision-language model reasoning by combining Group Relative Policy Optimization with on-policy distillation from text-only teacher models, demonstrating that initial supervised fine-tuning alignment is critical for effective knowledge transfer and achieving state-of-the-art performance on complex reasoning benchmarks.

Walid Bousselham, Hilde Kuehne, Cordelia Schmid2026-06-05💻 cs

Browse by category