Automatic Generation of High-Performance RL Environments

This paper introduces a cost-effective, automated recipe combining generic prompts, hierarchical verification, and iterative agent-assisted repair to translate complex reinforcement learning environments into high-performance implementations with zero sim-to-sim gap, achieving massive throughput gains (up to 22,320x) across diverse use cases including game emulation, physics simulation, and card game engines.

Seth Karten, Rahul Dev Appapogu, Chi Jin2026-03-13🤖 cs.LG

IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL

This paper establishes compute-optimal scaling laws for on-policy LLM reinforcement learning by demonstrating that the ideal number of parallel rollouts per problem increases predictably with the compute budget before saturating, driven by solution sharpening on easy tasks and coverage expansion on hard ones, while providing practical allocation rules for batch size and update steps to maximize training efficiency.

Zhoujun Cheng, Yutao Xie, Yuxiao Qu, Amrith Setlur, Shibo Hao, Varad Pimpalkhute, Tongtong Liang, Feng Yao, Zhengzhong Liu, Eric Xing, Virginia Smith, Ruslan Salakhutdinov, Zhiting Hu, Taylor Killian, Aviral Kumar2026-03-13🤖 cs.LG

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

This paper introduces the MADQA benchmark and a novel accuracy-effort evaluation protocol to demonstrate that while multimodal agents can match human accuracy on document-based tasks, they rely on inefficient brute-force search rather than genuine strategic reasoning, failing to close the performance gap to oracle levels.

Łukasz Borchmann, Jordy Van Landeghem, Michał Turski, Shreyansh Padarha, Ryan Othniel Kearns, Adam Mahdi, Niels Rogge, Clémentine Fourrier, Siwei Han, Huaxiu Yao, Artemis Llabrés, Yiming Xu, Dimosthenis Karatzas, Hao Zhang, Anupam Datta2026-03-13💬 cs.CL

Proof-Carrying Materials: Falsifiable Safety Certificates for Machine-Learned Interatomic Potentials

This paper introduces Proof-Carrying Materials (PCM), a rigorous framework combining adversarial falsification, statistical refinement, and formal Lean 4 certification to overcome the high failure rates of single machine-learned interatomic potentials, thereby significantly improving the reliability and discovery yield of high-throughput materials screening.

Abhinaba Basu, Pavan Chakraborty2026-03-13🔬 cond-mat.mtrl-sci

RDNet: Region Proportion-Aware Dynamic Adaptive Salient Object Detection Network in Optical Remote Sensing Images

This paper proposes RDNet, a salient object detection network for optical remote sensing images that leverages a SwinTransformer backbone and three novel modules—Dynamic Adaptive Detail-aware, Frequency-matching Context Enhancement, and Region Proportion-aware Localization—to overcome challenges related to scale variations and global context modeling, thereby achieving superior detection performance compared to state-of-the-art methods.

Bin Wan, Runmin Cong, Xiaofei Zhou, Hao Fang, Yaoqi Sun, Sam Kwong2026-03-13🤖 cs.AI

Portfolio of Solving Strategies in CEGAR-based Object Packing and Scheduling for Sequential 3D Printing

This paper presents Portfolio-CEGAR-SEQ, a parallelized algorithm that leverages modern multi-core CPUs and a portfolio of diverse object arrangement strategies to outperform the original CEGAR-SEQ method in solving the combinatorial challenges of object arrangement and scheduling for sequential 3D printing, often resulting in more efficient use of printing plates.

Pavel Surynek2026-03-13🤖 cs.AI

Security Considerations for Artificial Intelligence Agents

Drawing from Perplexity's operational experience with general-purpose agentic systems, this paper outlines the unique security failure modes introduced by AI agents, maps their primary attack surfaces, proposes a layered defense strategy, and identifies critical research gaps and standards needed to secure multi-agent systems in alignment with NIST risk management principles.

Ninghui Li, Kaiyuan Zhang, Kyle Polley, Jerry Ma2026-03-13🤖 cs.LG

Separable neural architectures as a primitive for unified predictive and generative intelligence

This paper introduces the separable neural architecture (SNA) as a domain-agnostic primitive that unifies predictive and generative intelligence across physics, language, and perception by formalizing a structural inductive bias that factorizes high-dimensional mappings into low-arity components, thereby enabling effective modeling of both chaotic continuous systems and discrete sequences.

Reza T. Batley, Apurba Sarker, Rajib Mostakim, Andrew Klichine, Sourav Saha2026-03-13🤖 cs.LG