cs.OS papers | Gist.Science

This collection covers operating systems research, from sandboxing and isolation to scheduling, memory management, and the systems software that serves AI workloads. These studies examine how kernels, runtimes, and hardware interact, often bridging the gap between formal analysis and deployed systems such as secure containers or LLM inference engines.

Every new preprint in this category arrives directly from arXiv, and Gist.Science processes each submission as it arrives to make the findings accessible to everyone. We provide both plain-language overviews for general readers and detailed technical summaries for specialists, so that new results in this fast-moving field are easy to understand and verify. Below are the latest papers in this category.

RTP-LLM: High-Performance Alibaba LLM Inference Engine

RTP-LLM is a high-performance, open-source inference engine deployed at Alibaba Group that achieves higher throughput and lower latency than vLLM and SGLang through integrated optimizations such as Prefill-Decode Disaggregation, hierarchical KV cache management, and modular speculative decoding.

Boyu Tan, Jiarui Guo, Zongwei Lv, Hanbo Sun, Tong Yang, Kan Liu, Xinfei Shi, Zetao Hu, Yaxin Yu, Chi Zhang, Jianning Zhang, Xi Yang, Wei Zhang, Bo Cai, Silu Zhou, Xiyu Wang, Na He, Yinghao Yu, Wending (…) | 2026-05-29 | 💻 cs

Sandlock: Confining AI Agent Code with Unprivileged Linux Primitives

Sandlock is a lightweight, unprivileged Linux sandbox that isolates AI agents by compiling static security policies into kernel-enforced rules while delegating runtime decisions to a narrow supervisor, thereby achieving strong confinement without the overhead of containers, microVMs, or root privileges.

Cong Wang, Yusheng Zheng | 2026-05-27 | 💻 cs

VLCs: Managing Parallelism with Virtualized Libraries

This paper introduces Virtual Library Contexts (VLCs), a process-level mechanism that encapsulates software libraries and their resource allocations to enable safe, high-performance parallel execution of non-composable or thread-unsafe libraries without requiring modifications to the library code or operating system.

Yineng Yan, William Ruys, Hochan Lee, Ian Henriksen, Arthur Peters, Sean Stephens, Bozhi You, Henrique Fingler, Martin Burtscher, Milos Gligoric, Keshav Pingali, Mattan Erez, George Biros, Christopher (…) | 2026-05-26 | 💻 cs

A Per-Access Upper Bound for Shared-Resource Interference in Direct-Mapped Multicore Architectures

This paper presents a formal, per-access analytical proof establishing that the maximum credible interference stall for a task in a direct-mapped multicore processor with specific architectural invariants is strictly bounded by (N-1)Lmem, thereby providing a traceable method for certifying airborne software under DO-178C/CAST-32A standards.

Felipe T. Pedroni | 2026-05-26 | 💻 cs
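The (N-1)Lmem bound quoted in the summary above can be made concrete with a small calculation. The sketch below assumes that N is the number of cores sharing the resource and that Lmem is the worst-case latency of one memory access in cycles; the summary does not define either symbol, and the function name and sample values are hypothetical.

```python
def per_access_stall_bound(n_cores: int, l_mem: int) -> int:
    """Upper bound (N - 1) * Lmem on the stall of a single access.

    Assumption: n_cores plays the role of N and l_mem the role of Lmem
    (in cycles). Neither reading is confirmed by the paper summary.
    """
    if n_cores < 1:
        raise ValueError("need at least one core")
    if l_mem < 0:
        raise ValueError("latency cannot be negative")
    return (n_cores - 1) * l_mem


# Hypothetical platform: 4 cores, 50-cycle worst-case memory access.
bound = per_access_stall_bound(n_cores=4, l_mem=50)
print(bound)  # 150
```

Under these assumptions a single-core configuration gives a bound of zero, since no other core is present to contend for the resource.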

DeltaBox: Scaling Stateful AI Agents with Millisecond-Level Sandbox Checkpoint/Rollback

This paper introduces DeltaBox, a novel sandbox system leveraging the OS-level abstractions DeltaFS and DeltaCR to achieve millisecond-level checkpoint and rollback by duplicating only state changes rather than full system states, thereby enabling high-frequency exploration for LLM-powered AI agents.

Yunpeng Dong, Jingkai He, Yuze Hou, Dong Du, Zhonghu Xu, Si Yu, Yubin Xia, Haibo Chen | 2026-05-22 | 💻 cs
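The idea of checkpointing only state changes, as described in the DeltaBox summary above, can be illustrated with a toy in-memory store. This is a minimal sketch of delta-based checkpoint and rollback in general, not of DeltaFS or DeltaCR; the class `DeltaStore` and all of its methods are hypothetical names introduced for illustration.

```python
class DeltaStore:
    """Key-value state whose checkpoints record only the keys that change."""

    _MISSING = object()  # marks a key that did not exist before a change

    def __init__(self):
        self.state = {}
        self._undo_logs = []  # one undo log per checkpoint, oldest first

    def checkpoint(self) -> int:
        """Open a new checkpoint and return its id. No state is copied."""
        self._undo_logs.append({})
        return len(self._undo_logs) - 1

    def _record(self, key):
        # Save the old value once per checkpoint, on first modification.
        if self._undo_logs:
            log = self._undo_logs[-1]
            if key not in log:
                log[key] = self.state.get(key, self._MISSING)

    def set(self, key, value):
        self._record(key)
        self.state[key] = value

    def delete(self, key):
        self._record(key)
        self.state.pop(key, None)

    def rollback(self, checkpoint_id: int):
        """Undo every change made since the given checkpoint was taken."""
        if not 0 <= checkpoint_id < len(self._undo_logs):
            raise ValueError("unknown checkpoint id")
        while len(self._undo_logs) > checkpoint_id:
            log = self._undo_logs.pop()
            for key, old in log.items():
                if old is self._MISSING:
                    self.state.pop(key, None)
                else:
                    self.state[key] = old
        # Reopen the checkpoint so it can be rolled back to again.
        self._undo_logs.append({})


store = DeltaStore()
store.set("cwd", "/tmp")
cp = store.checkpoint()
store.set("cwd", "/home")
store.set("file", "draft")
store.rollback(cp)
print(store.state)  # {'cwd': '/tmp'}
```

In this sketch the cost of a checkpoint is proportional to the number of keys modified afterwards rather than to the size of the whole state, which is the property the summary attributes to duplicating only state changes.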

THEMIS: Time, Heterogeneity, and Energy Minded Scheduling for Fair Multi-Tenant Use in FPGAs

This paper introduces THEMIS, an enhanced fair scheduling algorithm for multi-tenant FPGAs that improves upon existing methods by incorporating spatiotemporal fairness, energy efficiency, and hardware heterogeneity constraints, resulting in significant gains in fairness and energy-fairness trade-offs.

Emre Karabulut, Arsalan Ali Malik, Amro Awad, Aydin Aysu | 2026-05-21 | 💻 cs

Clove: Object-Level CXL Memory Management in Managed Runtimes

This paper presents Clove, a system that extends managed language runtimes with profile-guided hotness tracking and object relocation policies to enable efficient object-level CXL memory management, significantly outperforming traditional page-based systems by reducing application slowdown by 22–84%.

Sam Son, Zhihong Luo, Wen Zhang, Sylvia Ratnasamy, Scott Shenker | 2026-05-21 | 💻 cs

ParaCell: Paravirtualized Secure Containers with Lightweight Intra-Container Isolation and Intent-Driven Memory Management

ParaCell is a paravirtualized secure container runtime that resolves the isolation-performance trade-off by leveraging MPK-based XGates for lightweight intra-container domain switching and a Pager mechanism for intent-driven, proactive memory management, thereby significantly reducing latency and memory overhead in both traditional and agentic workloads.

Yiyang Wu, Xunjie Wang, Jinyu Gu, Haibo Chen | 2026-05-21 | 💻 cs

Skim: Speculative Execution for Fast and Efficient Web Agents

Skim is a speculative execution framework that significantly reduces the cost and latency of web agents by leveraging predictable website structures to bypass heavyweight inference components for most queries, while using a lightweight verifier to ensure accuracy and seamlessly cascade rare failures to full agents.

Mike Wong, Kevin Hsieh, Suman Nath, Ravi Netravali | 2026-05-20 | 🤖 cs.AI
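The pattern in the Skim summary above (a cheap speculative path, a lightweight verifier, and a fallback to the full agent) can be sketched generically. This is an illustration of the control flow only, not Skim's implementation; `speculative_answer`, `fast_path`, `verifier`, `full_agent`, and the sample data are all hypothetical stand-ins.

```python
from typing import Callable, Optional, Tuple


def speculative_answer(
    query: str,
    fast_path: Callable[[str], Optional[str]],
    verifier: Callable[[str, str], bool],
    full_agent: Callable[[str], str],
) -> Tuple[str, str]:
    """Return (answer, route), where route is 'fast' or 'full'."""
    candidate = fast_path(query)
    # Accept the cheap answer only if the verifier approves it.
    if candidate is not None and verifier(query, candidate):
        return candidate, "fast"
    # Otherwise cascade to the expensive full agent.
    return full_agent(query), "full"


# Hypothetical stand-ins for the three components.
KNOWN_PRICES = {"price of widget": "$5"}


def fast_path(query: str) -> Optional[str]:
    return KNOWN_PRICES.get(query)


def verifier(query: str, answer: str) -> bool:
    return answer.startswith("$")


def full_agent(query: str) -> str:
    return "full-agent answer for: " + query


print(speculative_answer("price of widget", fast_path, verifier, full_agent))
# ('$5', 'fast')
print(speculative_answer("price of gadget", fast_path, verifier, full_agent))
# ('full-agent answer for: price of gadget', 'full')
```

The design choice worth noting is that the verifier gates every speculative answer, so a wrong guess costs one extra verification rather than a wrong result.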

OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents

This paper introduces OSWorld-Human, a manually annotated benchmark revealing that current computer-use agents suffer from prohibitive latency and inefficiency, primarily due to excessive model calls for planning and reflection, which leads them to take 2.7 to 4.3 times more steps than humans to complete tasks.

Reyna Abhyankar, Qi Qi, Yiying Zhang | 2026-05-19 | 🤖 cs.LG
