cs papers | Gist.Science

RIS Control through the Lens of Stochastic Network Calculus: An O-RAN Framework for Delay-Sensitive 6G Applications

This paper proposes DARIO, an O-RAN-compliant framework that leverages a novel Stochastic Network Calculus model to dynamically assign Reconfigurable Intelligent Surfaces (RIS) to users, achieving significant uplink delay reductions for heterogeneous 6G applications by solving a near-optimal nonlinear integer program with low computational overhead.

Oscar Adamuz-Hinojosa, Lanfranco Zanzi, Vincenzo Sciancalepore, Marco Di Renzo, Xavier Costa-Pérez2026-03-10💻 cs

Graph Neural Model Predictive Control for High-Dimensional Systems

This paper presents a real-time control framework that integrates Graph Neural Network-based dynamics models with a GPU-accelerated, structure-exploiting condensing algorithm to enable efficient, high-accuracy Model Predictive Control for high-dimensional systems like soft robots, achieving up to 1,000 nodes at 100 Hz with significant performance gains over baselines.

Patrick Benito Eberhard, Luis Pabon, Daniele Gammelli, Hugo Buurmeijer, Amon Lahr, Mark Leone, Andrea Carron, Marco Pavone2026-03-10💻 cs

3DMedAgent: Unified Perception-to-Understanding for 3D Medical Analysis

The paper introduces 3DMedAgent, a unified agent that leverages a flexible MLLM and long-term structured memory to coordinate heterogeneous tools for decomposing complex 3D CT analysis into tractable 2D-based subtasks, thereby enabling general-purpose 3D medical understanding without 3D-specific fine-tuning.

Ziyue Wang, Linghan Cai, Chang Han Low, Haofeng Liu, Junde Wu, Jingyu Wang, Rui Wang, Lei Song, Jiang Bian, Jingjing Fu, Yueming Jin2026-03-10💻 cs

OVerSeeC: Open-Vocabulary Costmap Generation from Satellite Images and Natural Language

OVerSeeC is a zero-shot modular framework that leverages large language models and open-vocabulary segmentation to generate executable global costmaps from satellite imagery and natural language instructions, enabling autonomous navigation to adapt to novel entities and dynamic mission constraints without requiring fixed ontologies.

Rwik Rana, Jesse Quattrociocchi, Dongmyeong Lee, Christian Ellis, Amanda Adkins, Adam Uccello, Garrett Warnell, Joydeep Biswas2026-03-10💻 cs

On the Energy Cost of Post-Quantum Key Establishment in Wireless Low-Power Personal Area Networks

This paper demonstrates that in wireless low-power Personal Area Networks, the communication energy cost of Post-Quantum Key Exchange often exceeds its computational cost, necessitating coordinated protocol and lower-layer optimizations to achieve efficient quantum-resilient operation.

Tao Liu, Gowri Ramachandra, Raja Jurdak2026-03-10💻 cs

ABD: Default Exception Abduction in Finite First Order Worlds

This paper introduces ABD, a benchmark for default-exception abduction in finite first-order worlds that evaluates ten frontier LLMs on their ability to generate sparse, satisfiability-restoring formulas across three observation regimes, revealing that while models achieve high validity, they struggle with parsimony and exhibit distinct generalization failures.

Serafim Batzoglou2026-03-10✓ Author reviewed ⓘ💻 cs

Open-Vocabulary Domain Generalization in Urban-Scene Segmentation

This paper introduces Open-Vocabulary Domain Generalization in Semantic Segmentation (OVDG-SS), a new setting and benchmark for autonomous driving that addresses both unseen domains and categories, and proposes S2-Corr, a state-space-driven mechanism to refine text-image correlations in Vision-Language Models to achieve robust performance across diverse urban environments.

Dong Zhao, Qi Zang, Nan Pu, Wenjing Li, Nicu Sebe, Zhun Zhong2026-03-10💻 cs

INDUCTION: Finite-Structure Concept Synthesis in First-Order Logic

This paper introduces INDUCTION, a benchmark designed to evaluate the ability of AI models to synthesize compact, generalizable first-order logic formulas that explain target predicates across small finite relational worlds, revealing distinct performance patterns and generalization strategies among recent elite models.

Serafim Batzoglou2026-03-10💻 cs

SKYLIGHT: A Scalable Hundred-Channel 3D Photonic In-Memory Tensor Core Architecture for Real-time AI Inference

This paper presents SKYLIGHT, a scalable 3D photonic in-memory tensor core architecture that leverages co-designed innovations in topology, wavelength routing, and non-volatile weights to achieve energy-efficient, real-time AI inference and local learning, outperforming state-of-the-art GPUs in throughput and power efficiency while maintaining robustness against hardware non-idealities.

Meng Zhang, Ziang Yin, Nicholas Gangi, Alexander Chen, Brett Bamfo, Tianle Xu, Jiaqi Gu, Zhaoran Rena Huang2026-03-10💻 cs

Universal 3D Shape Matching via Coarse-to-Fine Language Guidance

UniMatch is a novel coarse-to-fine framework that establishes dense semantic correspondences between strongly non-isometric, cross-category 3D shapes by leveraging class-agnostic segmentation, multimodal language models for part identification, and a rank-based contrastive learning scheme to overcome the limitations of prior isometry-dependent methods.

Qinfeng Xiao, Guofeng Mei, Bo Yang, Liying Zhang, Jian Zhang, Kit-lun Yick2026-03-10💻 cs

Why iCloud Fails: The Category Mistake of Cloud Synchronization

This paper argues that iCloud's fundamental failure in supporting complex workflows stems from a "Category Mistake" where its POSIX-like filesystem interface falsely projects a linear temporal chain onto a distributed causal graph, a structural error that causes data divergence and corruption but could be resolved by adopting Open Atomic Ethernet's transactional semantics to align protocol behavior with physical reality.

Paul Borrill2026-03-10💻 cs

InfScene-SR: Arbitrary-Size Image Super-Resolution via Iterative Joint-Denoising

InfScene-VF proposes a diffusion-based framework for arbitrary-size image super-resolution that eliminates boundary artifacts and enables memory-efficient, parallelized inference on gigapixel imagery by introducing Variance-Corrected Fusion and Spatially-Decoupled Variance Correction to achieve spatially continuous joint-denoising.

Shoukun Sun, Zhe Wang, Xiang Que, Jiyin Zhang, Xiaogang Ma2026-03-10💻 cs

Object-Scene-Camera Decomposition and Recomposition for Data-Efficient Monocular 3D Object Detection

This paper proposes an online data manipulation scheme that decomposes training images into independent object, scene, and camera components and recomposes them with perturbed poses to generate diverse training data, thereby improving the data efficiency and performance of monocular 3D object detection models across both fully and sparsely supervised settings.

Zhaonian Kuang, Rui Ding, Meng Yang + 2 more2026-03-10💻 cs

Cycle-Consistent Tuning for Layered Image Decomposition

This paper presents a cycle-consistent tuning framework that leverages lightweight LoRA adaptation of pretrained diffusion models to achieve robust, high-fidelity layered image decomposition, specifically for challenging logo-object separation, by enforcing bidirectional reconstruction consistency and iteratively refining performance through a progressive self-improving process.

Zheng Gu, Min Lu, Zhida Sun, Dani Lischinski, Daniel Cohen-Or, Hui Huang2026-03-10💻 cs

See It, Say It, Sorted: An Iterative Training-Free Framework for Visually-Grounded Multimodal Reasoning in LVLMs

This paper proposes "See It, Say It, Sorted," a lightweight, training-free, and plug-and-play framework that mitigates visual hallucination in large vision-language models by iteratively supervising each reasoning step with dynamically extracted visual evidence, thereby significantly improving reasoning accuracy without requiring additional model training.

Yongchang Zhang, Oliver Ma, Tianyi Liu, Guangquan Zhou, Yang Chen2026-03-10💻 cs

ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

This paper introduces ARLArena, a unified framework that systematically analyzes training instability in agentic reinforcement learning to derive SAMPO, a stable optimization method that ensures consistent performance across diverse agentic tasks.

Xiaoxuan Wang, Han Zhang, Haixin Wang, Yidan Shi, Ruoyan Li, Kaiqiao Han, Chenyi Tong, Haoran Deng, Renliang Sun, Alexander Taylor, Yanqiao Zhu, Jason Cong, Yizhou Sun, Wei Wang2026-03-10💻 cs

Tokenizing Semantic Segmentation with RLE

This paper introduces a unified language modeling approach for semantic and panoptic segmentation in images and videos that discretizes masks into run-length encoded tokens, employing novel compression strategies to enable autoregressive generation despite computational constraints.

Abhineet Singh, Justin Rozeboom, Nilanjan Ray2026-03-10💻 cs

EmoOmni: Bridging Emotional Understanding and Expression in Omni-Modal LLMs

This paper introduces EmoOmni, a unified framework that leverages an emotional Chain-of-Thought (E-CoT) to bridge the gap between fine-grained multimodal perception and accurate emotional expression in Omni-LLMs, accompanied by a new dataset and benchmark for systematic evaluation.

Wenjie Tian, Zhixian Zhao, Jingbin Hu, Huakang Chen, Haohe Liu, Binshen Mu, Lei Xie2026-03-10💻 cs

CryoNet.Refine: A One-step Diffusion Model for Rapid Refinement of Structural Models with Cryo-EM Density Map Restraints

CryoNet.Refine is a novel one-step diffusion model that automates and accelerates the refinement of atomic structures against cryo-EM density maps, outperforming traditional tools like Phenix in both model-map correlation and geometric quality while supporting diverse protein and nucleic acid complexes.

Fuyao Huang, Xiaozhu Yu, Kui Xu, Qiangfeng Cliff Zhang2026-03-10💻 cs

Vibe Researching as Wolf Coming: Can AI Agents with Skills Replace or Augment Social Scientists?

This paper argues that AI agents equipped with specialized skills can augment, but not fully replace, social scientists by executing codifiable research tasks autonomously through "vibe researching," while highlighting the enduring necessity of human theoretical originality and tacit knowledge alongside the profession's emerging risks of stratification and pedagogical crisis.

Yongjun Zhang2026-03-10💻 cs

← Previous Next →