cs papers | Gist.Science

OrdinalBench: A Benchmark Dataset for Diagnosing Generalization Limits in Ordinal Number Understanding of Vision-Language Models

The paper introduces OrdinalBench, a comprehensive benchmark dataset and evaluation framework designed to diagnose and expose the significant generalization limitations of Vision-Language Models in understanding ordinal numbers and performing sequential reasoning tasks involving large indices and complex paths.

Yusuke Tozaki, Hisashi Miyamori2026-03-10💻 cs

SGI: Structured 2D Gaussians for Efficient and Compact Large Image Representation

The paper proposes Structured Gaussian Image (SGI), a framework that represents high-resolution images using multi-scale, seed-based structured 2D Gaussians generated by lightweight MLPs, achieving significant compression and faster convergence compared to existing unstructured 2D Gaussian methods while maintaining high image fidelity.

Zixuan Pan, Kaiyuan Tang, Jun Xia, Yifan Qin, Lin Gu, Chaoli Wang, Jianxu Chen, Yiyu Shi2026-03-10💻 cs

4DRC-OCC: Robust Semantic Occupancy Prediction Through Fusion of 4D Radar and Camera

This paper introduces 4DRC-OCC, the first framework to fuse 4D radar and camera data for robust 3D semantic occupancy prediction, leveraging their complementary strengths to overcome adverse weather and lighting challenges while utilizing a newly created automatically labeled dataset to reduce annotation costs.

David Ninfa, Andras Palffy, Holger Caesar2026-03-10💻 cs

A Robust Antenna Provides Tactile Feedback in a Multi-legged Robot

This paper presents a multi-legged robot equipped with biomimetic, gradient-compliant tactile antennae that enable robust navigation and recovery in confined, obstacle-rich environments by mapping antenna deformation to collision states for real-time steering without relying on global environmental information or vision.

Zhaochen J. Xu, Juntao He, Delfin Aydan, Malaika Taylor, Tianyu Wang, Jianfeng Lin, Wesley Dyer, Daniel I. Goldman2026-03-10💻 cs

Inverse Resistive Force Theory (I-RFT): Learning granular properties through robot-terrain physical interactions

This paper introduces Inverse Resistive Force Theory (I-RFT), a physics-informed machine learning framework that enables robots to accurately estimate granular terrain properties from proprioceptive contact forces under arbitrary gait trajectories, thereby facilitating data-efficient environmental characterization and adaptive locomotion strategies.

Shipeng Liu, Feng Xue, Yifeng Zhang, Tarunika Ponnusamy, Feifei Qian2026-03-10💻 cs

MWM: Mobile World Models for Action-Conditioned Consistent Prediction

This paper introduces MWM, a mobile world model that enhances action-conditioned rollout consistency and inference efficiency for image-goal navigation through a novel two-stage training framework featuring Action-Conditioned Consistency post-training and Inference-Consistent State Distillation.

Han Yan, Zishang Xiang, Zeyu Zhang, Hao Tang2026-03-10💻 cs

Preference-Conditioned Reinforcement Learning for Space-Time Efficient Online 3D Bin Packing

The paper introduces STEP, a preference-conditioned reinforcement learning framework that optimizes robotic 3D bin packing by explicitly balancing spatial efficiency against operational time, achieving a 44% reduction in execution time without compromising packing density.

Nikita Sarawgi, Omey M. Manyar, Fan Wang, Thinh H. Nguyen, Daniel Seita, Satyandra K. Gupta2026-03-10💻 cs

Which Vertical Graphs are Non VPHT Reconstructible?

This paper investigates the non-injectivity of the verbose persistent homology transform (VPHT) for graphs with collinear vertices, identifying necessary and sufficient conditions for their non-reconstructibility to advance a complete classification of such cases.

Jette Gutzeit, Kalani Kistler, Tim Ophelders, Anna Schenfisch2026-03-10💻 cs

Temperature-Aware Scheduling of LLM Inference in Large-Scale Geo-Distributed Edge Data Centers with Distributed Optimization

This paper proposes a temperature-aware, distributed optimization approach using the alternating direction method of multipliers to co-optimize energy, carbon, water, and latency costs for LLM inference across geo-distributed edge data centers in Australia, leveraging ambient temperature diversity to enhance sustainability and efficiency.

Arash Khalatbarisoltani, Amin Mahmoudi, Jie Han, Muhammad Saeed, Wenxue Liu, Jinwen Li, Solmaz Kahourzade, Amirmehdi Yazdani, Xiaosong Hu2026-03-10💻 cs

Governance of AI-Generated Content: A Case Study on Social Media Platforms

This paper examines the governance of AI-generated content across 40 social media platforms, finding that while most focus on moderation and disclosure, few address ownership and monetization, prompting a call for more comprehensive policies and user education.

Lan Gao, Abani Ahmed, Oscar Chen, Margaux Reyl, Zayna Cheema, Nick Feamster, Chenhao Tan, Kurt Thomas, Marshini Chetty2026-03-10💻 cs

HybridStitch: Pixel and Timestep Level Model Stitching for Diffusion Acceleration

HybridStitch is a novel Text-to-Image generation paradigm that accelerates diffusion models by treating generation as an editing process, dynamically splitting the image into easy and complex regions to leverage a small model for coarse sketching and a large model for targeted refinement, thereby achieving a 1.83× speedup on Stable Diffusion 3.

Desen Sun, Jason Hon, Jintao Zhang, Sihang Liu2026-03-10💻 cs

Tracking Phenological Status and Ecological Interactions in a Hawaiian Cloud Forest Understory using Low-Cost Camera Traps and Visual Foundation Models

This study demonstrates that low-cost, animal-triggered camera traps combined with foundation vision models can effectively monitor fine-grained plant phenology and flora-faunal interactions in a Hawaiian cloud forest, revealing ecological trends that traditional sampling methods often miss.

Luke Meyers, Anirudh Potlapally, Yuyan Chen, Mike Long, Tanya Berger-Wolf, Hari Subramoni, Remi Megret, Daniel Rubenstein2026-03-10💻 cs

A Curved Monopole Antenna for HF Radar with Enhanced Gain and Bandwidth

This paper presents a curved monopole antenna design optimized for HF skywave radar that, through parametric analysis of curvature and straight-section length, achieves significantly enhanced gain and bandwidth compared to conventional monopoles, and demonstrates further performance improvements when scaled into a 12-element linear array.

Masoud Salmani Arani, Reza Shahidi, Lihong Zhang2026-03-10💻 cs

Broken Access: On the Challenges of Screen Reader Assisted Two-Factor and Passwordless Authentication

This paper introduces the AWARE evaluation framework to systematically analyze screen reader-assisted authentication, revealing that current two-factor and passwordless methods contain significant accessibility flaws that expose blind and visually impaired users to various security vulnerabilities.

Md Mojibur Rahman Redoy Akanda (Texas A&M University), Ahmed Tanvir Mahdad (Texas A&M University), Nitesh Saxena (Texas A&M University)2026-03-10💻 cs

Uncertainty Mitigation and Intent Inference: A Dual-Mode Human-Machine Joint Planning System

This paper proposes a dual-mode human-robot joint planning system that combines an LLM-assisted active elicitation mechanism with real-time intent inference to effectively mitigate task-relevant knowledge gaps and latent human intent, significantly reducing interaction costs and execution time in open-world environments.

Zeyu Fang, Yuxin Lin, Cheng Liu, Beomyeol Yu, Zeyuan Yang, Rongqian Chen, Taeyoung Lee, Mahdi Imani, Tian Lan2026-03-10💻 cs

Leveraging Quantum Annealing for Large-Scale Household Energy Scheduling with Hydrogen Storage

This paper proposes a hierarchical quantum annealing-based model predictive control framework for large-scale household energy scheduling with hydrogen storage, demonstrating its superior scalability and effectiveness in solving complex optimization problems as the number of connected households increases compared to traditional methods.

Arash Khalatbarisoltani, Amin Mahmoudi, Jie Han, Muhammad Saeed, Wenxue Liu, Jinwen Li, Solmaz Kahourzade, Amirmehdi Yazdani, Xiaosong Hu2026-03-10💻 cs

Reasoning Knowledge-Gap in Drone Planning via LLM-based Active Elicitation

This paper introduces MINT, a novel framework that enhances human-AI drone collaboration by using large language models to actively elicit minimal, targeted information from operators to resolve environmental uncertainties, thereby significantly improving task success rates while reducing the need for frequent human intervention.

Zeyu Fang, Beomyeol Yu, Cheng Liu, Zeyuan Yang, Rongqian Chen, Yuxin Lin, Mahdi Imani, Tian Lan2026-03-10💻 cs

Physics-infused Learning for Aerial Manipulator in Winds and Near-Wall Environments

This paper presents a unified control framework for aerial manipulators that integrates a physics-based blade-element model with a learning-based residual force estimator and online rotor-speed adaptation to achieve robust trajectory tracking and wall-contact operations in complex wind and near-wall environments.

Yiming Zhang, Junyi Geng2026-03-10💻 cs

A Novel Phase-Noise Module for the QUCS Circuit Simulator. Part II : Noise Analysis

This paper presents the implementation of a novel, rigorous time-domain phase-noise analysis module in the open-source QUCS simulator, featuring new closed-form expressions for amplitude and phase-amplitude correlations that surpass existing empirical models and commercial EDA capabilities in predicting the performance of noise-perturbed autonomous circuits.

Torsten Djurhuus, Viktor Krozer2026-03-10💻 cs

GazeShift: Unsupervised Gaze Estimation and Dataset for VR

This paper introduces VRGaze, the first large-scale off-axis gaze estimation dataset for VR, and GazeShift, an unsupervised, attention-guided framework that achieves real-time, label-efficient gaze tracking with high accuracy on both the new dataset and standard benchmarks.

Gil Shapira, Ishay Goldin, Evgeny Artyomov, Donghoon Kim, Yosi Keller, Niv Zehngut2026-03-10💻 cs

← Previous Next →