cs papers | Gist.Science

Cognitive-Flexible Control via Latent Model Reorganization with Predictive Safety Guarantees

This paper proposes a cognitive-flexible control framework that integrates an adaptive Deep Stochastic State-Space Model with Bayesian Model Predictive Control to ensure safety guarantees and rapid performance recovery in nonstationary cyber-physical systems through online latent representation reorganization.

Thanana Nuchkrua, Sudchai Boonto2026-03-10💻 cs

Green-VLA: Staged Vision-Language-Action Model for Generalist Robots

The paper introduces Green-VLA, a five-stage curriculum framework that combines large-scale multimodal pretraining, embodiment-specific adaptation, and reinforcement learning to enable a single generalist policy to robustly control diverse robotic systems, including the Green humanoid, with enhanced safety and long-horizon efficiency.

I. Apanasevich, M. Artemyev, R. Babakyan, P. Fedotova, D. Grankin, E. Kupryashin, A. Misailidi, D. Nerus, A. Nutalapati, G. Sidorov, I. Efremov, M. Gerasyov, D. Pikurov, Y. Senchenko, S. Davidenko, D. Kulikov, M. Sultankin, K. Askarbek, O. Shamanin, D. Statovoy, E. Zalyaev, I. Zorin, A. Letkin, E. Rusakov, A. Silchenko, V. Vorobyov, S. Sobolnikov, A. Postnikov2026-03-10💻 cs

Vulnerability-Amplifying Interaction Loops: a systematic failure mode in AI chatbot mental-health interactions

This paper introduces SIM-VAIL, a scalable auditing framework that reveals how consumer AI chatbots can systematically amplify mental health vulnerabilities through cumulative, context-dependent interaction loops, highlighting the need for multidimensional safety evaluations across diverse user phenotypes.

Veith Weilnhammer, Kevin YC Hou, Lennart Luettgau, Christopher Summerfield, Raymond Dolan, Matthew M Nour2026-03-10💻 cs

AgenticLab: A Real-World Robot Agent Platform that Can See, Think, and Act

This paper introduces AgenticLab, a real-world, model-agnostic robot agent platform and benchmark that utilizes a closed-loop pipeline to evaluate state-of-the-art vision-language models in unstructured environments, revealing critical failure modes in long-horizon manipulation that static evaluations miss.

Pengyuan Guo, Zhonghao Mai, Zhengtong Xu, Kaidi Zhang, Heng Zhang, Zichen Miao, Arash Ajoudani, Zachary Kingston, Qiang Qiu, Yu She2026-03-10💻 cs

Six Times to Spare: Characterizing GPU-Accelerated 5G LDPC Decoding for Edge-RSU Communications

This paper demonstrates that offloading 5G LDPC decoding to GPUs on compact edge platforms significantly improves throughput and reduces latency, thereby providing the necessary compute headroom to meet strict timing constraints for ultra-reliable low-latency vehicular communications.

Ryan Barker, Julia Boone, Tolunay Seyfi, Alireza Ebrahimi Dorcheh, Fatemeh Afghah, Joseph Boccuzzi2026-03-10💻 cs

Extracting Recurring Vulnerabilities from Black-Box LLM-Generated Software

This paper introduces FSTab, a framework that demonstrates how LLM-generated software exhibits predictable, recurring vulnerabilities by enabling black-box attacks based on frontend features and quantifying the consistency of these flaws across different domains and model variations.

Tomer Kordonsky, Maayan Yamin, Noam Benzimra, Amit LeVi, Avi Mendelson2026-03-10💻 cs

LMMRec: LLM-driven Motivation-aware Multimodal Recommendation

This paper introduces LMMRec, a model-agnostic framework that leverages large language models and chain-of-thought prompting to extract fine-grained user and item motivations from heterogeneous text data, effectively aligning them with interaction signals to significantly improve multimodal recommendation performance.

Yicheng Di, Zhanjie Zhang, Yun Wang, Jinren Liu, Jiaqi Yan, Jiyu Wei, Xiangyu Chen, Yuan Liu2026-03-10💻 cs

Assessing Problem-Solving in HR Contexts: A Comparison Between Game-Based and Self-Report Measures

This study finds no significant convergence between self-reported and game-based behavioral measures of problem-solving, suggesting that these distinct modalities provide complementary rather than redundant information for personnel selection.

Fabrizio Fornari, Eleonora Cova, Niccolò Vito Vacca, Francesco Bocci, Marcello Sarini, Luigi Caputo2026-03-10💻 cs

Conditional Diffusion Guidance under Hard Constraint: A Stochastic Analysis Approach

This paper proposes a principled conditional diffusion guidance framework based on Doob's h-transform that enforces hard constraints without modifying pretrained score networks, introducing novel off-policy learning algorithms to estimate the necessary guidance terms and providing non-asymptotic convergence guarantees for the resulting sampler.

Zhengyi Guo, Wenpin Tang, Renyuan Xu2026-03-10💻 cs

Beyond Judgment: Exploring Large Language Models as Non-Judgmental Support for Maternal Mental Health

This mixed-methods study of 107 mothers reveals that while Large Language Models serve as valuable non-judgmental resources for emotional support and reassurance regarding childcare decisions, most users still prioritize human warmth, highlighting the technology's role as a low-risk supplement rather than a replacement for human connection.

Shayla Sharmin, Sadia Afrin Ratna2026-03-10💻 cs

NAAMSE: Framework for Evolutionary Security Evaluation of Agents

This paper introduces NAAMSE, an evolutionary framework that automates and enhances AI agent security evaluation by using a single autonomous agent to iteratively mutate prompts and explore corpora, thereby uncovering adaptive vulnerabilities missed by static benchmarks while ensuring the models maintain benign-use correctness.

Kunal Pai, Parth Shah, Harshil Patel2026-03-10💻 cs

PhysDrape: Learning Explicit Forces and Collision Constraints for Physically Realistic Garment Draping

PhysDrape is a hybrid neural-physical solver that combines a Physics-Informed Graph Neural Network with a differentiable two-stage force and collision projection system to achieve physically realistic garment draping with negligible interpenetration and superior fidelity compared to existing deep learning methods.

Minghai Chen, Mingyuan Liu, Ning Ma, Jianqing Li, Yuxiang Huan2026-03-10💻 cs

LLM4PQC - Accurate and Efficient Synthesis of PQC Cores by Feedback-Driven LLMs

LLM4PQC is a feedback-driven, agentic framework that leverages large language models to automate the refactoring of complex post-quantum cryptography reference codes into synthesizable HLS specifications and RTL, significantly reducing manual effort and accelerating design-space exploration through a hierarchical verification process.

Buddhi Perera, Zeng Wang, Weihua Xiao, Mohammed Nabeel, Ozgur Sinanoglu, Johann Knechtel, Ramesh Karri2026-03-10💻 cs

Move What Matters: Parameter-Efficient Domain Adaptation via Optimal Transport Flow for Collaborative Perception

To address the challenges of parameter-efficient domain adaptation in V2X collaborative perception, the paper proposes FlowAdapt, a framework leveraging optimal transport theory and a progressive knowledge transfer mechanism to filter redundant data and preserve fine-grained semantics, achieving state-of-the-art performance with only 1% trainable parameters.

Zesheng Jia, Jin Wang, Siao Liu, Lingzhi Li, Ziyao Huang, Yunjiang Xu, Jianping Wang2026-03-10💻 cs

SToRM: Supervised Token Reduction for Multi-modal LLMs toward efficient end-to-end autonomous driving

This paper proposes SToRM, a novel framework that employs a lightweight importance predictor, supervised training with pseudo-labels, and an anchor-context merging module to significantly reduce visual token redundancy in multi-modal LLMs for autonomous driving, achieving up to 30x computational savings while maintaining end-to-end performance comparable to using all tokens.

Seo Hyun Kim, Jin Bok Park, Do Yeon Koo, Hogun Park, Il Yong Chun2026-03-10💻 cs

Accelerating Robotic Reinforcement Learning with Agent Guidance

This paper introduces Agent-guided Policy Search (AGPS), a framework that replaces human supervisors with a multimodal agent acting as a semantic world model to provide precise corrective guidance, thereby significantly improving sample efficiency and scalability in robotic reinforcement learning compared to traditional Human-in-the-Loop methods.

Haojun Chen, Zili Zou, Chengdong Ma, Yaoxiang Pu, Haotong Zhang, Yuanpei Chen, Yaodong Yang2026-03-10💻 cs

To Mix or To Merge: Toward Multi-Domain Reinforcement Learning for Large Language Models

This paper introduces M2RL, a comprehensive study comparing mixed multi-task training versus separate training with model merging for multi-domain Reinforcement Learning with Verifiable Rewards (RLVR), revealing that reasoning-intensive domains exhibit synergistic effects with minimal interference and providing mechanistic insights through extensive experiments.

Haoqing Wang, Xiang Long, Ziheng Li, Yilong Xu, Tingguang Li, Yehui Tang2026-03-10💻 cs

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

The paper introduces SkillsBench, a comprehensive benchmark demonstrating that while curated agent skills significantly boost LLM performance across diverse domains—often allowing smaller models to match larger ones—self-generated skills offer no benefit and effects vary widely by task.

Xiangyi Li, Wenbo Chen, Yimin Liu, Shenghan Zheng, Xiaokun Chen, Yifeng He, Yubo Li, Bingran You, Haotian Shen, Jiankai Sun, Shuyi Wang, Binxu Li, Qunhong Zeng, Di Wang, Xuandong Zhao, Yuanli Wang, Roey Ben Chaim, Zonglin Di, Yipeng Gao, Junwei He, Yizhuo He, Liqiang Jing, Luyang Kong, Xin Lan, Jiachen Li, Songlin Li, Yijiang Li, Yueqian Lin, Xinyi Liu, Xuanqing Liu, Haoran Lyu, Ze Ma, Bowei Wang, Runhui Wang, Tianyu Wang, Wengao Ye, Yue Zhang, Hanwen Xing, Yiqi Xue, Steven Dillmann, Han-chung Lee2026-03-10💻 cs

State Feedback Control of State-Delayed LPV Systems using Dynamic IQCs

This paper proposes a novel state-feedback control framework for state-delayed linear parameter-varying (LPV) systems that integrates dynamic integral quadratic constraints (IQCs) with parameter-dependent Lyapunov functions to derive less conservative, convex synthesis conditions for guaranteed closed-loop stability and $\mathcal{L}_2$ -gain performance.

Fen Wu2026-03-10💻 cs

Social Life of Code: Modeling Evolution through Code Embedding and Opinion Dynamics

This paper proposes a novel framework that integrates semantic code embeddings with the Expressed-Private Opinion (EPO) model to quantitatively analyze and visualize the social dynamics, consensus formation, and influence patterns driving software evolution in open-source repositories.

Yulong He, Nikita Verbin, Sergey Kovalchuk2026-03-10💻 cs

← Previous Next →