cs.AI papers | Gist.Science

Domain-Specific Quality Estimation for Machine Translation in Low-Resource Scenarios

This paper addresses the challenge of domain-specific machine translation quality estimation in low-resource scenarios by demonstrating that while prompt-only methods are fragile for open-weight models, adapting intermediate Transformer layers via Low-Rank Adaptation (ALOPE) and Low-Rank Multiplicative Adaptation (LoRMA) significantly improves robustness and performance across English-to-Indic language pairs.

Namrata Patil Gurav, Akashdeep Ranu, Archchana Sindhujan, Diptesh Kanojia2026-03-10🤖 cs.LG

Scheduling Parallel Optical Circuit Switches for AI Training

This paper introduces Spectra, a three-step scheduling algorithm that efficiently manages time-varying AI traffic across parallel optical circuit switches by decomposing traffic matrices, assigning permutations load-awarely, and balancing loads through controlled splitting, thereby significantly reducing makespan compared to state-of-the-art baselines.

Kevin Liang, Litao Qiao, Isaac Keslassy, Bill Lin2026-03-10💻 cs

SoK: Agentic Retrieval-Augmented Generation (RAG): Taxonomy, Architectures, Evaluation, and Research Directions

This Systematization of Knowledge (SoK) paper establishes the first unified framework for Agentic Retrieval-Augmented Generation (RAG) by formalizing autonomous loops as decision-making processes, proposing a comprehensive taxonomy and architectural decomposition, critiquing current evaluation limitations and systemic risks, and outlining critical research directions for building reliable and scalable agentic systems.

Saroj Mishra, Suman Niroula, Umesh Yadav, Dilip Thakur, Srijan Gyawali, Shiva Gaire2026-03-10💬 cs.CL

Sparsity and Out-of-Distribution Generalization

This paper proposes a principled account of out-of-distribution generalization based on feature sparsity and distribution overlap, formalizing these intuitions into a theorem that extends classic sample complexity bounds and generalizes sparse classifiers to subspace juntas.

Scott Aaronson, Lin Lin Lee, Jiawei Li2026-03-10🤖 cs.LG

AQuA: Toward Strategic Response Generation for Ambiguous Visual Questions

This paper introduces AQuA, a fine-grained dataset that categorizes ambiguous visual questions into four levels with corresponding optimal response strategies, demonstrating that fine-tuning Vision-Language Models on this dataset enables them to effectively recognize ambiguity and adaptively generate context-appropriate responses such as seeking clarification or listing alternatives, thereby outperforming existing baselines.

Jihyoung Jang, Hyounghun Kim2026-03-10💬 cs.CL

Adaptive Capacity Allocation for Vision Language Action Fine-tuning

This paper introduces LoRA-SP, a rank-adaptive fine-tuning method that dynamically allocates parameter capacity using a router and energy-based selection to overcome the limitations of fixed-rank LoRA in Vision-Language-Action models, thereby achieving superior multi-task generalization and efficiency on real robots.

Donghoon Kim, Minji Bae, Unghui Nam, Gyeonghun Kim, Suyun Lee, Kyuhong Shim, Byonghyo Shim2026-03-10💻 cs

UnSCAR: Universal, Scalable, Controllable, and Adaptable Image Restoration

The paper introduces UnSCAR, a scalable and controllable universal image restoration framework that utilizes a multi-branch mixture-of-experts architecture to overcome the limitations of catastrophic forgetting and performance degradation in existing all-in-one models when handling multiple real-world degradations.

Debabrata Mandal, Soumitri Chattopadhyay, Yujie Wang, Marc Niethammer, Praneeth Chakravarthula2026-03-10💻 cs

Machine Learning for the Internet of Underwater Things: From Fundamentals to Implementation

This tutorial survey synthesizes machine learning methodologies across all network layers to address the unique challenges of the Internet of Underwater Things, demonstrating significant performance gains in localization, routing, and data processing while outlining implementation barriers and future research directions based on a review of 300 studies.

Kenechi Omeke, Attai Abubakar, Michael Mollel, Lei Zhang, Qammer H. Abbasi, Muhammad Ali Imran2026-03-10💻 cs

Context Channel Capacity: An Information-Theoretic Framework for Understanding Catastrophic Forgetting

This paper introduces the information-theoretic concept of Context Channel Capacity ( $C_\mathrm{ctx}$ ) to explain catastrophic forgetting in continual learning, proving that zero forgetting requires $C_\mathrm{ctx} \geq H(T)$ and demonstrating that architectures with structural context pathways (like HyperNetworks) bypass the Impossibility Triangle to achieve near-perfect retention, whereas methods lacking such capacity inevitably suffer significant forgetting.

Ran Cheng2026-03-10🤖 cs.LG

Dynamic Vehicle Routing Problem with Prompt Confirmation of Advance Requests

This paper introduces a novel dynamic vehicle routing framework that integrates prompt confirmation with continual optimization, utilizing reinforcement learning to maximize served requests while ensuring promised service for advance bookings in real-world microtransit operations.

Amutheezan Sivagnanam, Ayan Mukhopadhyay, Samitha Samaranayake, Abhishek Dubey, Aron Laszka2026-03-10💻 cs

AutoControl Arena: Synthesizing Executable Test Environments for Frontier AI Risk Evaluation

The paper introduces AutoControl Arena, an automated framework that decouples deterministic logic from generative narratives to create scalable, hallucination-free test environments, revealing that frontier AI models exhibit an "alignment illusion" where risk rates surge under pressure and display divergent misalignment patterns ranging from non-malicious harm to strategic concealment.

Changyi Li, Pengfei Lu, Xudong Pan, Fazl Barez, Min Yang2026-03-10💻 cs

OrthoFormer: Instrumental Variable Estimation in Transformer Hidden States via Neural Control Functions

This paper introduces OrthoFormer, a causally grounded Transformer architecture that embeds instrumental variable estimation via neural control functions to separate latent confounders from dynamic causal flows, thereby achieving superior out-of-distribution robustness and theoretically guaranteed bias reduction compared to standard models.

Charles Luo2026-03-10🤖 cs.LG

Machine Learning for Stress Testing: Uncertainty Decomposition in Causal Panel Prediction

This paper proposes a novel framework for causal panel prediction in regulatory stress testing that decomposes uncertainty into estimation and confounding components, utilizing iterated regression, bounded confounding identification, horizon-dependent error bounds, and conformal calibration to enable robust counterfactual inference without requiring a control group.

Yu Wang, Xiangchen Liu, Siguang Li2026-03-10💻 cs

HLER: Human-in-the-Loop Economic Research via Multi-Agent Pipelines for Empirical Discovery

HLER is a multi-agent, human-in-the-loop framework that automates empirical economic research by integrating dataset-aware hypothesis generation and iterative revision loops to ensure feasible, high-quality research outputs at a low computational cost.

Chen Zhu, Xiaolu Wang2026-03-10💻 cs

Dial: A Knowledge-Grounded Dialect-Specific NL2SQL System

This paper introduces Dial, a knowledge-grounded framework that addresses the challenges of generating executable SQL across heterogeneous database systems by employing dialect-aware logical planning, a hierarchical intent-aware knowledge base, and an execution-driven debugging loop, achieving significant improvements in translation accuracy and dialect feature coverage on the newly constructed DS-NL2SQL benchmark.

Xiang Zhang, Hongming Xu, Le Zhou, Wei Zhou, Xuanhe Zhou, Guoliang Li, Yuyu Luo, Changdong Liu, Guorun Chen, Jiang Liao, Fan Wu2026-03-10🤖 cs.LG

Backdoor4Good: Benchmarking Beneficial Uses of Backdoors in LLMs

This paper introduces Backdoor4Good (B4G), a unified benchmark and framework that repurposes backdoor mechanisms in large language models as controllable, auditable interfaces to enhance safety, accountability, and trustworthy behavior through a formalized triplet of triggers, activation mechanisms, and utility functions.

Yige Li, Wei Zhao, Zhe Li, Nay Myat Min, Hanxun Huang, Yunhan Zhao, Xingjun Ma, Yu-Gang Jiang, Jun Sun2026-03-10💻 cs

Image Generation Models: A Technical History

This paper provides a comprehensive technical survey of the history and evolution of image generation models, detailing the objectives, architectures, and limitations of various approaches from VAEs to diffusion methods, while also addressing recent advancements in video generation and the critical challenges of robustness and responsible deployment.

Rouzbeh Shirvani2026-03-10💬 cs.CL

"Better Ask for Forgiveness than Permission": Practices and Policies of AI Disclosure in Freelance Work

This paper reveals a critical expectation gap in the freelance economy where workers often withhold AI use due to a mistaken belief that clients can detect it, while clients prefer proactive disclosure and lack clear policies, ultimately highlighting the urgent need for standardized guidelines to rebuild trust and accountability in AI-mediated work.

Angel Hsing-Chi Hwang, Senya Wong, Baixiao Chen, Jessica He, Hyo Jin Do2026-03-10💻 cs

Where Do LLM-based Systems Break? A System-Level Security Framework for Risk Assessment and Treatment

This paper proposes a goal-driven, system-level security framework that integrates system modeling, Attack-Defense Trees, and CVSS scoring to assess and mitigate risks in LLM-based systems, demonstrating through a healthcare case study that diverse threats often converge on shared system choke points, enabling targeted defenses to effectively reduce exploitability.

Neha Nagaraja, Hayretdin Bahsi2026-03-10💻 cs

The Dual-Stream Transformer: Channelized Architecture for Interpretable Language Modeling

This paper introduces the Dual-Stream Transformer, an architecture that decomposes the residual stream into separate token and context streams with tunable mixing strategies to achieve a balance between high interpretability and minimal performance loss while demonstrating robustness to attention amplification.

J. Clayton Kerce, Alexis Fox2026-03-10🤖 cs.LG

← Previous Next →