cs.AI papers | Gist.Science

PromptDLA: A Domain-aware Prompt Document Layout Analysis Framework with Descriptive Knowledge as a Cue

This paper introduces PromptDLA, a domain-aware framework that leverages descriptive knowledge as cues to customize prompts for integrating domain priors, thereby overcoming the limitations of directly merging diverse datasets and achieving state-of-the-art performance in Document Layout Analysis across multiple benchmarks.

Zirui Zhang, Yaping Zhang, Lu Xiang, Yang Zhao, Feifei Zhai, Yu Zhou, Chengqing ZongWed, 11 Ma🤖 cs.AI

From Flow to One Step: Real-Time Multi-Modal Trajectory Policies via Implicit Maximum Likelihood Estimation-based Distribution Distillation

This paper proposes a real-time multi-modal trajectory policy framework that distills a Conditional Flow Matching expert into a single-step student using Implicit Maximum Likelihood Estimation and a bi-directional Chamfer distance, thereby eliminating the latency of iterative ODE integration while preserving multi-modal action diversity for high-frequency robotic control.

Ju Dong, Liding Zhang, Lei Zhang, Yu Fu, Kaixin Bai, Zoltan-Csaba Marton, Zhenshan Bing, Zhaopeng Chen, Alois Christian Knoll, Jianwei ZhangWed, 11 Ma🤖 cs.AI

Investigating Gender Stereotypes in Large Language Models via Social Determinants of Health

This study investigates gender bias in Large Language Models within French healthcare contexts, demonstrating that these models rely on embedded stereotypes when processing interactions between gender and other social determinants of health, thereby highlighting the need for context-specific assessments that go beyond evaluating individual factors in isolation.

Trung Hieu Ngo, Adrien Bazoge, Solen Quiniou, Pierre-Antoine Gourraud, Emmanuel MorinWed, 11 Ma🤖 cs.AI

Open-World Motion Forecasting

This paper introduces "Open-World Motion Forecasting," an end-to-end class-incremental framework that predicts future trajectories directly from camera images while mitigating catastrophic forgetting through pseudo-labeling with vision-language models and a novel query feature variance-based replay strategy, enabling continual adaptation to evolving object taxonomies in real-world autonomous driving.

Nicolas Schischka, Nikhil Gosala, B Ravi Kiran, Senthil Yogamani, Abhinav ValadaWed, 11 Ma🤖 cs.AI

CERES: A Probabilistic Early Warning System for Acute Food Insecurity

CERES is a novel, open-access, and continuously running probabilistic forecasting system that fuses six global data streams to generate weekly, machine-readable 90-day ahead risk estimates for acute food insecurity across 43 high-risk countries, with a unique commitment to public prospective verification of all predictions.

Tom Danny S. PedersenWed, 11 Ma🤖 cs.AI

Common Sense vs. Morality: The Curious Case of Narrative Focus Bias in LLMs

This paper introduces the CoMoral benchmark to reveal that current Large Language Models exhibit a critical limitation where they prioritize moral reasoning over commonsense understanding, particularly struggling to detect contradictions involving primary characters due to a pervasive narrative focus bias.

Saugata Purkayastha, Pranav Kushare, Pragya Paramita Pal, Sukannya PurkayasthaWed, 11 Ma🤖 cs.AI

AI Act Evaluation Benchmark: An Open, Transparent, and Reproducible Evaluation Dataset for NLP and RAG Systems

This paper introduces an open, transparent, and reproducible dataset and methodology for evaluating NLP and RAG systems on EU AI Act compliance, featuring tasks like risk classification and obligation generation that leverage large language models to address regulatory ambiguities and achieve high performance scores.

Athanasios Davvetas, Michael Papademas, Xenia Ziouvelou, Vangelis KarkaletsisWed, 11 Ma🤖 cs.AI

A Guideline-Aware AI Agent for Zero-Shot Target Volume Auto-Delineation

This paper introduces OncoAgent, a novel guideline-aware AI agent that achieves zero-shot, training-free auto-delineation of clinical target volumes by converting textual clinical guidelines into 3D contours, demonstrating superior adaptability and physician preference over traditional supervised deep learning models.

Yoon Jo Kim, Wonyoung Cho, Jongmin Lee, Han Joo Chae, Hyunki Park, Sang Hoon Seo, Noh Jae Myung, Kyungmi Yang, Dongryul Oh, Jin Sung KimWed, 11 Ma🤖 cs.AI

Variational Routing: A Scalable Bayesian Framework for Calibrated Mixture-of-Experts Transformers

This paper introduces Variational Mixture-of-Experts Routing (VMoER), a scalable Bayesian framework that confines uncertainty quantification to the expert-selection stage of Mixture-of-Experts Transformers, achieving significant improvements in calibration, stability, and out-of-distribution detection with negligible computational overhead.

Albus Yizhuo Li, Matthew WickerWed, 11 Ma🤖 cs.AI

Declarative Scenario-based Testing with RoadLogic

This paper introduces RoadLogic, an open-source framework that bridges declarative OpenSCENARIO specifications and executable simulations by combining Answer Set Programming, motion planning, and specification-based monitoring to automatically generate diverse, realistic, and compliant autonomous vehicle testing scenarios.

Ezio Bartocci, Alessio Gambi, Felix Gigler, Cristinel Mateis, Dejan NičkovicWed, 11 Ma🤖 cs.AI

An Empirical Study and Theoretical Explanation on Task-Level Model-Merging Collapse

This paper identifies and theoretically explains "task-level merging collapse," a phenomenon where incompatible task representations cause catastrophic performance degradation in merged LLMs, demonstrating that representational incompatibility—not parameter-space conflicts—is the primary driver of failure and establishing fundamental limits on task mergeability via rate-distortion theory.

Yuan Cao, Dezhi Ran, Yuzhe Guo, Mengzhou Wu, Simin Chen, Linyi Li, Wei Yang, Tao XieWed, 11 Ma🤖 cs.AI

EvoDriveVLA: Evolving Autonomous Driving Vision-Language-Action Model via Collaborative Perception-Planning Distillation

EvoDriveVLA is a novel Vision-Language-Action model for autonomous driving that overcomes perception degradation and planning instability through a collaborative distillation framework combining self-anchored visual constraints and oracle-guided trajectory optimization to achieve state-of-the-art performance.

Jiajun Cao, Xiaoan Zhang, Xiaobao Wei, Liyuqiu Huang, Wang Zijian, Hanzhen Zhang, Zhengyu Jia, Wei Mao, Hao Wang, Xianming Liu, Shuchang Zhou Liu, Yang Wang, Shanghang ZhangWed, 11 Ma🤖 cs.AI

Telogenesis: Goal Is All U Need

This paper proposes and validates "Telogenesis," an endogenous attention mechanism that generates adaptive priorities from internal epistemic gaps (ignorance, surprise, and staleness) without external rewards, demonstrating superior coverage and detection latency compared to fixed strategies while spontaneously recovering latent environmental volatility structures.

Zhuoran Deng, Yizhi Zhang, Ziyi Zhang, Wan ShenWed, 11 Ma🤖 cs.AI

GenePlan: Evolving Better Generalized PDDL Plans using Large Language Models

GenePlan is a novel framework that leverages large language models to evolve domain-dependent generalized planners in Python, achieving state-of-the-art performance on PDDL benchmarks while significantly outperforming other LLM-based approaches in both accuracy and efficiency.

Andrew Murray, Danial Dervovic, Alberto Pozanco, Michael CashmoreWed, 11 Ma🤖 cs.AI

Vibe-Creation: The Epistemology of Human-AI Emergent Cognition

This paper argues that human-AI interactions generate a distinct "Third Entity" characterized by "vibe-creation" and "asymmetric emergence," fundamentally transforming our understanding of cognition, tacit knowledge, and education beyond traditional metaphors of tool use or collaboration.

Ilya LevinWed, 11 Ma🤖 cs.AI

Temporal-Conditioned Normalizing Flows for Multivariate Time Series Anomaly Detection

This paper introduces temporal-conditioned normalizing flows (tcNF), a novel autoregressive framework that enhances multivariate time series anomaly detection by effectively modeling temporal dependencies and uncertainty to identify low-probability events with improved accuracy and robustness.

David Baumgartner, Helge Langseth, Kenth Engø-Monsen, Heri RamampiaroWed, 11 Ma🤖 cs.AI

Evolving Prompt Adaptation for Vision-Language Models

The paper proposes EvoPrompt, a novel framework that achieves stable, knowledge-preserving adaptation of vision-language models in few-shot scenarios by employing a modality-shared projector and an evolutionary training strategy that decouples prompt updates into directional and magnitude components to prevent catastrophic forgetting.

Enming Zhang, Jiayang Li, Yanru Wu, Zhenyu Liu, Yang LiWed, 11 Ma🤖 cs.AI

Efficiently Aligning Draft Models via Parameter- and Data-Efficient Adaptation

This paper introduces Efficient Draft Adaptation (EDA), a parameter- and data-efficient framework that restores speculative decoding performance on fine-tuned target models through a decoupled architecture, data regeneration strategy, and sample selection mechanism, achieving superior acceptance lengths with significantly reduced training costs compared to full retraining.

Luxi Lin, Zhihang Lin, Zhanpeng Zeng, Yuhao Chen, Qingyu Zhang, Jixiang Luo, Xuelong Li, Rongrong JiWed, 11 Ma🤖 cs.AI

Enhancing Debunking Effectiveness through LLM-based Personality Adaptation

This study proposes and evaluates a novel methodology for enhancing fake news debunking by using Large Language Models to generate personalized messages tailored to Big Five personality traits, demonstrating that such targeted approaches generally increase persuasiveness while highlighting both the potential and ethical implications of automated, personality-driven disinformation correction.

Pietro Dell'Oglio, Alessandro Bondielli, Francesco Marcelloni, Lucia C. PassaroWed, 11 Ma🤖 cs.AI

Compiler-First State Space Duality and Portable $O(1)$ Autoregressive Caching for Inference

This paper demonstrates that Mamba-2's state space duality can be implemented entirely using standard XLA primitives without custom kernels, achieving portable, host-synchronization-free $O(1)$ autoregressive caching with high performance across CPU, NVIDIA GPU, and Google Cloud TPU hardware.

Cosmo SantoniWed, 11 Ma🤖 cs.AI

← Previous Next →

cs.AI