cs.CL papers | Gist.Science

Think Before You Lie: How Reasoning Improves Honesty

This paper demonstrates that reasoning consistently enhances honesty in large language models by guiding them through a representational space where deceptive states are metastable and easily disrupted, ultimately nudging the models toward more stable, honest defaults.

Ann Yuan, Asma Ghandeharioun, Carter Blum, Alicia Machado, Jessica Hoffmann, Daphne Ippolito, Martin Wattenberg, Lucas Dixon, Katja FilippovaWed, 11 Ma🤖 cs.AI

Enhancing Debunking Effectiveness through LLM-based Personality Adaptation

This study proposes and evaluates a novel methodology for enhancing fake news debunking by using Large Language Models to generate personalized messages tailored to Big Five personality traits, demonstrating that such targeted approaches generally increase persuasiveness while highlighting both the potential and ethical implications of automated, personality-driven disinformation correction.

Pietro Dell'Oglio, Alessandro Bondielli, Francesco Marcelloni, Lucia C. PassaroWed, 11 Ma🤖 cs.AI

The Reasoning Trap -- Logical Reasoning as a Mechanistic Pathway to Situational Awareness

This paper argues that advancements in logical reasoning for large language models inadvertently create a mechanistic pathway to dangerous situational awareness and strategic deception, necessitating new safety frameworks like the RAISE model to mitigate these emergent risks.

Subramanyam Sahoo, Aman Chadha, Vinija Jain, Divya ChaudharyWed, 11 Ma🤖 cs.AI

From Days to Minutes: An Autonomous AI Agent Achieves Reliable Clinical Triage in Remote Patient Monitoring

The paper introduces Sentinel, an autonomous AI agent that achieves reliable, scalable clinical triage for remote patient monitoring by outperforming individual clinicians in sensitivity and consistency while maintaining a clinically defensible overtriage profile at a negligible cost.

A Consensus-Driven Multi-LLM Pipeline for Missing-Person Investigations

This paper introduces Guardian, a consensus-driven, multi-LLM pipeline enhanced by QLoRA fine-tuning that coordinates specialized models and a consensus engine to perform auditable, structured information extraction for critical missing-person investigations while avoiding unconstrained decision-making.

Joshua Castillo, Ravi MukkamalaWed, 11 Ma🤖 cs.AI

MASEval: Extending Multi-Agent Evaluation from Models to Systems

MASEval introduces a framework-agnostic library that shifts multi-agent evaluation from a model-centric to a system-centric approach, demonstrating through extensive experiments that implementation decisions regarding topology and orchestration impact performance as significantly as model selection.

Cornelius Emde, Alexander Rubinstein, Anmol Goel, Ahmed Heakl, Sangdoo Yun, Seong Joon Oh, Martin GubriWed, 11 Ma🤖 cs.AI

From Word2Vec to Transformers: Text-Derived Composition Embeddings for Filtering Combinatorial Electrocatalysts

This paper demonstrates that a lightweight, label-free screening strategy using Word2Vec-derived composition embeddings often outperforms transformer-based methods in filtering vast combinatorial electrocatalyst libraries by prioritizing candidates based on their similarity to text-derived property concepts like conductivity and dielectricity.

Lei Zhang, Markus StrickerWed, 11 Ma🔬 cond-mat.mtrl-sci

No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models

This paper demonstrates that Contamination Detection via output Distribution (CDD) is largely ineffective for small language models (70M–410M parameters) because it fails to detect verbatim memorization, whereas probability-based methods like perplexity and Min-k% Prob consistently outperform it across various benchmarks.

Omer Sela (Tel Aviv University)Tue, 10 Ma💬 cs.CL

A Geometric Taxonomy of Hallucinations in LLMs

This paper proposes a geometric taxonomy of LLM hallucinations into three distinct types (unfaithfulness, confabulation, and factual error) and introduces corresponding detection metrics, the Semantic Grounding Index and Directional Grounding Index, which effectively identify unfaithful and confabulated outputs while revealing that apparent signals for factual errors in existing benchmarks often stem from stylistic annotation confounds rather than genuine geometric distinctions.

Javier MarínTue, 10 Ma💬 cs.CL

Measuring Complexity at the Requirements Stage: Spectral Metrics as Development Effort Predictors

This paper demonstrates that spectral metrics derived from natural language processing of requirements specifications can predict integration effort with high accuracy (correlations >0.95), offering a validated method to quantify structural complexity at the requirements stage and bridge the gap between architectural analysis and requirements engineering.

Maximilian Vierlboeck, Antonio Pugliese, Roshanak Nilchian, Paul Grogan, Rashika Sugganahalli Natesh BabuTue, 10 Ma💬 cs.CL

RedSage: A Cybersecurity Generalist LLM

The paper introduces RedSage, an open-source, locally deployable cybersecurity LLM trained on a massive curated dataset and agentic augmentation pipeline, which achieves state-of-the-art performance on specialized cybersecurity benchmarks while also improving general reasoning capabilities.

Naufal Suryanto, Muzammal Naseer, Pengfei Li, Syed Talal Wasim, Jinhui Yi, Juergen Gall, Paolo Ceravolo, Ernesto DamianiTue, 10 Ma💬 cs.CL

Replayable Financial Agents: A Determinism-Faithfulness Assurance Harness for Tool-Using LLM Agents

This paper introduces the Determinism-Faithfulness Assurance Harness (DFAH), a framework and set of financial benchmarks demonstrating that decision determinism and task accuracy in LLM agents are uncorrelated, thereby necessitating independent measurement to ensure reliable regulatory audit replay in financial services.

Raffi KhatchadourianTue, 10 Ma💬 cs.CL

MAS-Orchestra: Understanding and Improving Multi-Agent Reasoning Through Holistic Orchestration and Controlled Benchmarks

This paper introduces MAS-Orchestra, a training-time framework that optimizes multi-agent system orchestration via function-calling reinforcement learning, alongside the MASBENCH benchmark, to demonstrate that multi-agent benefits are task-dependent and to achieve significant performance gains with over 10x efficiency on complex reasoning tasks.

Zixuan Ke, Yifei Ming, Austin Xu, Ryan Chin, Xuan-Phi Nguyen, Prathyusha Jwalapuram, Jiayu Wang, Semih Yavuz, Caiming Xiong, Shafiq JotyTue, 10 Ma💬 cs.CL

A Two-Stage Multitask Vision-Language Framework for Explainable Crop Disease Visual Question Answering

This paper presents a lightweight, two-stage multitask vision-language framework that integrates a Swin Transformer encoder with sequence-to-sequence decoders to achieve state-of-the-art, explainable visual question answering for crop disease identification with near-perfect classification accuracy and strong generalization capabilities.

Md. Zahid Hossain, Most. Sharmin Sultana Samu, Md. Rakibul Islam, Md. Siam AnsaryTue, 10 Ma💬 cs.CL

Adaptation of Agentic AI: A Survey of Post-Training, Memory, and Skills

This survey proposes a unified four-paradigm framework to categorize and analyze the fragmented landscape of agentic AI adaptation, distinguishing between agent-side improvements (A1/A2) and tool-side enhancements (T1/T2) to systematically review post-training methods, memory architectures, and skill systems while evaluating their trade-offs and outlining future challenges.

Pengcheng Jiang, Jiacheng Lin, Zhiyi Shi, Zifeng Wang, Luxi He, Yichen Wu, Ming Zhong, Peiyang Song, Qizheng Zhang, Heng Wang, Xueqiang Xu, Hanwen Xu, Pengrui Han, Dylan Zhang, Jiashuo Sun, Chaoqi Yang, Kun Qian, Tian Wang, Changran Hu, Manling Li, Quanzheng Li, Hao Peng, Sheng Wang, Jingbo Shang, Chao Zhang, Jiaxuan You, Liyuan Liu, Pan Lu, Yu Zhang, Heng Ji, Yejin Choi, Dawn Song, Jimeng Sun, Jiawei HanTue, 10 Ma💬 cs.CL

CompanionCast: Toward Social Collaboration with Multi-Agent Systems in Shared Experiences

The paper introduces CompanionCast, a multi-agent framework that orchestrates specialized AI agents with multimodal detection, context caching, and spatial audio to enhance social presence and emotional sharing during shared media experiences, as validated by improved outcomes in pilot studies with soccer fans.

Yiyang Wang, Chen Chen, Tica Lin, Vishnu Raj, Josh Kimball, Alex Cabral, Josiah HesterTue, 10 Ma💬 cs.CL

Parallel Decoder Transformer: Planner-Seeded Latent Coordination for Synchronized Parallel Decoding

The Parallel Decoder Transformer (PDT) introduces a frozen-trunk architecture that enables synchronized parallel decoding by integrating a planner-seeded latent workspace and a dynamic notes bus, allowing multiple output streams to internally coordinate, resolve ownership, and synchronize generation without external orchestration.

Logan RobbinsTue, 10 Ma💬 cs.CL

Process-Centric Analysis of Agentic Software Systems

This paper introduces Graphectory, a graph-based framework for analyzing the stochastic execution trajectories of agentic software systems, which reveals that richer prompts and stronger models yield more complex reasoning patterns while enabling real-time monitoring and intervention that significantly improves problem resolution rates and efficiency.

Shuyang Liu, Yang Chen, Rahul Krishna, Saurabh Sinha, Jatin Ganhotra, Reyhan JabbarvandTue, 10 Ma💬 cs.CL

Tiny but Mighty: A Software-Hardware Co-Design Approach for Efficient Multimodal Inference on Battery-Powered Small Devices

The paper presents NANOMIND, a hardware-software co-design framework that decomposes Large Multimodal Models into modular components and dynamically schedules them across heterogeneous accelerators on unified-memory SoCs, enabling a battery-powered device to run LMMs entirely on-device with significantly improved energy efficiency and throughput.

Yilong Li, Shuai Zhang, Yijing Zeng, Hao Zhang, Xinmiao Xiong, Jingyu Liu, Pan Hu, Suman BanerjeeTue, 10 Ma💬 cs.CL

Mapping Overlaps in Benchmarks through Perplexity in the Wild

This paper introduces "benchmark signatures"—sets of salient tokens from in-the-wild corpora whose perplexity predicts model performance—to reveal nuanced overlaps and distinct capacities across 89 LLM benchmarks, offering a robust alternative to raw performance correlations for understanding the landscape of LLM abilities and the divergence between machine and human semantic organization.

Siyang Wu, Honglin Bao, Sida Li, Ari Holtzman, James A. EvansTue, 10 Ma💬 cs.CL

← Previous Next →