Med-V1: Small Language Models for Zero-shot and Scalable Biomedical Evidence Attribution

The paper introduces Med-V1, a family of efficient 3-billion-parameter small language models trained on synthetic data that achieve performance comparable to frontier models like GPT-5 for biomedical evidence attribution and hallucination detection, while enabling scalable applications such as analyzing citation validity and identifying evidence misattributions in clinical guidelines.

Qiao Jin, Yin Fang, Lauren He + 12 more2026-03-06🤖 cs.AI

DiSCTT: Consensus-Guided Self-Curriculum for Efficient Test-Time Adaptation in Reasoning

The paper proposes DiSCTT, a difficulty-aware, consensus-guided self-curriculum framework that dynamically allocates supervised fine-tuning or reinforcement learning strategies based on instance-level agreement among reasoning trajectories, thereby achieving more stable, efficient, and accurate test-time adaptation for large language models on heterogeneous reasoning tasks.

Mohammad Mahdi Moradi, Sudhir Mudur2026-03-06💬 cs.CL

An Exploration-Analysis-Disambiguation Reasoning Framework for Word Sense Disambiguation with Low-Parameter LLMs

This study demonstrates that carefully fine-tuned low-parameter LLMs (<4B) utilizing Chain-of-Thought reasoning and neighbor-word analysis can achieve Word Sense Disambiguation performance comparable to or exceeding state-of-the-art high-parameter models like GPT-4-Turbo, while significantly reducing computational and energy costs.

Deshan Sumanathilaka, Nicholas Micallef, Julian Hough2026-03-06💬 cs.CL

Distributed Partial Information Puzzles: Examining Common Ground Construction Under Epistemic Asymmetry

This paper introduces the Distributed Partial Information Puzzle (DPIP) and its associated multimodal dataset to evaluate how well current large language models and logic-based systems can construct common ground under epistemic asymmetry, revealing that modern LLMs struggle to accurately track task progression and belief states compared to axiomatic approaches.

Yifan Zhu, Mariah Bradford, Kenneth Lai + 4 more2026-03-06🤖 cs.AI