PathoScribe: Transforming Pathology Data into a Living Library with a Unified LLM-Driven Framework for Semantic Retrieval and Clinical Integration

PathoScribe is a unified retrieval-augmented large language model framework that transforms static pathology archives into an active, reasoning-enabled clinical intelligence platform, enabling natural language case retrieval, automated cohort construction, and real-time diagnostic support with high accuracy and efficiency.

Abdul Rehman Akbar, Samuel Wales-McGrath, Alejadro Levya, Lina Gokhale, Rajendra Singh, Wei Chen, Anil Parwani, Muhammad Khalid Khan Niazi2026-03-11🤖 cs.AI

VoxEmo: Benchmarking Speech Emotion Recognition with Speech LLMs

The paper introduces VoxEmo, a comprehensive benchmark and toolkit for evaluating Speech Large Language Models on speech emotion recognition across 35 corpora and 15 languages, featuring a distribution-aware soft-label protocol that reveals how these models uniquely align with human subjective emotion distributions despite trailing supervised baselines in hard-label accuracy.

Hezhao Zhang, Huang-Cheng Chou, Shrikanth Narayanan, Thomas Hain2026-03-11🤖 cs.AI

AgentOS: From Application Silos to a Natural Language-Driven Data Ecosystem

This paper proposes "AgentOS," a new paradigm that replaces traditional GUI-based operating systems with a natural language-driven ecosystem centered on an Agent Kernel, framing the realization of such a system as a Knowledge Discovery and Data Mining (KDD) challenge involving intent mining, workflow automation, and dynamic personal knowledge graphs.

Rui Liu, Tao Zhe, Dongjie Wang, Zijun Yao, Kunpeng Liu, Yanjie Fu, Huan Liu, Jian Pei2026-03-11🤖 cs.AI

Semantic Level of Detail: Multi-Scale Knowledge Representation via Heat Kernel Diffusion on Hyperbolic Manifolds

This paper introduces Semantic Level of Detail (SLoD), a framework that utilizes heat kernel diffusion on hyperbolic manifolds to enable continuous, principled control over knowledge abstraction levels in AI memory systems, automatically detecting emergent semantic boundaries in both synthetic and real-world knowledge graphs without manual supervision.

Edward Izgorodin2026-03-11🤖 cs.AI

Gender Fairness in Audio Deepfake Detection: Performance and Disparity Analysis

This paper analyzes gender bias in audio deepfake detection using the ASVspoof 5 dataset and a ResNet-18 classifier, demonstrating that while aggregate metrics like Equal Error Rate may suggest low disparity, fairness-aware evaluation reveals significant gender-specific error distributions that necessitate more equitable and robust detection systems.

Aishwarya Fursule, Shruti Kshirsagar, Anderson R. Avila2026-03-11🤖 cs.AI

AI Phenomenology for Understanding Human-AI Experiences Across Eras

This paper proposes "AI phenomenology" as a research framework that prioritizes users' first-person lived experiences over traditional performance metrics to better understand and guide the bidirectional alignment between humans and AI systems, offering a set of methodological tools, design concepts, and a research agenda derived from three empirical studies.

Bhada Yun, Evgenia Taranova, Dana Feng, Renn Su, April Yi Wang2026-03-11🤖 cs.AI

MEMO: Memory-Augmented Model Context Optimization for Robust Multi-Turn Multi-Agent LLM Games

The paper introduces MEMO, a memory-augmented self-play framework that optimizes inference-time context through structured memory retention and uncertainty-aware prompt exploration, significantly improving the win rates and run-to-run stability of multi-agent LLMs in long-horizon, imperfect-information games.

Yunfei Xie, Kevin Wang, Bobby Cheng, Jianzhu Yao, Zhizhou Sha, Alexander Duffy, Yihan Xi, Hongyuan Mei, Cheston Tan, Chen Wei, Pramod Viswanath, Zhangyang Wang2026-03-11🤖 cs.AI

PlayWorld: Learning Robot World Models from Autonomous Play

PlayWorld introduces a fully autonomous pipeline that trains high-fidelity, physically consistent video world models from unsupervised robot self-play, outperforming human-collected data in predicting complex interactions and significantly boosting real-world reinforcement learning success rates.

Tenny Yin, Zhiting Mei, Zhonghe Zheng, Miyu Yamane, David Wang, Jade Sceats, Samuel M. Bateman, Lihan Zha, Apurva Badithela, Ola Shorinwa, Anirudha Majumdar2026-03-11🤖 cs.AI

WS-Net: Weak-Signal Representation Learning and Gated Abundance Reconstruction for Hyperspectral Unmixing via State-Space and Weak Signal Attention Fusion

This paper introduces WS-Net, a deep unmixing framework that combines state-space modeling, wavelet-fused encoding, and a specialized weak signal attention mechanism to effectively recover weak spectral signals and significantly improve abundance estimation accuracy in hyperspectral images under low signal-to-noise conditions.

Zekun Long, Ali Zia, Guanyiman Fu, Vivien Rolland, Jun Zhou2026-03-11🤖 cs.AI