ττ-Knowledge: Evaluating Conversational Agents over Unstructured Knowledge

This paper introduces τ\tau-Knowledge, a new benchmark featuring the τ\tau-Banking domain to evaluate conversational agents' ability to coordinate unstructured knowledge retrieval with tool use in complex, policy-driven workflows, revealing that even frontier models struggle with low success rates and reliability in such realistic, long-horizon interactions.

Quan Shi, Alexandra Zytek, Pedram Razavi + 2 more2026-03-05🤖 cs.AI

DisenReason: Behavior Disentanglement and Latent Reasoning for Shared-Account Sequential Recommendation

This paper proposes DisenReason, a novel two-stage sequential recommendation framework for shared accounts that overcomes the limitation of fixed user assumptions by disentangling collective account behaviors in the frequency domain to serve as a pivot for latent reasoning, thereby dynamically inferring the number of latent users and significantly improving recommendation accuracy.

Jiawei Cheng, Min Gao, Zongwei Wang + 5 more2026-03-05🤖 cs.AI

AgentSelect: Benchmark for Narrative Query-to-Agent Recommendation

The paper introduces AgentSelect, a comprehensive benchmark that addresses the lack of principled agent selection methods by reframing the task as narrative query-to-agent recommendation, providing a unified dataset of over 111,000 queries and 107,000 agents to enable content-aware capability matching and demonstrate superior performance in recommending end-to-end agent configurations across diverse ecosystems.

Yunxiao Shi, Wujiang Xu, Tingwei Chen + 7 more2026-03-05🤖 cs.AI

Developing an AI Assistant for Knowledge Management and Workforce Training in State DOTs

This paper proposes a multi-agent Retrieval-Augmented Generation framework that integrates open-weight large language models and vision-language models to enhance knowledge management and workforce training in state Departments of Transportation by enabling context-aware, evidence-grounded responses from both textual and visual technical documentation.

Divija Amaram, Lu Gao, Gowtham Reddy Gudla + 1 more2026-03-05🤖 cs.AI

When Relevance Meets Novelty: Dual-Stable Periodic Optimization for Serendipitous Recommendation

This paper proposes the Co-Evolutionary Alignment (CoEA) method, which integrates a Dual-Stable Interest Exploration module to model both group identity and individual interests and a Periodic Collaborative Optimization mechanism to establish a dynamic closed-loop feedback system, thereby overcoming the limitations of static optimization and biased interest modeling in LLM-enhanced serendipitous recommendation.

Hongxiang Lin, Hao Guo, Zeshun Li + 6 more2026-03-05🤖 cs.AI