cs.IR papers | Gist.Science

Turning Trust to Transactions: Tracking Affiliate Marketing and FTC Compliance in YouTube's Influencer Economy

This paper analyzes a decade-long dataset of 2 million YouTube videos to reveal that while affiliate marketing is widespread, disclosure compliance with FTC guidelines remains critically low, suggesting that platform-standardized features are essential for improving transparency and trust in the influencer economy.

Chen Sun, Yash Vekaria, Zubair Shafiq + 1 more | 2026-03-05 | cs.LG

Generative Recommendation for Large-Scale Advertising

This paper introduces GR4AD, a production-oriented generative recommendation system for large-scale advertising that integrates novel tokenization, a lazy autoregressive decoder, and value-aware reinforcement learning to achieve significant revenue gains and high-throughput real-time serving in Kuaishou's 400-million-user ecosystem.

Ben Xue, Dan Liu, Lixiang Wang + 26 more | 2026-03-05 | cs.LG

PinRec: Outcome-Conditioned, Multi-Token Generative Retrieval for Industry-Scale Recommendation Systems

This paper introduces PinRec, a novel outcome-conditioned, multi-token generative retrieval model developed for Pinterest that successfully balances performance, diversity, and efficiency to meet industrial-scale recommendation needs and multiple business metrics.

Prabhat Agarwal, Anirudhan Badrinath, Laksh Bhasin + 4 more | 2026-03-05 | cs.LG

$\tau$-Knowledge: Evaluating Conversational Agents over Unstructured Knowledge

This paper introduces $\tau$-Knowledge, a new benchmark featuring the $\tau$-Banking domain to evaluate conversational agents' ability to coordinate unstructured knowledge retrieval with tool use in complex, policy-driven workflows, revealing that even frontier models struggle with low success rates and reliability in such realistic, long-horizon interactions.

Quan Shi, Alexandra Zytek, Pedram Razavi + 2 more | 2026-03-05 | cs.AI

LabelBuddy: An Open Source Music and Audio Language Annotation Tagging Tool Using AI Assistance

This paper introduces LabelBuddy, an open-source collaborative tool that bridges the gap between human intent and machine understanding in Music Information Retrieval by decoupling the annotation interface from containerized AI backends to enable flexible, AI-assisted pre-tagging and multi-user consensus.

Ioannis Prokopiou, Ioannis Sina, Agisilaos Kounelis + 2 more | 2026-03-05 | cs.AI

DisenReason: Behavior Disentanglement and Latent Reasoning for Shared-Account Sequential Recommendation

This paper proposes DisenReason, a novel two-stage sequential recommendation framework for shared accounts that overcomes the limitation of fixed user assumptions by disentangling collective account behaviors in the frequency domain to serve as a pivot for latent reasoning, thereby dynamically inferring the number of latent users and significantly improving recommendation accuracy.

Jiawei Cheng, Min Gao, Zongwei Wang + 5 more | 2026-03-05 | cs.AI

Not All Candidates are Created Equal: A Heterogeneity-Aware Approach to Pre-ranking in Recommender Systems

This paper introduces HAP, a heterogeneity-aware pre-ranking framework that resolves gradient conflicts and optimizes computational efficiency by disentangling easy and hard samples for tailored model allocation, achieving significant user engagement improvements in the Toutiao production system without extra costs.

Pengfei Tong, Siyuan Chen, Chenwei Zhang + 4 more | 2026-03-05 | cs.AI

AgentSelect: Benchmark for Narrative Query-to-Agent Recommendation

The paper introduces AgentSelect, a comprehensive benchmark that addresses the lack of principled agent selection methods by reframing the task as narrative query-to-agent recommendation, providing a unified dataset of over 111,000 queries and 107,000 agents to enable content-aware capability matching and demonstrate superior performance in recommending end-to-end agent configurations across diverse ecosystems.

Yunxiao Shi, Wujiang Xu, Tingwei Chen + 7 more | 2026-03-05 | cs.AI

SafeCRS: Personalized Safety Alignment for LLM-Based Conversational Recommender Systems

This paper introduces SafeCRS, a safety-aware training framework and the SafeRec benchmark designed to mitigate personalized safety violations in LLM-based conversational recommender systems by integrating Safe-SFT and Safe-GDPO to align recommendations with individual user constraints while maintaining high recommendation quality.

Haochang Hao, Yifan Xu, Xinzhuo Li + 2 more | 2026-03-05 | cs.AI

Graph Hopfield Networks: Energy-Based Node Classification with Associative Memory

This paper introduces Graph Hopfield Networks, an energy-based framework that unifies associative memory retrieval with graph Laplacian smoothing to achieve state-of-the-art node classification performance and enhanced robustness across diverse graph benchmarks.

Abinav Rao, Alex Wa, Rishi Athavale | 2026-03-05 | cs.AI

MemSifter: Offloading LLM Memory Retrieval via Outcome-Driven Proxy Reasoning

MemSifter is an efficient framework that offloads memory retrieval to a small-scale proxy model trained via outcome-driven reinforcement learning, achieving state-of-the-art performance in long-term LLM memory tasks while minimizing computational overhead.

Jiejun Tan, Zhicheng Dou, Liancheng Zhang + 3 more | 2026-03-05 | cs.AI

Developing an AI Assistant for Knowledge Management and Workforce Training in State DOTs

This paper proposes a multi-agent Retrieval-Augmented Generation framework that integrates open-weight large language models and vision-language models to enhance knowledge management and workforce training in state Departments of Transportation by enabling context-aware, evidence-grounded responses from both textual and visual technical documentation.

Divija Amaram, Lu Gao, Gowtham Reddy Gudla + 1 more | 2026-03-05 | cs.AI

PlugMem: A Task-Agnostic Plugin Memory Module for LLM Agents

The paper introduces PlugMem, a task-agnostic plugin memory module that structures episodic memories into a compact, knowledge-centric graph to enable efficient retrieval and reasoning for LLM agents, demonstrating superior performance across diverse benchmarks compared to both task-specific and existing task-agnostic baselines.

Ke Yang, Zixi Chen, Xuan He + 6 more | 2026-03-05 | cs.AI

From Conflict to Consensus: Boosting Medical Reasoning via Multi-Round Agentic RAG

The paper proposes MA-RAG, a multi-round agentic RAG framework that iteratively refines medical reasoning by transforming semantic conflicts into targeted evidence retrieval and optimizing reasoning traces, thereby achieving substantial accuracy improvements over existing baselines across seven medical benchmarks.

Wenhao Wu, Zhentao Tang, Yafu Li + 5 more | 2026-03-05 | cs.AI

AriadneMem: Threading the Maze of Lifelong Memory for LLM Agents

AriadneMem is a structured memory system for long-horizon LLM agents that employs a decoupled two-phase pipeline of entropy-aware filtering, conflict-aware coarsening, and algorithmic bridge discovery to significantly improve multi-hop reasoning accuracy and efficiency while drastically reducing context usage and runtime.

Wenhui Zhu, Xiwen Chen, Zhipeng Wang + 11 more | 2026-03-05 | cs.AI

Succeeding at Scale: Automated Dataset Construction and Query-Side Adaptation for Multi-Tenant Search

This paper introduces DevRev-Search, an automated benchmark for technical support retrieval, and proposes an Index-Preserving Adaptation strategy that fine-tunes only the query encoder to achieve scalable, high-performance multi-tenant search without the prohibitive cost of re-indexing.

Prateek Jain, Shabari S Nair, Ritesh Goru + 4 more | 2026-03-05 | cs.AI

REVISION: Reflective Intent Mining and Online Reasoning Auxiliary for E-commerce Visual Search System Optimization

The paper proposes REVISION, a novel framework that integrates offline large-model reasoning to mine implicit user intents from no-click requests with online adaptive decision-making to optimize Taobao's visual search system and significantly reduce no-click rates.

Yiwen Tang, Qiuyu Zhao, Zenghui Sun + 3 more | 2026-03-05 | cs.AI

Towards Personalized Deep Research: Benchmarks and Evaluations

This paper introduces PDR-Bench, the first benchmark for evaluating personalization in Deep Research Agents, along with the PQR Evaluation Framework to assess performance across personalized, open-ended research tasks.

Yuan Liang, Jiaxian Li, Yuqing Wang + 11 more | 2026-03-05 | cs.AI

When Relevance Meets Novelty: Dual-Stable Periodic Optimization for Serendipitous Recommendation

This paper proposes the Co-Evolutionary Alignment (CoEA) method, which integrates a Dual-Stable Interest Exploration module to model both group identity and individual interests and a Periodic Collaborative Optimization mechanism to establish a dynamic closed-loop feedback system, thereby overcoming the limitations of static optimization and biased interest modeling in LLM-enhanced serendipitous recommendation.

Hongxiang Lin, Hao Guo, Zeshun Li + 6 more | 2026-03-05 | cs.AI

OSCAR: Online Soft Compression And Reranking

OSCAR is a novel online soft compression and reranking method that dynamically reduces computational overhead in Retrieval-Augmented Generation pipelines at inference time, achieving significant speed-ups with minimal accuracy loss across various large language model sizes.

Maxime Louis, Thibault Formal, Hervé Dejean + 1 more | 2026-03-05 | cs.AI

