MLLMRec-R1: Incentivizing Reasoning Capability in Large Language Models for Multimodal Sequential Recommendation

MLLMRec-R1 is an efficient GRPO-based framework for multimodal sequential recommendation that overcomes the high computational costs of visual token processing and the issue of reward inflation by textualizing visual signals offline and employing a mixed-grained data augmentation strategy to construct high-quality reasoning supervision.

Yu Wang, Yonghui Yang, Le Wu, Jiancan Wu, Hefei Xu, Hui LinMon, 09 Ma💻 cs

ChatShopBuddy: Towards Reliable Conversational Shopping Agents via Reinforcement Learning

This paper introduces ChatShopBuddy, a conversational shopping agent optimized via Reinforcement Learning using a new benchmark (SmartShopBench), a Hierarchical Reward Modeling framework, and a Dynamic Contrastive Policy Optimization algorithm to effectively balance product correctness, persuasiveness, and operational efficiency in real-world scenarios.

Yiruo Cheng, Kelong Mao, Tianhao Li, Jiejun Tan, Ji-Rong Wen, Zhicheng DouMon, 09 Ma💻 cs

Efficient, Property-Aligned Fan-Out Retrieval via RL-Compiled Diffusion

The paper proposes R4T, a three-stage framework that leverages reinforcement learning to synthesize objective-aligned training data for a lightweight diffusion retriever, enabling efficient, high-quality set-valued retrieval that optimizes complex properties like diversity and coverage while significantly reducing inference latency compared to RL-based baselines.

Pengcheng Jiang, Judith Yue Li, Moonkyung Ryu, R. Lily Hu, Kun Su, Zhong Yi Wan, Liam Hebert, Hao Peng, Jiawei Han, Dima Kuzmin, Craig BoutilierMon, 09 Ma🤖 cs.LG

Efficient Vector Search in the Wild: One Model for Multi-K Queries

The paper introduces OMEGA, a K-generalizable learned top-K search method that leverages a base model trained on K=1 with trajectory-based features and a dynamic refinement procedure to achieve high accuracy and low latency for multi-K vector queries while significantly reducing preprocessing time compared to state-of-the-art methods.

Yifan Peng, Jiafei Fan, Xingda Wei, Sijie Shen, Rong Chen, Jianning Wang, Xiaojian Luo, Wenyuan Yu, Jingren Zhou, Haibo ChenMon, 09 Ma🤖 cs.LG

HCT-QA: A Benchmark for Question Answering on Human-Centric Tables

This paper introduces HCT-QA, a comprehensive benchmark comprising thousands of real-world and synthetic human-centric tables with natural language question-answer pairs, designed to evaluate and improve the performance of Large Language Models and Vision Language Models in querying complex tabular data.

Mohammad S. Ahmad, Zan A. Naeem, Michaël Aupetit, Ahmed Elmagarmid, Mohamed Eltabakh, Xiaosong Ma, Mourad Ouzzani, Chaoyi Ruan, Hani Al-SayehMon, 09 Ma🤖 cs.AI

Balancing Domestic and Global Perspectives: Evaluating Dual-Calibration and LLM-Generated Nudges for Diverse News Recommendation

This study evaluates a dual-calibration algorithmic nudge and an LLM-based presentation nudge within a personalized diversity framework, finding that while algorithmic nudges effectively increase news consumption diversity and shift long-term reading habits toward balanced domestic and global coverage, LLM-based presentation nudges yield variable results and user-specific topic interest remains the strongest predictor of engagement.

Ruixuan Sun, Matthew Zent, Minzhu Zhao, Thanmayee Boyapati, Xinyi Li, Joseph A. KonstanMon, 09 Ma🤖 cs.AI

The DSA's Blind Spot: Algorithmic Audit of Advertising and Minor Profiling on TikTok

This paper presents an algorithmic audit of TikTok revealing that while the platform technically complies with the Digital Service Act's ban on profiled advertising to minors, it effectively circumvents this protection by delivering highly personalized, often undisclosed influencer marketing content to adolescents, thereby highlighting the urgent need to expand the regulatory definition of "advertisement" to cover such commercial practices.

Sara Solarova, Matej Mosnar, Matus Tibensky, Jan Jakubcik, Adrian Bindas, Simon Liska, Filip Hossner, Matúš Mesarčík, Ivan SrbaMon, 09 Ma🤖 cs.AI

MDER-DR: Multi-Hop Question Answering with Entity-Centric Summaries

The paper introduces MDER-DR, a novel Retrieval-Augmented Generation framework that combines a Map-Disambiguate-Enrich-Reduce indexing strategy with a Decompose-Resolve retrieval mechanism to significantly improve multi-hop question answering on Knowledge Graphs by preserving contextual nuance and enabling robust reasoning without explicit graph traversal.

Riccardo Campi, Nicolò Oreste Pinciroli Vago, Mathyas Giudici, Marco Brambilla, Piero FraternaliFri, 13 Ma💬 cs.CL