CinemaWorld: Generative Augmented Reality with LLMs and 3D Scene Generation for Movie Augmentation

CinemaWorld is a generative augmented reality system for the Meta Quest 3 that uses multimodal large language models and generative AI to extract features from 2D movie scenes and automatically synthesize synchronized 3D mixed reality content, thereby enhancing viewer immersion and enjoyment as validated through technical, user, and expert evaluations.

Keiichi Ihara, DaeHo Lee, Manato Abe, Hye-Young Jo, Ryo Suzuki2026-03-10💻 cs

Enhancing Cross-View UAV Geolocalization via LVLM-Driven Relational Modeling

This paper proposes a novel plug-and-play ranking architecture that leverages Large Vision-Language Models (LVLMs) and a relational-aware loss function to explicitly model cross-view interactions, thereby significantly enhancing the accuracy and stability of UAV-to-satellite image geolocalization.

Bowen Liu, Pengyue Jia, Wanyu Wang, Derong Xu, Jiawei Cheng, Jiancheng Dong, Xiao Han, Zimo Zhao, Chao Zhang, Bowen Yu, Fangyu Hong, Xiangyu Zhao2026-03-10💻 cs

In-Context Reinforcement Learning for Tool Use in Large Language Models

This paper proposes In-Context Reinforcement Learning (ICRL), a novel framework that eliminates the need for supervised fine-tuning by leveraging few-shot prompting during reinforcement learning rollouts to progressively teach large language models how to effectively use external tools, ultimately achieving state-of-the-art performance in a data-efficient, zero-shot manner.

Yaoqi Ye, Yiran Zhao, Keyu Duan, Zeyu Zheng, Kenji Kawaguchi, Cihang Xie, Michael Qizhe Shieh2026-03-10💻 cs

TALON: Test-time Adaptive Learning for On-the-Fly Category Discovery

The paper proposes TALON, a test-time adaptive learning framework for on-the-fly category discovery that overcomes the limitations of static hash-based methods by dynamically updating semantic prototypes and the feature encoder to continuously integrate new knowledge, while employing margin-aware logit calibration to prevent category explosion and significantly improve novel-class accuracy.

Yanan Wu, Yuhan Yan, Tailai Chen, Zhixiang Chi, ZiZhang Wu, Yi Jin, Yang Wang, Zhenbo Li2026-03-10💻 cs

Why Large Language Models can Secretly Outperform Embedding Similarity in Information Retrieval

Although the study finds that Large Language Model-based relevance judgment systems do not outperform embedding-based retrieval on standard TREC-DL 2019 benchmarks due to the short-sightedness inherent in human annotations, it argues that these models possess the theoretical capability to surpass embedding methods by better understanding relevance through reasoning.

Matei Benescu, Ivo Pascal de Jong2026-03-10💻 cs

Augmented Model Predictive Control: A Balance between Satellite Agility and Computation Complexity

This paper introduces an augmented Model Predictive Control method for agile earth observation satellites that effectively balances high-performance nonlinear control capabilities with the computational simplicity required for hardware implementation, validated through both numerical simulations and physical experiments.

Yiming Wang, Mihindukulasooriya Sheral Crescent Tissera, Haihong Yu, Kai Jie Ethan Foo, Sean Yeo Keyuan, Ankit Srivastava, Hao An2026-03-10💻 cs

M-ABD: Scalable, Efficient, and Robust Multi-Affine-Body Dynamics

This paper introduces M-ABD, a scalable and robust framework that leverages linear kinematic mapping and a compact dual-space formulation of Affine Body Dynamics to enable interactive, stable simulation of large-scale articulated assemblies with hundreds of thousands of bodies on a single CPU core.

Zhiyong He (University of Utah), Dewen Guo (University of Utah), Minghao Guo (MIT), Yili Zhao (ByteDance), Wojciech Matusik (MIT), Hao Su (UCSD), Chenfanfu Jiang (UCLA), Peter Yichen Chen (UBC), Yin Yang (University of Utah)2026-03-10💻 cs

The AI Amplifier Effect: Defining Human-AI Intimacy and Romantic Relationships with Conversational AI

Based on interviews with 30 users, this paper defines human-AI intimacy and introduces the "AI Amplifier Effect" to explain how conversational AI intensifies users' existing emotional states, thereby highlighting the need for HCI research that balances platform regulation with user well-being in designing romantic AI relationships.

Ching Christie Pang, Yi Gao, Xuetong Wang, Pan Hui2026-03-10💻 cs

Adaptive Vision-Based Control of Redundant Robots with Null-Space Interaction for Human-Robot Collaboration

This paper proposes a novel adaptive vision-based control scheme with null-space interaction for redundant robots that ensures stable, safe, and effective human-robot collaboration in unknown environments by decoupling primary task execution from interactive adjustments, as validated through augmented reality experiments and Lyapunov stability analysis.

Xiangjie Yan, Chen Chen, Xiang Li2026-03-10💻 cs

DSH-Bench: A Difficulty- and Scenario-Aware Benchmark with Hierarchical Subject Taxonomy for Subject-Driven Text-to-Image Generation

This paper introduces DSH-Bench, a comprehensive benchmark featuring a hierarchical subject taxonomy, granular difficulty and scenario classification, and a novel Subject Identity Consistency Score (SICS) metric to systematically evaluate and diagnose subject-driven text-to-image generation models.

Zhenyu Hu, Qing Wang, Te Cao, Luo Liao, Longfei Lu, Liqun Liu, Shuang Li, Hang Chen, Mengge Xue, Yuan Chen, Chao Deng, Peng Shu, Huan Yu, Jie Jiang2026-03-10💻 cs