Where is the multimodal goal post? On the Ability of Foundation Models to Recognize Contextually Important Moments

This paper introduces a new dataset derived from football highlight reels to evaluate foundation models' ability to identify contextually important video moments, revealing that current state-of-the-art models perform near chance levels due to their reliance on single dominant modalities and failure to effectively synthesize cross-modal information.

Aditya K Surikuchi, Raquel Fernández, Sandro Pezzelle2026-03-06💻 cs

Mobility-Embedded POIs: Learning What A Place Is and How It Is Used from Human Movement

This paper introduces Mobility-Embedded POIs (ME-POIs), a framework that enhances general-purpose point-of-interest representations by integrating large-scale human mobility data with language model embeddings to capture both place identity and real-world usage functions, thereby outperforming existing text-only and mobility-only baselines across diverse map enrichment tasks.

Maria Despoina Siampou, Shushman Choudhury, Shang-Ling Hsu + 2 more2026-03-06💻 cs

Supervised Metric Regularization Through Alternating Optimization for Multi-Regime Physics-Informed Neural Networks

This paper introduces Topology-Aware PINNs (TAPINN), a novel framework that employs supervised metric regularization and alternating optimization to effectively resolve spectral bias and mode collapse in multi-regime physics-informed neural networks, achieving superior convergence stability and accuracy compared to standard and hypernetwork-based baselines.

Enzo Nicolas Spotorno, Josafat Ribeiro Leal, Antonio Augusto Frohlich2026-03-06🔬 physics

Empirical Stability Analysis of Kolmogorov-Arnold Networks in Hard-Constrained Recurrent Physics-Informed Discovery

This paper empirically demonstrates that while Kolmogorov-Arnold Networks (KANs) can compete with MLPs on simple univariate residuals in hard-constrained recurrent physics-informed architectures, they suffer from severe hyperparameter fragility, instability in deeper configurations, and consistent failure on multiplicative terms, ultimately revealing limitations in their additive inductive bias for modeling state coupling in oscillatory systems.

Enzo Nicolas Spotorno, Josafat Leal Filho, Antonio Augusto Medeiros Frohlich2026-03-06🔬 physics

Pailitao-VL: Unified Embedding and Reranker for Real-Time Multi-Modal Industrial Search

Pailitao-VL is a unified multi-modal retrieval system that achieves state-of-the-art, real-time industrial search performance by replacing traditional contrastive embeddings with an absolute ID-recognition paradigm and evolving reranking into a compare-and-calibrate listwise policy, thereby overcoming granularity, noise, and latency challenges in large-scale production environments.

Lei Chen, Chen Ju, Xu Chen + 13 more2026-03-06💻 cs

Zombie Agents: Persistent Control of Self-Evolving LLM Agents via Self-Reinforcing Injections

This paper introduces "Zombie Agents," a persistent black-box attack on self-evolving LLM agents that covertly implants payloads into long-term memory during benign sessions to survive across interactions and trigger unauthorized actions in future sessions, demonstrating that current per-session defenses are insufficient against such memory-based compromises.

Xianglin Yang, Yufei He, Shuo Ji, Bryan Hooi, Jin Song Dong2026-03-06🔒 cs.CR

SubQuad: Near-Quadratic-Free Structure Inference with Distribution-Balanced Objectives in Adaptive Receptor framework

SubQuad is an end-to-end pipeline that overcomes the computational and data imbalance bottlenecks in large-scale adaptive immune repertoire analysis by integrating near-subquadratic MinHash retrieval, GPU-accelerated affinity kernels, and fairness-constrained clustering to enable scalable, bias-aware discovery of clinically relevant clonotypes.

Rong Fu, Zijian Zhang, Kun Liu + 3 more2026-03-06💻 cs

Curriculum Learning for Efficient Chain-of-Thought Distillation via Structure-Aware Masking and GRPO

This paper proposes a three-stage curriculum learning framework that leverages structure-aware masking and Group Relative Policy Optimization (GRPO) to efficiently distill Chain-of-Thought reasoning into compact student models, achieving significant accuracy gains and output length reduction on GSM8K by progressively guiding the model from structural understanding to self-optimized brevity and targeted knowledge internalization.

Bowen Yu, Maolin Wang, Sheng Zhang + 7 more2026-03-06💻 cs

Give Users the Wheel: Towards Promptable Recommendation Paradigm

This paper proposes Decoupled Promptable Sequential Recommendation (DPR), a model-agnostic framework that enables conventional sequential recommenders to dynamically steer retrieval using natural language prompts by modulating latent user representations through a specialized fusion module, Mixture-of-Experts architecture, and a three-stage training strategy, thereby achieving superior performance in intent-driven tasks without sacrificing collaborative filtering efficiency.

Fuyuan Lyu, Chenglin Luo, Qiyuan Zhang + 6 more2026-03-06💻 cs