OTPL-VIO: Robust Visual-Inertial Odometry with Optimal Transport Line Association and Adaptive Uncertainty

This paper presents OTPL-VIO, a robust stereo visual-inertial odometry system that enhances performance in low-texture and illumination-challenging environments by employing a training-free deep descriptor with entropy-regularized optimal transport for line association and introducing adaptive uncertainty weighting to stabilize estimation.

Zikun Chen, Wentao Zhao, Yihe Niu, Tianchen Deng, Jingchuan WangWed, 11 Ma💻 cs

Proportionality Degree in Participatory Budgeting

This paper initiates the study of proportionality degree in participatory budgeting by establishing tight theoretical bounds for the Method of Equal Shares and Phragmen's Sequential Rule, demonstrating that despite their differing axiomatic properties, they achieve comparable quantitative proportionality, a finding further validated through extensive experiments on real-world datasets.

Aris Filos-Ratsikas, Sreedurga Gogulapati, Georgios KalantzisWed, 11 Ma💻 cs

DiffWind: Physics-Informed Differentiable Modeling of Wind-Driven Object Dynamics

The paper presents DiffWind, a physics-informed differentiable framework that unifies wind-object interaction modeling, video-based reconstruction, and forward simulation by combining 3D Gaussian Splatting, the Material Point Method, and Lattice Boltzmann constraints to accurately recover and simulate wind-driven object dynamics from video observations.

Yuanhang Lei, Boming Zhao, Zesong Yang, Xingxuan Li, Tao Cheng, Haocheng Peng, Ru Zhang, Yang Yang, Siyuan Huang, Yujun Shen, Ruizhen Hu, Hujun Bao, Zhaopeng CuiWed, 11 Ma💻 cs

TemporalDoRA: Temporal PEFT for Robust Surgical Video Question Answering

The paper introduces TemporalDoRA, a parameter-efficient fine-tuning method that integrates lightweight temporal attention into the low-rank adaptation branch of vision encoders to enhance robustness against linguistic variations in surgical video question answering, validated by a new colonoscopy dataset and improved Out-of-Template performance.

Luca Carlini, Chiara Lena, Cesare Hassan, Danail Stoyanov, Elena De Momi, Sophia Bano, Mobarak I. HoqueWed, 11 Ma💻 cs

An Empirical Study of Interaction Smells in Multi-Turn Human-LLM Collaborative Code Generation

This paper introduces the concept of "Interaction Smells" in multi-turn human-LLM code generation, establishes a taxonomy based on real-world data, analyzes their distribution across leading models, and proposes the Invariant-aware Constraint Evolution (InCE) framework to effectively mitigate these issues and improve task success rates.

Binquan Zhang, Li Zhang, Lin Shi, Song Wang, Yuwei Qian, Linhui Zhao, Fang Liu, An Fu, Yida YeWed, 11 Ma💻 cs

TriFusion-SR: Joint Tri-Modal Medical Image Fusion and SR

The paper proposes TriFusion-SR, a wavelet-guided conditional diffusion framework that jointly performs tri-modal medical image fusion and super-resolution by decomposing features into frequency bands and employing rectified wavelet features with adaptive spatial-frequency fusion to achieve state-of-the-art performance in resolution and perceptual quality.

Fayaz Ali Dharejo, Sharif S. M. A., Aiman Khalil, Nachiket Chaudhary, Rizwan Ali Naqvi, Radu TimofteWed, 11 Ma💻 cs

Robotic Scene Cloning:Advancing Zero-Shot Robotic Scene Adaptation in Manipulation via Visual Prompt Editing

This paper introduces Robotic Scene Cloning (RSC), a novel method that enhances zero-shot robotic manipulation by editing existing operation trajectories through visual prompting and condition injection to generate accurate, scene-consistent samples that significantly improve policy generalization in real-world environments.

Binyuan Huang, Yuqing Wen, Yucheng Zhao, Yaosi Hu, Tiancai Wang, Chang Wen Chen, Haoqiang Fan, Zhenzhong ChenWed, 11 Ma💻 cs

FrameDiT: Diffusion Transformer with Frame-Level Matrix Attention for Efficient Video Generation

The paper proposes FrameDiT, a novel video generation architecture that introduces Matrix Attention to efficiently model global spatio-temporal dynamics by processing frames as matrices, thereby achieving state-of-the-art video quality and temporal coherence while maintaining computational efficiency comparable to local factorized attention.

Minh Khoa Le, Kien Do, Duc Thanh Nguyen, Truyen TranWed, 11 Ma💻 cs

Idempotent Slices with Applications to Code-Size Reduction

This paper formalizes the concept of idempotent backward slices and presents a sound, efficient algorithm for extracting them from Gated Static Single Assignment (GSA) form to enable a novel sparse code-size reduction optimization that merges non-contiguous instructions, achieving up to 7.24% size reduction in specific benchmarks.

Rafael Alvarenga de Azevedo, Daniel Augusto Costa de Sa, Rodrigo Caetano Rocha, Fernando Magno Quintão PereiraWed, 11 Ma💻 cs

A Regularized Ensemble Kalman Filter for Stochastic Phase Field Models of Brittle Fracture

This paper proposes a regularized ensemble Kalman filter framework that integrates sensor displacement data into stochastic phase-field models of brittle fracture to infer the evolving displacement and phase-field states, thereby correcting model predictions while ensuring physical consistency through a novel regularization step.

Lucas Hermann, Ralf Jänicke, Knut Andreas Meyer, Ulrich RömerWed, 11 Ma💻 cs

WVA: A Global Optimization Control Plane for llmd

The paper introduces WVA, a global optimization control plane co-designed with the \texttt{llmd} inference engine that leverages internal saturation states and fragmentation-aware strategies to achieve significantly higher throughput, fewer request failures, and lower power consumption compared to traditional Kubernetes autoscalers when managing heterogeneous LLM workloads.

Abhishek Malvankar, Lionel Villard, Mohammed Abdi, Evgeny Shindin, Braulio Dumba, Vishakha Ramani, Asser Tantawi, Tamar EilamWed, 11 Ma💻 cs

FetalAgents: A Multi-Agent System for Fetal Ultrasound Image and Video Analysis

FetalAgents is a novel multi-agent system that dynamically orchestrates specialized vision experts to deliver robust, end-to-end fetal ultrasound analysis and structured clinical reporting across multiple tasks, outperforming existing specialized models and multimodal large language models.

Xiaotian Hu, Junwei Huang, Mingxuan Liu, Kasidit Anmahapong, Yifei Chen, Yitong Luo, Yiming Huang, Xuguang Bai, Zihan Li, Yi Liao, Haibo Qu, Qiyuan TianWed, 11 Ma💻 cs

Ensuring Data Freshness in Multi-Rate Task Chains Scheduling

This paper proposes a task-based scheduling framework that ensures end-to-end data freshness in safety-critical multi-rate systems by introducing a Consensus Offset Search algorithm to align task releases with data lifespan constraints, thereby eliminating the artificial latency of Logical Execution Time and the inefficiency of redundant oversampling while preserving Global EDF schedulability.

José Luis Conradi Hoffmann, Antônio Augusto FröhlichWed, 11 Ma💻 cs