ToolRosetta: Bridging Open-Source Repositories and Large Language Model Agents through Automated Tool Standardization

ToolRosetta is a unified framework that automatically transforms heterogeneous open-source code repositories into standardized, secure, and executable Model Context Protocol (MCP) tools, enabling LLM agents to autonomously plan and invoke specialized software for complex tasks with minimal human intervention.

Shimin Di, Xujie Yuan, Hanghui Guo, Chaoqian Ouyang, Zhangze Chen, Ling Yue, Libin Zheng, Jia Zhu, Shaowu Pan, Jian Yin, Min-Ling Zhang, Yong RuiWed, 11 Ma💻 cs

See, Plan, Rewind: Progress-Aware Vision-Language-Action Models for Robust Robotic Manipulation

The paper introduces See, Plan, Rewind (SPR), a progress-aware vision-language-action framework that enhances robotic manipulation robustness by dynamically grounding instructions into spatial subgoals and enabling closed-loop error recovery through state rewinding, achieving state-of-the-art performance on challenging benchmarks without additional training.

Tingjun Dai, Mingfei Han, Tingwen Du, Zhiheng Liu, Zhihui Li, Salman Khan, Jun Yu, Xiaojun ChangWed, 11 Ma💻 cs

External entropy supply for IoT devices employing a RISC-V Trusted Execution Environment

This paper proposes and validates an open-source RISC-V-based Trusted Execution Environment that acts as an external entropy service, enabling constrained IoT devices to securely obtain high-quality random numbers for cryptographic key generation by leveraging initial trust and potentially expanding with additional sensor-based entropy sources.

Arttu Paju, Alejandro Cabrera Aldaya, Nicola Tuveri, Juha Savimäki, Marko Kivikangas, Brian McGillionWed, 11 Ma💻 cs

IntroSVG: Learning from Rendering Feedback for Text-to-SVG Generation via an Introspective Generator-Critic Framework

The paper introduces IntroSVG, an introspective framework that enhances text-to-SVG generation by employing a unified Visual Language Model in a closed-loop "generate-review-refine" cycle, where the model acts as both generator and critic to iteratively improve outputs based on visual rendering feedback.

Feiyu Wang, Jiayuan Yang, Zhiyuan Zhao, Da Zhang, Bingyu Li, Peng Liu, Junyu GaoWed, 11 Ma💻 cs

NLiPsCalib: An Efficient Calibration Framework for High-Fidelity 3D Reconstruction of Curved Visuotactile Sensors

The paper presents NLiPsCalib, an efficient and physics-consistent calibration framework that utilizes Near-Light Photometric Stereo and controllable light sources to enable high-fidelity 3D reconstruction of curved visuotactile sensors through simple contacts with everyday objects, thereby overcoming the cost and complexity of existing methods.

Xuhao Qin, Feiyu Zhao, Yatao Leng, Runze Hu, Chenxi XiaoWed, 11 Ma💻 cs

OddGridBench: Exposing the Lack of Fine-Grained Visual Discrepancy Sensitivity in Multimodal Large Language Models

This paper introduces OddGridBench, a benchmark revealing that current multimodal large language models significantly underperform humans in detecting fine-grained visual discrepancies, and proposes OddGrid-GRPO, a reinforcement learning framework that effectively enhances this sensitivity through curriculum learning and distance-aware rewards.

Tengjin Weng, Wenhao Jiang, Jingyi Wang, Ming Li, Lin Ma, Zhong MingWed, 11 Ma💻 cs

Dynamic Precision Math Engine for Linear Algebra and Trigonometry Acceleration on Xtensa LX6 Microcontrollers

This paper presents a Dynamic Precision Math Engine for ESP32 microcontrollers that integrates Q16.16 fixed-point arithmetic, a CORDIC trigonometric module, and a cache-aware matrix kernel to achieve significant speedups in linear algebra and trigonometry through a runtime-switchable architecture that balances integer efficiency with floating-point precision.

Elian Alfonso Lopez PreciadoWed, 11 Ma💻 cs

Can ChatGPT Generate Realistic Synthetic System Requirement Specifications? Results of a Case Study

This case study demonstrates that while ChatGPT can generate realistic synthetic system requirement specifications across multiple industries using iterative prompt engineering, the resulting artifacts still contain significant flaws that necessitate thorough expert evaluation rather than relying solely on LLM-based quality assessments.

Alex R. Mattukat, Florian M. Braun, Horst LichterWed, 11 Ma💻 cs

The Virtuous Cycle: AI-Powered Vector Search and Vector Search-Augmented AI

This ICDE 2026 tutorial paper provides a comprehensive overview of the synergistic "virtuous cycle" between AI and vector search, detailing how AI enhances vector search efficiency and how vector search, particularly through Retrieval-Augmented Generation, empowers Large Language Models, while also exploring co-optimization strategies, challenges, and future research directions.

Jiuqi Wei, Quanqing Xu, Chuanhui YangWed, 11 Ma💻 cs

ProvAgent: Threat Detection Based on Identity-Behavior Binding and Multi-Agent Collaborative Attack Investigation

ProvAgent is a novel framework that enhances threat detection and investigation by integrating graph contrastive learning for high-fidelity alert generation with a multi-agent collaborative system to autonomously reconstruct complex APT attack processes, thereby overcoming the limitations of traditional human-model collaboration.

Wenhao Yan, Ning An, Linxu Li, Bingsheng Bi, Bo Jiang, Zhigang Lu, Baoxu Liu, Junrong Liu, Cong DongWed, 11 Ma💻 cs

Evidential Perfusion Physics-Informed Neural Networks with Residual Uncertainty Quantification

This paper introduces Evidential Perfusion Physics-Informed Neural Networks (EPPINN), a novel framework that integrates evidential deep learning with physics-informed modeling to quantify both aleatoric and epistemic uncertainties in CT perfusion imaging, thereby achieving superior accuracy and reliability in acute ischemic stroke assessment compared to existing deterministic methods.

Junhyeok Lee, Minseo Choi, Han Jang, Young Hun Jeon, Heeseong Eum, Joon Jang, Chul-Ho Sohn, Kyu Sung ChoiWed, 11 Ma💻 cs