MASFactory: A Graph-centric Framework for Orchestrating LLM-Based Multi-Agent Systems with Vibe Graphing

This paper introduces MASFactory, a graph-centric framework that utilizes a human-in-the-loop "Vibe Graphing" approach to automatically compile natural language intents into executable multi-agent system workflows, thereby addressing challenges in manual implementation, component reuse, and context integration while demonstrating effectiveness across seven public benchmarks.

Yang Liu, Jinxuan Cai, Yishen Li, Qi Meng, Zedi Liu, Xin Li, Chen Qian, Chuan Shi, Cheng Yang2026-03-09🤖 cs.AI

Probing Visual Concepts in Lightweight Vision-Language Models for Automated Driving

This paper investigates failure modes in lightweight Vision-Language Models for automated driving by analyzing intermediate activations to reveal that while some visual concepts like object presence are linearly encoded, others like orientation are not, leading to either perceptual or cognitive failures that are further exacerbated by object distance.

Nikos Theodoridis, Reenu Mohandas, Ganesh Sistu, Anthony Scanlan, Ciarán Eising, Tim Brophy2026-03-09🤖 cs.AI

Agentic LLM Planning via Step-Wise PDDL Simulation: An Empirical Characterisation

This paper introduces PyPDDLEngine, an open-source PDDL simulation engine that enables agentic LLM planning via step-wise interaction, demonstrating that while this approach yields a modest 3% success rate improvement over direct LLM planning on Blocksworld tasks, it incurs significantly higher costs and lacks the external verification mechanisms that drive success in other coding agent applications.

Kai Göbel, Pierrick Lorang, Patrik Zips, Tobias Glück2026-03-09🤖 cs.AI

Evaluating Austrian A-Level German Essays with Large Language Models for Automated Essay Scoring

This paper evaluates the performance of four state-of-the-art open-weight Large Language Models on Austrian A-level German essay grading using rubric-based prompts, finding that while they can apply standardized criteria, their low agreement rates with human experts (maximum 40.6% on sub-dimensions and 32.8% on final grades) render them currently unsuitable for real-world automated scoring.

Jonas Kubesch, Lena Huber, Clemens Havas2026-03-09🤖 cs.AI

Lifelong Embodied Navigation Learning

This paper introduces Uni-Walker, a lifelong embodied navigation framework that addresses catastrophic forgetting in large language model-based agents by decoupling navigation knowledge into shared and task-specific components using DE-LoRA, knowledge inheritance, and expert subspace orthogonality to enable continuous adaptation across diverse scenes and instruction styles.

Xudong Wang, Jiahua Dong, Baichen Liu, Qi Lyu, Lianqing Liu, Zhi Han2026-03-09🤖 cs.AI

StreamVoiceAnon+: Emotion-Preserving Streaming Speaker Anonymization via Frame-Level Acoustic Distillation

StreamVoiceAnon+ is a streaming speaker anonymization system that preserves emotional content by combining supervised finetuning with neutral-emotion pairs and frame-level acoustic distillation, achieving significant improvements in emotion preservation (49.2% UAR) and intelligibility (5.77% WER) while maintaining strong privacy and zero inference latency overhead.

Nikita Kuzmin, Kong Aik Lee, Eng Siong Chng2026-03-09🤖 cs.AI

Experiences Build Characters: The Linguistic Origins and Functional Impact of LLM Personality

This study demonstrates that exposing Large Language Models to domain-specific texts via continued pre-training shapes distinct machine personalities that influence problem-solving, revealing a "Suppression Advantage" where reduced social traits enhance complex reasoning while identifying a bimodal competence peak between "Expressive Generalists" and "Suppressed Specialists."

Xi Wang, Mengdie Zhuang, Jiqun Liu2026-03-09🤖 cs.AI

Place-it-R1: Unlocking Environment-aware Reasoning Potential of MLLM for Video Object Insertion

Place-it-R1 is an end-to-end framework that leverages Multimodal Large Language Models (MLLMs) with Chain-of-Thought reasoning to orchestrate video diffusion via a "Think-then-Place" paradigm, ensuring physically consistent and environment-aware video object insertion through iterative refinement and user-controllable plausibility-fidelity trade-offs.

Bohai Gu, Taiyi Wu, Dazhao Du, Jian Liu, Shuai Yang, Xiaotong Zhao, Alan Zhao, Song Guo2026-03-09🤖 cs.AI