Foundational World Models Accurately Detect Bimanual Manipulator Failures
This paper introduces a lightweight, probabilistic world model built on a pretrained vision foundation model that generates uncertainty-based runtime monitors to accurately detect anomalous failures in bimanual manipulators, outperforming existing baselines while requiring significantly fewer trainable parameters.