ReTac-ACT: A State-Gated Vision-Tactile Fusion Transformer for Precision Assembly
ReTac-ACT is a state-gated vision-tactile fusion transformer for high-precision assembly in occluded, contact-rich environments. It dynamically prioritizes tactile feedback through bidirectional cross-attention and proprioception-conditioned gating, and it outperforms vision-only baselines on the NIST Assembly Task Board M1 benchmark.
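
To make the two named mechanisms concrete, the following is a minimal sketch, not the ReTac-ACT implementation. It assumes that bidirectional cross-attention means vision tokens attend to tactile tokens and tactile tokens attend to vision tokens, and that the proprioception-conditioned gate is a single sigmoid scalar that blends the pooled tactile and vision features. The summary above does not specify either detail. All function names, the absence of learned projections, the mean pooling, and the scalar gate are illustrative assumptions.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross_attend(queries, context):
    """Each query token attends over the other modality's tokens.

    Assumption: no learned Q/K/V projections; the raw tokens are used
    directly, with a residual connection back to the query.
    """
    dim = len(queries[0])
    attended_tokens = []
    for q in queries:
        weights = softmax([dot(q, k) / math.sqrt(dim) for k in context])
        mixed = [sum(w * c[i] for w, c in zip(weights, context)) for i in range(dim)]
        attended_tokens.append([qi + mi for qi, mi in zip(q, mixed)])
    return attended_tokens


def mean_pool(tokens):
    """Average a list of token vectors into one feature vector."""
    count = len(tokens)
    return [sum(t[i] for t in tokens) / count for i in range(len(tokens[0]))]


def state_gate(proprio, gate_weights, gate_bias):
    """Hypothetical gate: sigmoid of a linear function of the robot state."""
    return 1.0 / (1.0 + math.exp(-(dot(proprio, gate_weights) + gate_bias)))


def fuse(vision_tokens, tactile_tokens, proprio, gate_weights, gate_bias):
    """Bidirectional cross-attention followed by state-gated blending."""
    vision_ctx = cross_attend(vision_tokens, tactile_tokens)   # vision -> tactile
    tactile_ctx = cross_attend(tactile_tokens, vision_tokens)  # tactile -> vision
    gate = state_gate(proprio, gate_weights, gate_bias)
    vision_feat = mean_pool(vision_ctx)
    tactile_feat = mean_pool(tactile_ctx)
    # A gate near 1 prioritizes tactile features; a gate near 0 favors vision.
    fused = [gate * t + (1.0 - gate) * v for t, v in zip(tactile_feat, vision_feat)]
    return fused, gate


if __name__ == "__main__":
    vision = [[0.2, 0.1], [0.4, -0.3]]
    tactile = [[1.0, 0.5], [0.0, 0.8], [-0.2, 0.1]]
    fused, gate = fuse(vision, tactile, proprio=[0.0, 0.0],
                       gate_weights=[1.0, 1.0], gate_bias=0.0)
    print(gate, fused)
```

In this sketch a zero proprioceptive state with zero bias gives a gate of 0.5, so both modalities contribute equally. A trained policy would presumably learn gate parameters that push the gate toward the tactile branch during contact phases, when the camera view is occluded.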