TriLite: Efficient Weakly Supervised Object Localization with Universal Visual Features and Tri-Region Disentanglement
TriLite is a parameter-efficient, single-stage framework for weakly supervised object localization. It pairs a frozen self-supervised ViT backbone with a novel TriHead module that disentangles foreground, background, and ambiguous regions, achieving state-of-the-art performance with minimal trainable parameters.
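
To make the idea of a frozen backbone with a small trainable head concrete, the following is a minimal, dependency-free sketch. It is not the TriHead design, which this summary does not specify. It assumes, purely for illustration, that the head is a single linear layer mapping each patch feature to three scores that are normalized with a softmax. The names `TriRegionHead` and `frozen_backbone`, the feature width of 384, and the 196 patches are all hypothetical, and the random features merely stand in for the output of a real ViT.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a short list of scores."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def frozen_backbone(num_patches, dim, seed=1):
    """Placeholder for a frozen ViT (assumption): fixed random patch features.

    Nothing here is ever updated, mirroring a backbone whose weights stay frozen.
    """
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_patches)]


class TriRegionHead:
    """Hypothetical three-way head: one linear layer with 3 outputs per patch."""

    REGIONS = ("foreground", "background", "ambiguous")

    def __init__(self, dim, seed=0):
        rng = random.Random(seed)
        # The only trainable parameters in this sketch: a 3 x dim matrix and 3 biases.
        self.weight = [[rng.gauss(0.0, 0.02) for _ in range(dim)] for _ in range(3)]
        self.bias = [0.0, 0.0, 0.0]

    def num_parameters(self):
        return sum(len(row) for row in self.weight) + len(self.bias)

    def __call__(self, patch_features):
        """Return, for each patch, a probability for each of the three regions."""
        region_probs = []
        for feat in patch_features:
            logits = [
                sum(w * x for w, x in zip(row, feat)) + b
                for row, b in zip(self.weight, self.bias)
            ]
            region_probs.append(softmax(logits))
        return region_probs


# Illustrative sizes only: 14 x 14 = 196 patches, 384-dimensional features.
features = frozen_backbone(num_patches=196, dim=384)
head = TriRegionHead(dim=384)
region_probs = head(features)

print(head.num_parameters())  # 3 * 384 + 3 = 1155
```

Under these assumptions the head has 1,155 trainable parameters, which illustrates why keeping the backbone frozen leaves very little to train. In a real pipeline the foreground probabilities would be reshaped to the patch grid and thresholded to obtain a localization map; that step is omitted here because the summary does not describe it.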