PvP: Data-Efficient Humanoid Robot Learning with Proprioceptive-Privileged Contrastive Representations
This paper introduces PvP, a proprioceptive-privileged contrastive learning framework that enhances data efficiency and robustness in humanoid robot whole-body control by learning compact task-relevant representations without hand-crafted augmentations, supported by the new SRL4Humanoid evaluation framework.