Molecules Meet Language: Confound-Aware Representation Learning and Chemical Property Steering in Transformer-VAE Latent Spaces
This paper demonstrates that while unsupervised Transformer-VAE latent spaces trained on SELFIES can support meaningful chemical property steering, such control is only valid when rigorously validated through decoded molecules and confound-aware evaluation to distinguish genuine chemical signals from sequence-level artifacts.