WAFFLE: Finetuning Multi-Modal Models for Automated Front-End Development
The paper introduces Waffle, a novel fine-tuning strategy that employs structure-aware attention and contrastive learning to significantly enhance multi-modal models' ability to convert UI designs into functional HTML code, outperforming existing methods on both new and established benchmarks.