Best-of- -- Asymptotic Performance of Test-Time LLM Ensembling
This paper analyzes the asymptotic performance of best-of- LLM ensembling via majority voting as , proposing an adaptive generation scheme to efficiently allocate inference budgets and an optimal weighted ensemble method formulated as a mixed-integer linear program to outperform individual models.