OrdinalBench: A Benchmark Dataset for Diagnosing Generalization Limits in Ordinal Number Understanding of Vision-Language Models
The paper introduces OrdinalBench, a comprehensive benchmark dataset and evaluation framework designed to diagnose and expose the significant generalization limitations of Vision-Language Models in understanding ordinal numbers and performing sequential reasoning tasks involving large indices and complex paths.