Submitted by nielsr · How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks · 6 authors
Submitted by akhaliq · Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation · 6 authors
Submitted by RajveeSheth · Eka-Eval: A Comprehensive Evaluation Framework for Large Language Models in Indian Languages · Lingo Research Group
Submitted by violetxi · LitBench: A Benchmark and Dataset for Reliable Evaluation of Creative Writing · 6 authors