LMArena Leaderboard
Display LMArena Leaderboard
Gotta rank 'em all!
Display LMArena Leaderboard
Display Berkeley Function-Calling Leaderboard
More advanced and challenging multi-task evaluation
Compact LLM Battle Arena: Frugal AI Face-Off!
Text to Video and Image to Video Arena & Leaderboard
AI Music Arena & Leaderboard (Suno, Udio, Google, Meta, +)
Ranking of LLMs for agentic tasks
Uncensored General Intelligence Leaderboard
Compare two AI models' answers to document questions
Explore and analyze code completion benchmarks
Can AI Code? An LLM leaderboard inclquantized models.
Duplicate this leaderboard to initialize your own!
Embedding Leaderboard
Explore hardware performance for LLMs
Generate visual data analysis plots
Explore LLM performance with a leaderboard
Display hardware performance leaderboard
Evaluating LLMs by their role-playing capabilities.
Display and request speech recognition model benchmarks
VLMEvalKit Evaluation Results Collection