Leaderboards - a Felladrin Collection

Felladrin 's Collections

Trained Models 🏋️

Frequently Used Spaces

Foundation Text-Generation Models Below 360M Parameters

Leaderboards

updated Oct 14

Gotta rank 'em all!

Running

4.68k

LMArena Leaderboard

🏆

4.68k

Display LMArena Leaderboard
Running

120

Berkeley Function Calling Leaderboard

🏃

120

Display Berkeley Function-Calling Leaderboard
Running on CPU Upgrade

238

MMLU-Pro Leaderboard

🥇

238

More advanced and challenging multi-task evaluation
Running

295

GPU Poor LLM Arena

🏆

295

Compact LLM Battle Arena: Frugal AI Face-Off!
Running

178

Video Generation Leaderboard

📊

178

Text to Video and Image to Video Arena & Leaderboard
Running

Featured

83

Music Arena Leaderboard

🎵

83

AI Music Arena & Leaderboard (Suno, Udio, Google, Meta, +)
Running on CPU Upgrade

436

Agent Leaderboard

💬

436

Ranking of LLMs for agentic tasks
Running

1.29k

UGI Leaderboard

📢

1.29k

Uncensored General Intelligence Leaderboard
Running on Zero

30

SLM RAG Arena

🤼

30

Compare two AI models' answers to document questions
Running

226

BigCodeBench Leaderboard

🥇

226

Explore and analyze code completion benchmarks
Running

450

Can Ai Code Results

🏆

450

Can AI Code? An LLM leaderboard inclquantized models.
Running

9

Web Bench Leaderboard

🥇

9

Duplicate this leaderboard to initialize your own!
Running on CPU Upgrade

6.76k

MTEB Leaderboard

🥇

6.76k

Embedding Leaderboard
Running

Featured

573

LLM-Perf Leaderboard

🏆

573

Explore hardware performance for LLMs
Running on CPU Upgrade

178

LLM Hallucination Leaderboard

🚀

178

Generate visual data analysis plots
Running

16

LLM Inference Benchmark

🥇

16

Explore LLM performance with a leaderboard
Running

18

Edge LLM Leaderboard

🌖

18

Display hardware performance leaderboard
Running

2

RPEval

🏆

2

Evaluating LLMs by their role-playing capabilities.
Running on CPU Upgrade

Featured

1.15k

Open ASR Leaderboard

🏆

1.15k

Display and request speech recognition model benchmarks
Running on CPU Upgrade

937

Open VLM Leaderboard

🌎

937

VLMEvalKit Evaluation Results Collection