LLM TravelPlanner: A Benchmark for Real-World Planning with Language Agents Paper โข 2402.01622 โข Published Feb 2, 2024 โข 37 User-LLM: Efficient LLM Contextualization with User Embeddings Paper โข 2402.13598 โข Published Feb 21, 2024 โข 21
TravelPlanner: A Benchmark for Real-World Planning with Language Agents Paper โข 2402.01622 โข Published Feb 2, 2024 โข 37
User-LLM: Efficient LLM Contextualization with User Embeddings Paper โข 2402.13598 โข Published Feb 21, 2024 โข 21
Leaderboards Running Featured 563 Image Arena Leaderboard ๐ 563 Image Generation and Image Editing Arena & Leaderboard Running on CPU Upgrade 6.86k MTEB Leaderboard ๐ฅ 6.86k Embedding Leaderboard Running on CPU Upgrade 13.8k Open LLM Leaderboard ๐ 13.8k Track, rank and evaluate open LLMs and chatbots Running 4.7k LMArena Leaderboard ๐ 4.7k Display LMArena Leaderboard
Running Featured 563 Image Arena Leaderboard ๐ 563 Image Generation and Image Editing Arena & Leaderboard
Running on CPU Upgrade 13.8k Open LLM Leaderboard ๐ 13.8k Track, rank and evaluate open LLMs and chatbots
LLM TravelPlanner: A Benchmark for Real-World Planning with Language Agents Paper โข 2402.01622 โข Published Feb 2, 2024 โข 37 User-LLM: Efficient LLM Contextualization with User Embeddings Paper โข 2402.13598 โข Published Feb 21, 2024 โข 21
TravelPlanner: A Benchmark for Real-World Planning with Language Agents Paper โข 2402.01622 โข Published Feb 2, 2024 โข 37
User-LLM: Efficient LLM Contextualization with User Embeddings Paper โข 2402.13598 โข Published Feb 21, 2024 โข 21
Leaderboards Running Featured 563 Image Arena Leaderboard ๐ 563 Image Generation and Image Editing Arena & Leaderboard Running on CPU Upgrade 6.86k MTEB Leaderboard ๐ฅ 6.86k Embedding Leaderboard Running on CPU Upgrade 13.8k Open LLM Leaderboard ๐ 13.8k Track, rank and evaluate open LLMs and chatbots Running 4.7k LMArena Leaderboard ๐ 4.7k Display LMArena Leaderboard
Running Featured 563 Image Arena Leaderboard ๐ 563 Image Generation and Image Editing Arena & Leaderboard
Running on CPU Upgrade 13.8k Open LLM Leaderboard ๐ 13.8k Track, rank and evaluate open LLMs and chatbots