Running Agents Featured 587 LLM-Perf Leaderboard 🏆 587 Compare LLM hardware performance and find the best model
Running Agents 1.51k Big Code Models Leaderboard 📈 1.51k Explore and compare code model performance on a leaderboard
Running on Zero Agents 18 Chat with Gemma-2-9B-Chinese-Chat 💬 18 Chat with a Chinese language assistant
Running Agents 430 Reward Bench Leaderboard 📐 430 Explore and compare model scores on RewardBench benchmarks