BigCodeBench Leaderboard: Explore code-generation model leaderboards and task details
Nexus Function Calling Leaderboard: Display benchmark results for models on various tasks