-
open-llm-leaderboard/tensopolis__virtuoso-lite-tensopolis-v2-details
Viewer • Updated • 43.2k • 267 • 1 -
open-llm-leaderboard/tensopolis__falcon3-10b-tensopolis-v1-details
Viewer • Updated • 43.2k • 397 -
open-llm-leaderboard/Pinkstack__SuperThoughts-CoT-14B-16k-o1-QwQ-details
Viewer • Updated • 43.2k • 121 • 2 -
open-llm-leaderboard/prithivMLmods__QwQ-LCoT-14B-Conversational-details
Viewer • Updated • 43.2k • 123 • 1
AI & ML interests
Evaluating open LLMs
Recent Activity
Open LLM Leaderboard
This is the hub organisation maintaining the Open LLM Leaderboard.
In this space you will find the dataset with detailed results and queries for the models on the leaderboard.
Score results are here, and current state of requests is here. For the detailed prediction, look for your model name in the datasets below!
-
Open LLM Leaderboard
🏆13.7kTrack, rank and evaluate open LLMs and chatbots
-
Open-LLM performances are plateauing, let’s make the leaderboard steep again
🏔125Explore and compare advanced language models on a new leaderboard
-
open-llm-leaderboard/contents
Viewer • Updated • 4.58k • 10.4k • 21 -
open-llm-leaderboard/results
Preview • Updated • 21.5k • 16
-
open-llm-leaderboard/tensopolis__virtuoso-lite-tensopolis-v2-details
Viewer • Updated • 43.2k • 267 • 1 -
open-llm-leaderboard/tensopolis__falcon3-10b-tensopolis-v1-details
Viewer • Updated • 43.2k • 397 -
open-llm-leaderboard/Pinkstack__SuperThoughts-CoT-14B-16k-o1-QwQ-details
Viewer • Updated • 43.2k • 121 • 2 -
open-llm-leaderboard/prithivMLmods__QwQ-LCoT-14B-Conversational-details
Viewer • Updated • 43.2k • 123 • 1
-
Open LLM Leaderboard
🏆13.7kTrack, rank and evaluate open LLMs and chatbots
-
Open-LLM performances are plateauing, let’s make the leaderboard steep again
🏔125Explore and compare advanced language models on a new leaderboard
-
open-llm-leaderboard/contents
Viewer • Updated • 4.58k • 10.4k • 21 -
open-llm-leaderboard/results
Preview • Updated • 21.5k • 16
spaces
5
Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
Open LLM Leaderboard Model Comparator
Compare Open LLM Leaderboard results
Open-LLM performances are plateauing, let’s make the leaderboard steep again
Explore and compare advanced language models on a new leaderboard