VLMEvalKit Evaluation Results Collection
View and submit leaderboard data for system evaluations
Sahara is a comprehensive benchmark for African NLP.
Display and analyze prediction leaderboard data