The SimpleQA score of the model is WAY off.
🔥
4
3
#2 opened about 2 months ago
by
phil111
Tool support? Multiple tool calls?
#1 opened about 2 months ago
by
chriswritescode
