Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
70
1
48
Phil
phil111
Follow
joaquinrfs's profile picture
EdSurridge's profile picture
nlpguy's profile picture
36 followers
·
38 following
AI & ML interests
None yet
Recent Activity
new
activity
2 days ago
nvidia/NVIDIA-Nemotron-Nano-9B-v2:
This just trades general performance for domain specific gains.
new
activity
4 days ago
ByteDance-Seed/Seed-OSS-36B-Base:
Please stop blindly trusting and reporting Alibaba's scores.
new
activity
4 days ago
google/gemma-3-270m:
Weird responses
View all activity
Organizations
None yet
phil111
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
nvidia/NVIDIA-Nemotron-Nano-9B-v2
2 days ago
This just trades general performance for domain specific gains.
🔥
👍
14
9
#3 opened 6 days ago by
phil111
New activity in
ByteDance-Seed/Seed-OSS-36B-Base
4 days ago
Please stop blindly trusting and reporting Alibaba's scores.
👍
6
2
#1 opened 4 days ago by
phil111
New activity in
google/gemma-3-270m
4 days ago
Weird responses
9
#10 opened 8 days ago by
vparth7
New activity in
google/gemma-3-270m-it
7 days ago
Gemma A3B
👍
5
13
#3 opened 10 days ago by
Maria99934
liked
a dataset
10 days ago
Codatta/MM-Food-100K
Viewer
•
Updated
7 days ago
•
100k
•
655
•
23
New activity in
openai/gpt-oss-120b
12 days ago
gpt-oss is actually good. even on less common benchmark
🤝
👍
5
2
#109 opened 13 days ago by
weijiejailbreak
New activity in
openai/gpt-oss-20b
16 days ago
model quality issues
5
#92 opened 16 days ago by
TheBigBlockPC
New activity in
Qwen/Qwen3-4B-Instruct-2507
17 days ago
Terrible instruction following
👍
1
4
#3 opened 18 days ago by
denisalpino
New activity in
Qwen/Qwen3-4B-Instruct-2507
18 days ago
4b model with an 84.2 MMLU-Redux score?
🤝
3
1
#2 opened 18 days ago by
phil111
New activity in
openai/gpt-oss-20b
18 days ago
This model is unbelievably ignorant.
➕
👍
40
14
#14 opened 19 days ago by
phil111
New activity in
openai/gpt-oss-120b
19 days ago
Knowledge limitations
👍
2
5
#25 opened 19 days ago by
hexess
New activity in
Qwen/Qwen3-30B-A3B-Instruct-2507
19 days ago
An Improvement, But Q3 30b Still Has Very Little General Knowledge
👍
❤️
3
10
#2 opened 26 days ago by
phil111
Test Scores Can Be Misleading
👍
1
8
#8 opened 25 days ago by
phil111
New activity in
Qwen/Qwen3-235B-A22B-Instruct-2507
23 days ago
More Knowledge, But Hard To Extract
#29 opened 23 days ago by
phil111
New activity in
zai-org/GLM-4.5
25 days ago
Impressive Broad Knowledge
👍
👀
5
4
#12 opened 25 days ago by
phil111
liked
2 models
27 days ago
zai-org/GLM-4.5-Air
Text Generation
•
110B
•
Updated
13 days ago
•
72.9k
•
•
386
zai-org/GLM-4.5
Text Generation
•
358B
•
Updated
13 days ago
•
56.2k
•
•
1.25k
New activity in
baidu/ERNIE-4.5-300B-A47B-PT
28 days ago
The SimpleQA score of the model is WAY off.
🔥
4
3
#2 opened about 2 months ago by
phil111
New activity in
Qwen/Qwen3-30B-A3B
29 days ago
Qwen3 is great, but could be better.
👍
9
25
#18 opened 4 months ago by
phil111
New activity in
Qwen/Qwen3-235B-A22B-Instruct-2507
about 1 month ago
SimpleQA jumped from 12.2 to 54.3?
🔥
🧠
22
25
#4 opened about 1 month ago by
phil111
Load more