Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
22
1
11
Jintao Huang
study-hjt
Follow
TheOneTrueNiz's profile picture
ealix's profile picture
leoozh's profile picture
10 followers
·
2 following
https://github.com/Jintao-Huang
AI & ML interests
None yet
Recent Activity
new
activity
18 days ago
openai/gpt-oss-20b:
🚀[Fine-tuning] LoRA fine-tuning openai/gpt-oss-20b 👋
new
activity
27 days ago
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8:
need official awq weights
new
activity
about 1 month ago
Qwen/Qwen3-235B-A22B-Instruct-2507:
🚀[Fine-tuning] 8x80GiB GPUs LoRA finetuning Qwen3-235B-A22B-Instruct-2507
View all activity
Organizations
study-hjt
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
openai/gpt-oss-20b
18 days ago
🚀[Fine-tuning] LoRA fine-tuning openai/gpt-oss-20b 👋
👀
👍
5
1
#43 opened 19 days ago by
study-hjt
New activity in
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
27 days ago
need official awq weights
4
#2 opened about 1 month ago by
wangruiai2023
New activity in
Qwen/Qwen3-235B-A22B-Instruct-2507
about 1 month ago
🚀[Fine-tuning] 8x80GiB GPUs LoRA finetuning Qwen3-235B-A22B-Instruct-2507
🤗
4
1
#25 opened about 1 month ago by
study-hjt
4 bit quantisation release?
➕
9
1
#9 opened about 1 month ago by
mochiyo
int4 and awq version
1
#23 opened about 1 month ago by
devops724
liked
a model
about 1 month ago
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation
•
235B
•
Updated
8 days ago
•
82.7k
•
•
650
New activity in
Tongyi-Zhiwen/QwenLong-L1-32B
3 months ago
provide int4 version pls
➕
👀
2
4
#2 opened 3 months ago by
Josh1026
New activity in
Qwen/Qwen3-235B-A22B
4 months ago
GPTQ/AWQ
👀
13
4
#3 opened 4 months ago by
ndurkee
New activity in
Qwen/Qwen3-30B-A3B
4 months ago
AWQ quantized model support timeline?
👍
8
2
#12 opened 4 months ago by
hyunw55
New activity in
Qwen/Qwen3-235B-A22B
4 months ago
🚀[Fine-tuning] Qwen3-MoE Megatron Training Implementation and Best Practices👋
🚀
6
1
#6 opened 4 months ago by
study-hjt
New activity in
Qwen/Qwen3-30B-A3B
4 months ago
🚀[Fine-tuning] Qwen3-MoE Megatron Training Implementation and Best Practices👋
🚀
4
#3 opened 4 months ago by
study-hjt
New activity in
Qwen/Qwen3-32B
4 months ago
🚀[Fine-tuning] Implementation and Best Practices for Qwen3 CPT/SFT/DPO/GRPO Training👋
🚀
🔥
3
#7 opened 4 months ago by
study-hjt
New activity in
Qwen/Qwen3-8B
4 months ago
🚀[Fine-tuning] Implementation and Best Practices for Qwen3 CPT/SFT/DPO/GRPO Training👋
🚀
👍
4
#3 opened 4 months ago by
study-hjt
New activity in
Qwen/Qwen2.5-Omni-7B
4 months ago
[Fine-tuning] 🚀SFT/DPO/GRPO support!
#20 opened 5 months ago by
study-hjt
New activity in
microsoft/Phi-4-multimodal-instruct
6 months ago
thanks , how to fine tune?
20
#1 opened 6 months ago by
NickyNicky
upvoted
a
paper
8 months ago
Qwen2.5 Technical Report
Paper
•
2412.15115
•
Published
Dec 19, 2024
•
373
updated
a model
about 1 year ago
study-hjt/Qwen1.5-110B-Chat-GPTQ-Int4
Text Generation
•
17B
•
Updated
Aug 14, 2024
•
4
•
2
updated
a dataset
about 1 year ago
modelscope/self-cognition
Viewer
•
Updated
Jun 8, 2024
•
108
•
171
•
19
liked
a dataset
about 1 year ago
modelscope/self-cognition
Viewer
•
Updated
Jun 8, 2024
•
108
•
171
•
19
liked
a model
over 1 year ago
study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int4
Text Generation
•
11B
•
Updated
Apr 23, 2024
•
5
•
6
Load more