Jintao Huang

study-hjt

https://github.com/Jintao-Huang

AI & ML interests

None yet

Recent Activity

New activity in openai/gpt-oss-20b 18 days ago

🚀[Fine-tuning] LoRA fine-tuning openai/gpt-oss-20b 👋

#43 opened 19 days ago by

New activity in Qwen/Qwen3-235B-A22B-Instruct-2507-FP8 27 days ago

need official awq weights

#2 opened about 1 month ago by

New activity in Qwen/Qwen3-235B-A22B-Instruct-2507 about 1 month ago

🚀[Fine-tuning] 8x80GiB GPUs LoRA finetuning Qwen3-235B-A22B-Instruct-2507

#25 opened about 1 month ago by

4 bit quantisation release?

#9 opened about 1 month ago by

int4 and awq version

#23 opened about 1 month ago by

liked a model about 1 month ago

Qwen/Qwen3-235B-A22B-Instruct-2507

Text Generation • 235B • Updated 8 days ago • 82.7k • 650

New activity in Tongyi-Zhiwen/QwenLong-L1-32B 3 months ago

provide int4 version pls

#2 opened 3 months ago by

New activity in Qwen/Qwen3-235B-A22B 4 months ago

GPTQ/AWQ

#3 opened 4 months ago by

New activity in Qwen/Qwen3-30B-A3B 4 months ago

AWQ quantized model support timeline?

#12 opened 4 months ago by

New activity in Qwen/Qwen3-235B-A22B 4 months ago

🚀[Fine-tuning] Qwen3-MoE Megatron Training Implementation and Best Practices👋

#6 opened 4 months ago by

New activity in Qwen/Qwen3-30B-A3B 4 months ago

🚀[Fine-tuning] Qwen3-MoE Megatron Training Implementation and Best Practices👋

#3 opened 4 months ago by

New activity in Qwen/Qwen3-32B 4 months ago

🚀[Fine-tuning] Implementation and Best Practices for Qwen3 CPT/SFT/DPO/GRPO Training👋

#7 opened 4 months ago by

New activity in Qwen/Qwen3-8B 4 months ago

🚀[Fine-tuning] Implementation and Best Practices for Qwen3 CPT/SFT/DPO/GRPO Training👋

#3 opened 4 months ago by

New activity in Qwen/Qwen2.5-Omni-7B 4 months ago

[Fine-tuning] 🚀SFT/DPO/GRPO support!

#20 opened 5 months ago by

New activity in microsoft/Phi-4-multimodal-instruct 6 months ago

thanks , how to fine tune?

#1 opened 6 months ago by

upvoted a paper 8 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 373

updated a model about 1 year ago

study-hjt/Qwen1.5-110B-Chat-GPTQ-Int4

Text Generation • 17B • Updated Aug 14, 2024 • 4 • 2

updated a dataset about 1 year ago

modelscope/self-cognition

Viewer • Updated Jun 8, 2024 • 108 • 171 • 19

liked a dataset about 1 year ago

modelscope/self-cognition

Viewer • Updated Jun 8, 2024 • 108 • 171 • 19

liked a model over 1 year ago

study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int4

Text Generation • 11B • Updated Apr 23, 2024 • 5 • 6