Yaowei Zheng
hiyouga
2553 followers · 36 following
https://github.com/hiyouga
llamafactory_ai
hiyouga
AI & ML interests
LLM Training System
Recent Activity
liked a model 4 days ago: internlm/Intern-S1-mini
new activity 5 days ago: google/gemma-3-270m-it: ValueError During SFT Fine-tuning with Gamma3 Model
liked a dataset 10 days ago: nvidia/Llama-Nemotron-VLM-Dataset-v1
Organizations
Articles
1
Article
32
GaLore: Advancing Large Model Training on Consumer-grade Hardware
Papers
6
arxiv:2508.02317
arxiv:2507.04009
arxiv:2501.12326
arxiv:2410.10315
spaces
1
pinned
Sleeping
207
LLaMA Board
🦙
Fine-tuning large language model with Gradio UI
models
10
hiyouga/Qwen2-VL-7B-Pokemon • Visual Question Answering • Updated Nov 27, 2024 • 2 • 11
hiyouga/PaliGemma-3B-Chat-v0.1 • Image-Text-to-Text • 3B • Updated Jul 1, 2024 • 20 • 11
hiyouga/Llama-2-70b-AQLM-2Bit-QLoRA-function-calling • Text Generation • Updated Mar 6, 2024 • 4 • 8
hiyouga/Qwen-14B-Chat-LLaMAfied • Text Generation • 14B • Updated Mar 4, 2024 • 1.93k • 8
hiyouga/Yi-Agent-6B • Text Generation • 6B • Updated Jan 21, 2024 • 11 • 9
hiyouga/Baichuan2-7B-Chat-LLaMAfied • Text Generation • Updated Nov 18, 2023 • 1.94k • 4
hiyouga/Baichuan2-7B-Base-LLaMAfied • Text Generation • Updated Nov 18, 2023 • 1.94k • 7
hiyouga/Llama-2-Chinese-13b-chat • Text Generation • Updated Nov 7, 2023 • 22 • 35
hiyouga/Baichuan-13B-sft • Text Generation • Updated Oct 12, 2023 • 20 • 14
hiyouga/Baichuan-7B-sft • Text Generation • Updated Oct 12, 2023 • 29 • 77
datasets
6
hiyouga/rl-mixed-dataset • Viewer • Updated Jun 11 • 3.6k • 166 • 2
hiyouga/journeybench-multi-image-vqa • Viewer • Updated Apr 14 • 313 • 253 • 4
hiyouga/math12k • Viewer • Updated Apr 14 • 12.5k • 2.18k • 12
hiyouga/geometry3k • Viewer • Updated Apr 14 • 3k • 17.9k • 40
hiyouga/gsm8k • Viewer • Updated Mar 17 • 8.79k • 12
hiyouga/glaive-function-calling-v2-sharegpt • Viewer • Updated Jul 20, 2024 • 101k • 460 • 49