Seunghyuk Oh
JakeOh
AI & ML interests: None yet
Models (29)

JakeOh/llama-3.2-1b-gsm8k-step-2-dpo • Text Generation • 1B • Updated • 12
JakeOh/llama-3.2-1b-gsm8k-step-1-dpo • Text Generation • 1B • Updated • 15
JakeOh/llama-3.2-1b-gsm8k-step-0-sft • Text Generation • 1B • Updated • 21
JakeOh/llama-3.2-1b-sft-gsm8k • Text Generation • 1B • Updated • 14
JakeOh/rft-llama-3.2-1b-instruct-gsm240k-k1 • 1B • Updated • 4
JakeOh/rft-finetune-llama-3.1-8b-math • 8B • Updated • 4
JakeOh/rft-finetune-llama-3.2-1b-math • 1B • Updated • 5
JakeOh/finetune-llama-3.1-8b-math50k • 8B • Updated • 6
JakeOh/rft-finetune-llama-3.2-1b-gsm8k • 1B • Updated • 5
JakeOh/rft-llama-3.2-1b-instruct-gsm8k • 1B • Updated • 4

Datasets (29)

JakeOh/gsm8k • Viewer • Updated • 127k • 385
JakeOh/iself-mbpp • Viewer • Updated • 3.06k • 40
JakeOh/rft-llama-3.2-1b-instruct-gsm240k-k1 • Viewer • Updated • 667k • 9
JakeOh/rft-finetune-llama-3.1-8b-math • Viewer • Updated • 182k • 15
JakeOh/rft-finetune-llama-3.2-1b-math • Viewer • Updated • 172k • 16
JakeOh/rft-finetune-llama-3.2-1b-math-k10 • Viewer • Updated • 351k • 8
JakeOh/rft-finetune-llama-3.2-1b-gsm8k • Viewer • Updated • 31.8k • 8
JakeOh/rft-llama-3.2-1b-instruct-gsm8k • Viewer • Updated • 48.7k • 4
JakeOh/star_plus-llama-3.1-8b-math50k-step-3 • Updated • 7
JakeOh/star_plus-llama-3.1-8b-math50k-step-2 • Updated • 7