·
AI & ML interests
None yet
Organizations
mingye94/Cross-Care-Qwen3-1.7B-FineTuned-KL-Distill-gender
Text Generation
•
2B
•
Updated
•
8
mingye94/Cross-Care-Qwen3-1.7B-FineTuned-KL-Distill-race
Text Generation
•
2B
•
Updated
•
8
mingye94/Cross-Care-Qwen3-1.7B-FineTuned-KL-Distill
Text Generation
•
2B
•
Updated
•
9
mingye94/Cross-Care-Qwen3-1.7B-FineTuned-KL
Text Generation
•
2B
•
Updated
•
11
mingye94/rebuttal_rm_maxdiff
Text Classification
•
8B
•
Updated
•
8
mingye94/rebuttal_rm_detailed
Text Classification
•
8B
•
Updated
•
8
mingye94/SkyworkFinetunedRM-HHRLHF-1e-5-1epoch
Text Classification
•
8B
•
Updated
•
9
mingye94/3_rules_RA_lr1e-05_epoch3_50K
Text Classification
•
3B
•
Updated
•
8
mingye94/10_rules_RA_lr1e-05_epoch3_50K
Text Classification
•
3B
•
Updated
•
10
mingye94/meta-llama_Meta-Llama-3-8B
Text Generation
•
8B
•
Updated
•
16
mingye94/pku-safeRLHF-baseline-safety-base_flipped
8B
•
Updated
•
6
mingye94/pku-safeRLHF-softlabel-safety-base_flipped
8B
•
Updated
•
7
mingye94/pku-safeRLHF-softlabel-safety-base
8B
•
Updated
•
7
mingye94/pku-safeRLHF-baseline-safety-base
8B
•
Updated
•
5
mingye94/pku-safeRLHF-baseline-safety-instruct
8B
•
Updated
•
5
mingye94/pku-safeRLHF-softlabel-safety-instruct
8B
•
Updated
•
7
mingye94/pku-safeRLHF-softlabel-safety-model
8B
•
Updated
•
16
mingye94/pku-safeRLHF-softlabel-safety-tokenizer
Updated
mingye94/rm_llama3_8B_helpsteer2
8B
•
Updated
•
11
mingye94/llama3-8B-Instruct-lr_5e-07_bsz_1
8B
•
Updated
•
6
mingye94/llama3-8B-Instruct-lr_1e-05_bsz_1
8B
•
Updated
•
6
mingye94/llama3-8B-Instruct-lr_1e-5_bsz_2
8B
•
Updated
•
7
mingye94/meta-llama-Meta-Llama-3-8B-Instruct_lr_1e-05
Updated