ali-elganzory/open-sci-ref-v0.02-1.7b-fineweb-edu-1.4t-300B-4096 2B • Updated about 16 hours ago • 40
ali-elganzory/open-sci-ref-v0.02-1.7b-nemotron-hq-300B-16384-rope_theta-1M-long_sft_16k Text Generation • 2B • Updated about 20 hours ago • 170
ali-elganzory/ablation-model-fineweb-edu-DPO-Tulu3-decontaminated Text Generation • 2B • Updated Feb 1 • 2
ali-elganzory/ablation-model-fineweb-edu-SFT-Tulu3-decontaminated Text Generation • 2B • Updated Jan 31 • 2
ali-elganzory/llama-3.1-tulu-3-8b-preference-mixture-decontaminated Viewer • Updated Jan 25 • 273k • 12