kuririrn/qwen3-4b-agent-trajectory-SFT_alfadm2-prmcons_alformat1 Text Generation • 4B • Updated Feb 26
kuririrn/sft_alfworld_trajectory_dataset_v3to5_admissible_success Viewer • Updated Feb 26 • 1.77k • 15
kuririrn/sft_alfworld_trajectory_dataset_v3to5_admissible_plus_v5extra500 Viewer • Updated Feb 23 • 2.36k • 19