metadata
base_model: Qwen/Qwen3-0.6B-Base
datasets:
- HuggingFaceTB/smol-smoltalk
library_name: transformers
model_name: MNLP_M2_rag_model
tags:
- generated_from_trainer
- alignment-handbook
- trl
- sft
licence: license
This model is a fine-tuned version of Qwen/Qwen3-0.6B-Base on the ['HuggingFaceTB/smol-smoltalk'] dataset. It has been trained using TRL.