andresnowak
/

Qwen3-0.6B-instruction-finetuned-MCQA

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

andresnowak commited on Jun 3

Commit

5635647

·

verified ·

1 Parent(s): 0df1cf3

Update README.md

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -33,6 +33,9 @@ More information needed
 ## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:

 ## Training procedure
+This model was trained with the same methodology as [https://huggingface.co/andresnowak/MNLP_M2_mcqa_model](MNLP_M2_mcqa_model), where we only do a feedforward on the prompt
+we get the last logit token and we do cross entropy loss on that token and the 4 options of the question (so the idea is that we want to maximize the likelihood of the model
+of printing the correct letter to the question)
 ### Training hyperparameters
 The following hyperparameters were used during training: