rene-contango
/

test-model-output

Generated from Trainer

Model card Files Files and versions

rene-contango commited on Aug 7

Commit

415889b

·

verified ·

1 Parent(s): e847084

End of training

Files changed (2) hide show

README.md +4 -4
adapter_model.bin +1 -1

README.md CHANGED Viewed

@@ -97,7 +97,7 @@ xformers_attention: null
 This model is a fine-tuned version of [Qwen/Qwen2-0.5B](https://huggingface.co/Qwen/Qwen2-0.5B) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.4921
 ## Model description
@@ -135,9 +135,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 1.5645        | 0.0336 | 1    | 1.5611          |
-| 1.5512        | 0.1008 | 3    | 1.5588          |
-| 1.532         | 0.2017 | 6    | 1.5360          |
-| 1.2911        | 0.3025 | 9    | 1.4921          |
 ### Framework versions

 This model is a fine-tuned version of [Qwen/Qwen2-0.5B](https://huggingface.co/Qwen/Qwen2-0.5B) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.4922
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 1.5645        | 0.0336 | 1    | 1.5611          |
+| 1.5509        | 0.1008 | 3    | 1.5592          |
+| 1.5323        | 0.2017 | 6    | 1.5367          |
+| 1.2915        | 0.3025 | 9    | 1.4922          |
 ### Framework versions

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2db26be43a9658ccd360d386d66c7728983135ca3e2424a30cb13d0819af474b
 size 17717130

 version https://git-lfs.github.com/spec/v1
+oid sha256:2dabe9b7ef29317705bef452aa34677283bffa38637d9534a8b1c669ac1cfe3b
 size 17717130