rene-contango
/

test-model-output

Generated from Trainer

Model card Files Files and versions

rene-contango commited on 24 days ago

Commit

3c40e85

·

verified ·

1 Parent(s): e614a86

End of training

Files changed (2) hide show

README.md +4 -4
adapter_model.bin +1 -1

README.md CHANGED Viewed

@@ -97,7 +97,7 @@ xformers_attention: null
 This model is a fine-tuned version of [Qwen/Qwen2-0.5B](https://huggingface.co/Qwen/Qwen2-0.5B) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.4914
 ## Model description
@@ -135,9 +135,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 1.5645        | 0.0336 | 1    | 1.5611          |
-| 1.551         | 0.1008 | 3    | 1.5586          |
-| 1.5317        | 0.2017 | 6    | 1.5365          |
-| 1.2912        | 0.3025 | 9    | 1.4914          |
 ### Framework versions

 This model is a fine-tuned version of [Qwen/Qwen2-0.5B](https://huggingface.co/Qwen/Qwen2-0.5B) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.4915
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 1.5645        | 0.0336 | 1    | 1.5611          |
+| 1.5508        | 0.1008 | 3    | 1.5586          |
+| 1.5311        | 0.2017 | 6    | 1.5354          |
+| 1.2913        | 0.3025 | 9    | 1.4915          |
 ### Framework versions

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8537e087e8d503fe8518defa67ae4993c01b6471cd69f7d10b4a3abc649b49fb
 size 17717130

 version https://git-lfs.github.com/spec/v1
+oid sha256:b55f0cdf62b8196e558e71e4e56fcf67d12205043097228d9bd1ed8d93be1e37
 size 17717130