qsnell
/

my_awesome_eli5_clm-model

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

qsnell commited on Oct 10, 2024

Commit

b38b451

·

verified ·

1 Parent(s): 5a6481c

End of training

Files changed (1) hide show

README.md +4 -4

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the eli5_category dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.8346
 ## Model description
@@ -49,9 +49,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 428  | 3.8470          |
-| 3.96          | 2.0   | 856  | 3.8379          |
-| 3.8815        | 3.0   | 1284 | 3.8346          |
 ### Framework versions

 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the eli5_category dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.8283
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 438  | 3.8407          |
+| 3.9608        | 2.0   | 876  | 3.8299          |
+| 3.8812        | 3.0   | 1314 | 3.8283          |
 ### Framework versions