an-atlas
/

gpt2Horror

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

Metrics Training metrics Community

an-atlas commited on Jul 19, 2023

Commit

a7522b7

·

1 Parent(s): 7ef9a2d

update model card README.md

Files changed (1) hide show

README.md +6 -5

README.md CHANGED Viewed

@@ -1,5 +1,6 @@
 ---
 license: apache-2.0
 tags:
 - generated_from_trainer
 model-index:
@@ -14,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.5347
 ## Model description
@@ -45,14 +46,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 1    | 3.9368          |
-| No log        | 2.0   | 2    | 3.6674          |
-| No log        | 3.0   | 3    | 3.5347          |
 ### Framework versions
-- Transformers 4.30.2
 - Pytorch 2.0.1+cu118
 - Datasets 2.13.1
 - Tokenizers 0.13.3

 ---
 license: apache-2.0
+base_model: distilgpt2
 tags:
 - generated_from_trainer
 model-index:
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 4.3704
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 1    | 4.7786          |
+| No log        | 2.0   | 2    | 4.4947          |
+| No log        | 3.0   | 3    | 4.3704          |
 ### Framework versions
+- Transformers 4.31.0
 - Pytorch 2.0.1+cu118
 - Datasets 2.13.1
 - Tokenizers 0.13.3