update model card README.md

README.md

model-index:
- name: prompt-extend
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# prompt-extend

This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unspecified dataset.
It achieves the following results on the evaluation set:
- Loss: 2.0437
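
The card does not yet include a usage snippet. Below is a minimal sketch using the Hugging Face Transformers `pipeline` API, assuming the checkpoint is published under `daspartho/prompt-extend` (the repo id is an assumption, inferred from the demo Space linked in the previous revision of this card); the example prompt is illustrative only.

```python
from transformers import pipeline

# Repo id is an assumption, inferred from the demo Space at
# https://huggingface.co/spaces/daspartho/prompt-extend
generator = pipeline("text-generation", model="daspartho/prompt-extend")

# Give the fine-tuned GPT-2 a short prompt and let it add detail.
result = generator("a castle on a hill", max_new_tokens=40)
print(result[0]["generated_text"])
```
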
## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: cosine
- lr_scheduler_warmup_ratio: 0.1
- num_epochs: 5
- mixed_precision_training: Native AMP
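
For readers reproducing this setup with the Hugging Face `Trainer`, the listed values map onto `TrainingArguments` as sketched below. The learning rate and batch sizes are not shown in this card, so those values are placeholders, not the ones actually used.

```python
from transformers import TrainingArguments

# Sketch only: learning_rate and per_device_train_batch_size are NOT
# recorded in this card; the values below are placeholders.
args = TrainingArguments(
    output_dir="prompt-extend",
    learning_rate=5e-5,             # placeholder
    per_device_train_batch_size=8,  # placeholder
    num_train_epochs=5,             # num_epochs: 5
    lr_scheduler_type="cosine",     # lr_scheduler_type: cosine
    warmup_ratio=0.1,               # lr_scheduler_warmup_ratio: 0.1
    adam_beta1=0.9,                 # Adam with betas=(0.9, 0.999)
    adam_beta2=0.999,
    adam_epsilon=1e-8,              # epsilon=1e-08
    fp16=True,                      # mixed_precision_training: Native AMP
    evaluation_strategy="steps",    # matches the 500-step eval cadence below
    eval_steps=500,
)
```
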
### Training results

| Training Loss | Epoch | Step | Validation Loss |
|:-------------:|:-----:|:----:|:---------------:|
| 6.3538        | 0.32  | 500  | 4.7231          |
| 4.2863        | 0.64  | 1000 | 3.7473          |
| 3.5828        | 0.96  | 1500 | 3.2410          |
| 3.1699        | 1.28  | 2000 | 2.9410          |
| 2.9123        | 1.6   | 2500 | 2.7295          |
| 2.7413        | 1.92  | 3000 | 2.5568          |
| 2.5181        | 2.24  | 3500 | 2.4281          |
| 2.394         | 2.56  | 4000 | 2.3263          |
| 2.3157        | 2.88  | 4500 | 2.2393          |
| 2.1822        | 3.2   | 5000 | 2.1750          |
| 2.091         | 3.52  | 5500 | 2.1231          |
| 2.0489        | 3.84  | 6000 | 2.0844          |
| 1.9873        | 4.16  | 6500 | 2.0609          |
| 1.9335        | 4.48  | 7000 | 2.0487          |
| 1.9307        | 4.8   | 7500 | 2.0437          |
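
As a quick sanity check, assuming the validation loss is the usual mean per-token cross-entropy in nats, the final value of 2.0437 corresponds to a perplexity of exp(2.0437), roughly 7.7:

```python
import math

# Perplexity is the exponential of the mean cross-entropy loss.
final_val_loss = 2.0437
print(f"perplexity = {math.exp(final_val_loss):.2f}")  # ~7.72
```
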
### Framework versions

- Transformers 4.24.0
- PyTorch 1.13.0+cu117
- Datasets 2.7.1
- Tokenizers 0.13.2