# flan-t5-small-instructiongen

Instead of generating questions from text, generate instructions for LLMs!

This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on the None dataset.
It achieves the following results on the evaluation set:
- Loss: 1.3401
- Rougelsum: 50.338
- Gen Len: 14.0450

## Model description

More information needed

## Intended uses & limitations

This is just a **small** model/example. There is likely to be even better performance with larger models (e.g., [pszemraj/bart-base-instructiongen](https://huggingface.co/pszemraj/bart-base-instructiongen) generalizes better).

Additionally, this was trained on a dataset of **only** instructions+outputs, with the `inputs` filtered out. This means that text of *1) cookies and cream 2) chocolate chip 3) mint chip 4) oreo* will **not** get you *"Rank the following ice cream flavors: oreo, mint chip, chocolate chip, cookies and cream"*.

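A minimal usage sketch with 🤗 Transformers. The repo id `pszemraj/flan-t5-small-instructiongen`, the input text, and the generation parameters (`max_length`, `num_beams`) are illustrative assumptions, not values taken from this model card:

```python
from transformers import pipeline

# Load the fine-tuned checkpoint for seq2seq generation.
# Repo id is an assumption based on the model name in this card.
generator = pipeline(
    "text2text-generation",
    model="pszemraj/flan-t5-small-instructiongen",
)

text = (
    "The mitochondria is the powerhouse of the cell: it converts nutrients "
    "into ATP, the energy currency used by almost all cellular processes."
)

# The model predicts an instruction that could have produced this text.
result = generator(text, max_length=48, num_beams=4)
print(result[0]["generated_text"])
```

Generation settings here are just a starting point; tune them for your inputs.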
## Training and evaluation data

See the linked dataset `pszemraj/fleece2instructions` - it is a filtered/formatted version of `tatsu-lab/alpaca` for generating instructions from arbitrary text.

- Some of the API examples are intentionally weird to demonstrate the generalizability of the model.

## Training procedure