## training

- For inputs, the model was presented with the post title and the post selftext, encoded as: `question: <post title> context: <post selftext>`. You may see better results if queries are posed in this fashion.
- The top two replies were aggregated and presented to the model as the output text.
- Training for longer will be explored, but given that the dataset has 127k examples and the loss flatlines at 0.5 epochs, this should be fairly viable.
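The pair construction above can be sketched as a small helper. This is a minimal, hypothetical sketch (the function names and sample strings are assumptions, not code from this repo): it builds the model input from a post's title and selftext, and the target text by aggregating the top two replies.

```python
# Minimal sketch of the training-pair construction described above.
# Function names, the newline join, and the sample data are illustrative
# assumptions, not code from this repository.

def format_input(title: str, selftext: str) -> str:
    # Encode the post as "question: <post title> context: <post selftext>".
    return f"question: {title} context: {selftext}"

def format_target(replies: list[str]) -> str:
    # Aggregate the top two replies into a single output text.
    return "\n".join(replies[:2])

if __name__ == "__main__":
    model_input = format_input(
        "How do I fix a flaky test?",
        "It only fails on CI, never locally.",
    )
    target = format_target(
        ["Pin your random seed.", "Check for shared state.", "Retry it."]
    )
    print(model_input)
    print(target)
```

At inference time, queries can reuse the same input layout, since that is the format the model saw during training.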