argilla
/

zephyr-7b-spin-iter3-v0

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

dvilasuero HF Staff commited on Mar 12, 2024

Commit

4ac327d

·

verified ·

1 Parent(s): 9e0a2f2

Update README.md

Files changed (1) hide show

README.md +2 -10

README.md CHANGED Viewed

@@ -33,17 +33,9 @@ This model is a fine-tuned version of [argilla/zephyr-7b-spin-iter2-v0](https://
 [argilla/10k_prompts_SPIN_iter3_zephyr_top](https://huggingface.co/datasets/argilla/10k_prompts_SPIN_iter3_zephyr_top) and the
 [argilla/10k_prompts_SPIN_iter2_zephyr_top](https://huggingface.co/datasets/argilla/10k_prompts_SPIN_iter2_zephyr_top) dataset.
-It achieves the following results on the evaluation set:
-- Loss: 0.1099
-- Rewards/real: -2.9181
-- Rewards/generated: -29.6970
-- Rewards/accuracies: 0.9271
-- Rewards/margins: 26.7789
-- Logps/generated: -702.4378
-- Logps/real: -278.1470
-- Logits/generated: -2.8177
-- Logits/real: -2.8051
 ## MT-Bench results

 [argilla/10k_prompts_SPIN_iter3_zephyr_top](https://huggingface.co/datasets/argilla/10k_prompts_SPIN_iter3_zephyr_top) and the
 [argilla/10k_prompts_SPIN_iter2_zephyr_top](https://huggingface.co/datasets/argilla/10k_prompts_SPIN_iter2_zephyr_top) dataset.
+Check [this repo](https://github.com/argilla-io/distilabel-spin-dibt) for full reproducible code using the original SPIN implementation and distilabel.
+If you want to contribute to high quality datasets like this, contribute to the [DIBT prompt collective initiative](https://huggingface.co/spaces/DIBT/prompt-collective-dashboard).
 ## MT-Bench results