tylercross
/

socrates_no_context

Text Generation

Generated from Trainer

4-bit precision

Model card Files Files and versions

tylercross commited on Dec 9, 2023

Commit

51c0e23

·

1 Parent(s): b68a1c0

Upload 7 files

Files changed (3) hide show

README.md +7 -7
adapter_config.json +4 -4
adapter_model.bin +1 -1

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.3316
 ## Model description
@@ -51,12 +51,12 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
 | 2.4833        | 0.13  | 1    | 2.5152          |
-| 2.5615        | 0.27  | 2    | 2.5080          |
-| 2.4974        | 0.4   | 3    | 2.4701          |
-| 2.3926        | 0.53  | 4    | 2.4257          |
-| 2.3646        | 0.67  | 5    | 2.3834          |
-| 2.2345        | 0.8   | 6    | 2.3447          |
-| 2.1912        | 0.93  | 7    | 2.3316          |
 ### Framework versions

 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.3313
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
 | 2.4833        | 0.13  | 1    | 2.5152          |
+| 2.5615        | 0.26  | 2    | 2.5078          |
+| 2.4965        | 0.39  | 3    | 2.4691          |
+| 2.3902        | 0.52  | 4    | 2.4249          |
+| 2.3629        | 0.65  | 5    | 2.3824          |
+| 2.2324        | 0.77  | 6    | 2.3441          |
+| 2.1907        | 0.9   | 7    | 2.3313          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -16,13 +16,13 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
-    "down_proj",
     "v_proj",
     "up_proj",
     "o_proj",
-    "gate_proj",
-    "k_proj"
   ],
   "task_type": "CAUSAL_LM"
 }

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "gate_proj",
     "v_proj",
     "up_proj",
     "o_proj",
+    "q_proj",
+    "k_proj",
+    "down_proj"
   ],
   "task_type": "CAUSAL_LM"
 }

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:beb31d0f2cb78ec193e995e4d798192928a24ab69de356455ef73a2e56d4bafb
 size 335706186

 version https://git-lfs.github.com/spec/v1
+oid sha256:70c0859d3ab66e4c0ca38e9a1d5c4f55c26411dedc2ec00922ba89cc33b33535
 size 335706186